Robustness of digitally modulated signal features against variation in HF noise model

High frequency (HF) band has both military and civilian uses. It can be used either as a primary or backup communication link. Automatic modulation classification (AMC) is of an utmost importance in this band for the purpose of communications monitoring; e.g., signal intelligence and spectrum management. A widely used method for AMC is based on pattern recognition (PR). Such a method has two main steps: feature extraction and classification. The first step is generally performed in the presence of channel noise. Recent studies show that HF noise could be modeled by Gaussian or bi-kappa distributions, depending on day-time. Therefore, it is anticipated that change in noise model will have impact on features extraction stage. In this article, we investigate the robustness of well known digitally modulated signal features against variation in HF noise. Specifically, we consider temporal time domain (TTD) features, higher order cumulants (HOC), and wavelet based features. In addition, we propose new features extracted from the constellation diagram and evaluate their robustness against the change in noise model. This study is targeting 2PSK, 4PSK, 8PSK, 16QAM, 32QAM, and 64QAM modulations, as they are commonly used in HF communications.


Introduction
Automatic modulation classification (AMC) is the process of identifying modulation type of a detected signal without prior information. This technique has both military and civilian applications, and is currently an important research subject in the design of cognitive radios [1][2][3]. AMC is a complex task especially in a non co-operative environment as in high frequency (HF) communications, where transmission is affected by atmospheric conditions and other transmission interferences [4].
AMC methods are grouped into two categories: likelihood based (LB) and feature based (FB) methods. LB methods have two steps: calculating the likelihood function of the received signal for all candidate modulations, and then using maximum likelihood ratio test (MLRT) for decision-making. In FB methods, features are first extracted from the received signal and then applied to a classifier in order to recognize the modulation type. Most of the recent literatures use the FB methods due to their low processing complexity and high performance [5]. For more details about AMC methods with a comprehensive literature review, the reader is referred to [6]. Figure 1 shows the classification task in a smart radio. The task of the signal detection block is to identify signal transmission, while the AMC contains a feature extractor followed by a classifier. The classifier can be based on fixed threshold as in decision tree methods, or based on pattern recognition (PR) methods as in artificial neural networks (ANNs) and support vector machines (SVM) [7,8]. Most of the features used in literature are based on wavelet [9,10], temporal time domain (TTD) analysis [11][12][13], and higher order cumulants (HOC) [14][15][16]. These features are generally extracted under the assumption that the modulated signals are corrupted by additive white Gaussian noise (AWGN). Although this assumption is valid in many communication environments, recent studies show that HF noise changes between AWG and bi-kappa distributions [17,18]. The effect of these two noise distributions has been taken into account during the design of the AMC algorithms proposed in [19]. The work shows that the change in noise model affects the classification performance, especially at low signal-tonoise ratio (SNR). Therefore, the robustness of commonly used features against variation in noise models needs to be investigated so that more reliable AMC algorithms can be designed for HF signals.
In this paper, we first examine the effect of Gaussian and bi-kappa noise models on wavelet, HOC, and TTD features, when these features are considered for the classification of single carrier modulations commonly used in HF band: 2PSK, 4PSK, 8PSK, 16QAM, 32QAM, and 64QAM [20]. Second, we propose new features based on maximum dissimilarity measures (MDM) in constellation diagram and evaluate their robustness against the change in noise model. Note that the contribution of this article is pertaining to the features extraction stage; hence the results obtained are independent of the classifier being used. However, these results will greatly serve the classifier design stage, as this stage can be based on features that are robust with respect to noise models.
The organization of the article is as follows. 'Signal model' and 'Noise model' sections present signal and channel noise models, respectively. 'Commonly used signal features' section introduces the TTD, HOC, and wavelet based features. 'Proposed features' section presents the proposed features. 'Simulation results' section presents results showing the robustness of the different features against the variation in noise model. 'Conclusion' section presents concluding remarks.

Signal model
The general form of received signal encompassing all modulation schemes under consideration is given by [21]: where C(t)is the complex envelope of modulated signal, n(t) is band limited noise, f c is the carrier frequency, and Re{} denotes the real part. The complex envelope is characterized by the constellation points C k , signal power E, and pulse shaping functionp(t). For Nsymbols with periodicity T, the general form of complex envelope can be expressed as: For MPSK modulation, C k {e -j2πm/M }, where m = 0, 1, ..., m-1. For MQAM modulations, C k a k + jb k , m = 0, 1, ..., (M) 1/2 /2, and

Noise model
Noise model assumed in most of the research related to AMC is AWGN. This research focuses on AMC in HF band, where the AWGN assumption no longer remains valid for all transmission times [17,18]. Instead, the noise varies between AWGN and bi-kappa distributions. The bi-Kappa distribution is characterized by the following probability distribution function: where σ and k are the shaping parameter and tuning factor, respectively. Practical values of these parameters are σ = 46, k = 1.1, and σ = 20, k = 1 [17]. Figure 2 shows the probability distribution function, p(x, k), for different values of σ and k. The figure shows that decreasing the shaping parameter produces a shaper peak and that the bi-kappa distribution approaches the AWG distribution when the tuning parameter is increased. In this work, the parameters of bi-kappa distribution are set to σ = 20, k = 1.
A more realistic noise model can be constructed by passing the bi-kappa noise through a band-limiting filter. The bandwidth of this filter is set to 8γ s where γ s is the symbol rate. This filter is practically used to minimize the transmission bandwidth. Figure 3 shows the constellation diagram of an intercepted 2PSK signal down converted to baseband for different SNR. This figure shows clearly the spiky nature of bi-kappa noise as compared to AWGN, especially at low SNR.

Commonly used signal features
This section gives the general formulas and description of commonly used signal features. Specifically, we consider the TTD, HOS, and wavelet based features.

TTD features
The variations in modulated waveforms can be described by three instantaneous values: frequency, phase, and amplitude [11,12]. All values related to these variations are defined as the TTD features. Two features will be investigated in our study. The first feature is the standard deviation of the absolute value of the centered non-linear component of the instantaneous phase defined as where j NL is the centered non-linear component of the instantaneous phase, t th is the threshold value of the non weak signal, L is the number of samples in j NL .
The second feature is the standard deviation of the absolute value of the normalized-centered instantaneous amplitude; that is, where N s is the number of samples, a cn = a/m a -1,a is the absolute value of the analytic form of the received signal, and m a is its sample mean value.

HOC features
The HOC are used to extract hidden information from non-Gaussian signals. In presence of AWGN, all the HOC are zero for orders greater than two. This makes these features attractive to classify modulated signals corrupted by AWGN. Fourth and sixth HOC considered in this study are defined as follows [14][15][16]: where C 21 is the average power and M pq is the joint moment. The later can be calculated for any values of p and q using the following equation: where x* denotes complex conjugate and E{} is the expectation operation. Table 1 shows the theoretical cumulants for the considered modulation schemes.

Wavelet features
Wavelet transform preserves the time information while providing the frequency information of an analyzed signal. This makes it a good candidate for AMC. As shown in Figure 4, features extraction using wavelet transform passes through three steps: wavelet decomposition using Haar mother waveform, median filtering, and finally calculation of standard deviation [22]. Robustness of wavelet features against noise model has been tested at level three and four.

Proposed features
The PSK and QAM modulations are represented by a constellation diagram in which the modulation symbols are depicted in terms of phase and amplitude variations. This diagram is extracted from the analytic form of the IF signal by multiplication with the complex conjugate of the carrier frequency. Many AMC algorithms are designed using features based on constellation diagram. These algorithms use different classification techniques that include maximum likelihood [23], genetic algorithms [24,25], modified Chi-squared test [26], and subtractive clustering [27]. In this article, we propose a different use of constellation diagram by extracting features based on maximum dissimilarity measures (MDM), firstly to distinguish between different modulation types, such as QAM and PSK signals, and secondly to find the order of a particular modulation. MDM features depend on calculating the dissimilarity between different constellation diagrams after signals normalization. That is, features are extracted from the distance (or dissimilarity measures) between the complex envelope of the received signal and set of reference constellation points for a particular modulation scheme. These reference constellation points are defined by their amplitudes and phases [21]. MDM are computed after normalizing both the received and reference constellation points to their mean values. The dissimilarity function is defined as [28]:  where d is the Euclidian distance between the complex envelope of intercepted signal × and reference constellation points p. For feature extraction, the signal × is randomly generated at a particular SNR. A dissimilarity vector d, whose entries are the distance between a randomly generated constellation point of x and M reference constellation points p, can be obtained. The element of maximum value of vector d is averaged over several independent runs, and then selected as the desired feature. In practice, the mean and/or standard deviation of d max (x, p) will have values based on the noise level. Table 2 shows five proposed features related to the MDM, each of which is responsible for discriminating a specific modulation type or modulation order. As shown in Table 1, the first feature d 1 is used to discriminate between QAM and PSK signals, while d 2 is used to discriminate between 2PSK and other PSK signals of higher orders. For further details see Table 2. Effect of noise model on the proposed features will be discussed in the next section.

Simulation results
To evaluate the robustness of presented features, all the modulations schemes under test were generated in presence of band-limited AWGN and bi-kappa noise, where the bandwidth of the band-limiting filter is 8γ s ; this process is practically used to avoid high bandwidth

Standard Deviation
Median Filtering

Wavelet Decomposition
Level (4) and (3) Extracted Features Received Signal Figure 4 Steps for wavelet features extraction.    transmission [13]. SNR is adjusted by multiplying the output noise by the following factor: where E, N 0 , SNR are the signal power, noise power, and desired SNR, respectively. All constellation points are normalized to zero mean and unity variance. The simulation parameters are given in Table 3.
For evaluation purposes, we measure the absolute value of the percentage deviation of each feature when noise model is changed from AWGN to bi-kappa. This percentage is evaluated using SNR ranging between 0 and 30 dB, and is defined as follows where F AWGN and F Bi-kappa are the values of feature under consideration computed in the presence of AWGN and bi-kappa noise, respectively, at a particular SNR. Figures 10,11,12,13,14,and 15 show the results, averaged over 100 independent realizations, for the following set of modulations: 2PSK, 4PSK, 8PSK, 16QAM, 32QAM, and 64QAM.
The above figures show that at SNR <10 dB, TTD and MDM are more robust than HOC against the change in HF noise model. It is true in general that η decreases as SNR increases. However, for MPSK signals, the instantaneous amplitude feature has lower deviation for SNR <30 dB. This is intuitively not surprising because the difference between F AWGN and F bi-kappa relative to F AWGN in this SNR range is smaller than that of higher SNR values. Another observation is that the wavelet based features have maintained almost the same values of η for all considered modulations. In addition, the proposed MDM have shown excellent performance in the sense that they have the lowest deviation as compared to other features.

Conclusions
In this article, we have investigated the robustness of four features categories for the classification of digitally modulated signals in the presence of HF noise models; AWGN and bi-kappa noise. Specifically, the TTD, HOC, wavelets, and MDM features are considered, where the last feature is proposed in this work. It has been shown through computer simulations that HOC are sensitive to the change in noise model especially at low SNR (<10 dB), while TTD, wavelets, and MDM show good robustness (η < 25%) in the investigated range of SNR. Note