setting the optimal decision threshold and analysis of impact of

SETTING THE OPTIMAL DECISION THRESHOLD AND ANALYSIS OF
IMPACT OF SAMPLE SIZE ON AUTOMATIC MODULATION
CLASSIFICATION BASED ON SIXTH-ORDER CUMULANTS
VLADIMIR D. ORLIC
“Vlatacom d.o.o.”, Belgrade, [email protected]
MIROSLAV L. DUKIC
Faculty of Electrical Engineering, Belgrade, [email protected]
Abstract: In this paper the optimal value of decision threshold in automatic modulation classification process, based on
the value of normalized sixth-order cumulants, is analyzed. Possibilities of positioning the decision threshold at
numerical values defined by variances of particular sixth-order cumulants (corresponding to observed modulation
formats) are considered. Performances of classification process organized in previously mentioned manner, along with
performances of classification with standard threshold setting approach (where threshold is always positioned in the
middle of successive numerical values of sixth-order cumulants), are presented. Since automatic modulation
classification performance is directly implicated by the size of data sample collected for the processing purposes, the
impact of sample size on optimal threshold setting is also considered. Expected classification performance is verified
through Monte-Carlo simulations.
Key words: automatic modulation classification, sixth-order cumulants, variance, decision threshold.
manner) and make decision from their observed values.
Feature-based algorithms are very simple, with nearlyoptimal performance, when designed properly. Their low
computational complexity recommends these algorithms
for practical applications: among many different
algorithms, those based on higher order statistics are
especially suitable for implementation, as being extremely
simple. Higher order statistics – moments and cumulants
can be used for AMC, with some advantage given to
cumulants because of their robustness on presence of
additive white Gaussian noise (AWGN).
1. INTRODUCTION
Implementation of advanced communication systems for
military applications in a crowded electromagnetic
spectrum is a challenging task: while friendly signals
should be transmitted and received securely, hostile signals
should be located, identified and jammed. Such
applications have emerged the need for intelligent and
flexible solutions, where the automatic recognition of the
modulation of a detected signal is a major task [1]. This
recognition process, commonly named – automatic
modulation classification (AMC), is an intermediate step
between detection and demodulation within a receiver.
AMC has been investigated in US military
communications-related research for years and would be
considered to be integrated with the future communication
standards involving adaptive modulation [2].
Employing normalized fourth-order cumulants as features
of interest in AMC was originally discussed in [3]. In the
same work authors explored the variance of the sample
estimates of normalized fourth-order cumulants, and
concluded that value of this variance directly depends on
the sample size. For AMC algorithm based on normalized
sixth-order cumulants, [4], expressions for the variance of
the cumulants’ sample estimates are derived in [5], and
numerical values for some modulation formats of interest
are calculated. In the same paper the direct dependence of
variance on the sample size is reported, and novel
criterion for mutual comparison between different
cumulant-based AMC algorithms is proposed.
In general, typical AMC approaches can be classified in
two major groups: decision-theoretic (likelihood-based)
and pattern-recognition (feature-based) algorithms.
Decision-theoretic algorithms are based on the likelihood
function of the received signal and the decision is made
through comparing the likelihood ratio with appropriate
threshold. Although these algorithms are optimal in
Bayesian sense, they suffer from significant
computational
complexity.
Pattern-recognition
algorithms, on the other hand, exploit several different
features of received signal (commonly chosen in ad hoc
In this paper we present results of further analysis of
normalized sixth-order cumulant’s variance and some
aspects of its impact on AMC procedure: simulated values
511
while the self-normalized sixth-order cumulant is defined as:
of sample estimate variances, the positions of optimal
comparison thresholds defined by variance values,
algorithm performance dependence on sample size and
some strategies for enhancement of AMC algorithm.
Cˆ 63, x = C63, x /(C21, x )3 .
We adopt the following relationships between the
cumulants of x and the cumulants of y (associated with
received sequence y(n)):
2. AMC ALGORITHM ON THE BASIS OF
SIXTH-ORDER CUMULANTS
The received signal sequence y (n) , corrupted by AWGN
only during propagation, can be represented by:
y ( n) = x ( n) + g ( n) ,
(1)
and g ( n) is AWGN with a zero mean and a variance of
transmitted data sequence x(n) , the second-order
cumulant C21, x = cum( x, x* ) is given by:
2
2
2
+12 E ( x 2 ) E ( x ) + 12 E 3 ( x )
Cˆ 63, x =
2
2
,
(6)
C63, y
(C21, y − σ g2 )3
.
(7)
while calculation of second-order and sixth-order
cumulant of received signal practically comes down on
calculation of mean-values over ensemble of collected
signal samples, and their further combining. If number of
samples is represented with N, equation (7) in practical
realization can be rewritten as:
(2)
be expressed as:
4
C21, y = C21, x + σ g2 .
The noise power σ g2 can be measured at receiving point,
The sixth-order cumulant C63, x = cum( x, x, x, x* , x* , x* ) can
6
(5)
Cˆ 63, x =
σ g2 . For zero-mean random variable x, associated with
C21, x = E ( x ) .
C63, y = C63, x ,
Consequently, we have:
where x(n) stands for transmitted modulated symbols,
C63, x = E ( x ) − 9 E ( x ) E ( x ) +
(4)
(3)
2
3
N
⎛ 1 N 2
⎞
⎛1 N
⎛1 N
1 N
1 N
6
4 1
2⎞
2
2⎞
⎜
⎟
−
⋅
+
⋅
+
y
n
y
n
y
n
y
n
y
n
y
n
(
)
9
(
)
(
)
12
(
)
(
)
12
(
)
∑
∑
∑
⎜ ∑
⎟
⎜ ∑
⎟
⎜ N∑
⎟
N n =1
N n =1
N n =1
⎝ N n =1
⎠
⎝ N n =1
⎠
⎝ n =1
⎠
1 N
2
( ∑ y (n) − σ g2 )3
N n =1
order k with m conjugations. For complex signals with N
samples error variance is given by:
2
2
2
2
− 54m4,2
N var(C63, x ) = [ m12,6 − m6,3
] + 9[ m2,1
(48m4,2 m2,1
Table 1. Theoretical normalized sixth-order cumulants
for some constellations
QPSK
16-QAM
64-QAM
Ĉ63
4.000
2.080
1.797
4
2
+ 96m2,1
− 64m6,3m2,1 ) + m4,2 (9m4,2
+ 16m6,3m2,1 (10)
− 2m8,4 ) + m2,1 (17 m8,4 m2,1 − 2m10,5 )]
As it can be noticed from eq. (9) and (10), error variances
are directly proportional with sample size N, and take
different values for different modulation formats [5].
However, limited precision in numerical calculations is
not the only source of dispersion of higher-order
cumulants’ values: dispersion is also implicated by
unequal number of different symbols in randomly
generated messages, and by the presence of the noise.
3. VARIANCE OF CUMULANT ESTIMATES
Theoretical values of higher-order cumulants in AMC
problems, like those shown in Table 1, represent only
expected values of cumulants; some portion of dispersion
around expected values is unavoidable in practical
calculations. This phenomenon was explored and
described in literature for fourth-order cumulants [3], and
for sixth-order cumulants [5]. The error variance due to
limited precision of calculation of C63, x , for real signals
Adopting the procedure described by eq. (8), decision
making process for the modulation recognition is based
on comparison of obtained values of estimates Cˆ 63, x with
with N samples, is given with:
2
2
2
2
4
− 126m4,2
+ 384m2,1
N var(C63, x ) = [m12,6 − m6,3
] + 9[m2,1
(384m4,2m2,1
2
− 128m6,3m2,1 ) + m4,2 (9m4,2
+ 16m6,3m2,1 − 2m8,4 )
(8)
where mk , m = E[ y k − m ( y * ) m ] represents mixed moment of
In Table 1 the theoretic values of the sixth-order
cumulants
for
some
well-adopted
modulation
constellations are shown.
Constellation
.
(9)
+ m2,1 (25m8,4m2,1 − 2m10,5 )]
512
predefined thresholds. Comparison thresholds are
commonly positioned at the middle of intervals between
expected (theoretical) values that correspond with
particular modulation formats. For example, from Table 1
it can be concluded that comparison threshold between
QPSK and 16-QAM signals should be placed at value
(4+2.08)/2=3.04.
However, it is a fact that this manner of thresholds’
positioning is, by theory, appropriate only if decision is to
be made between values having mutually equal variances,
and while occurring with equal probability, [6]. Statistics
of cumulants is considered to obey Gaussian law [3], but
their variances appear to be mutually unequal, for
different modulation formats. In case of decision making
between Gaussian parameters having different variances,
the position of optimal threshold can be derived from
theoretical condition for minimum probability of error:
−
1
e
Pr{s1}
2πσ 1
(VT − μ1 ) 2
2σ12
−
1
= Pr{s2 }
e
2πσ 2
proportion: for QAM signals this proportion
approximately holds under all considered SNR values,
while for QPSK signal higher sensitivity is shown for
changes of N value at higher SNR, and it converges to
~ 1/ N when the SNR value decreases. For all considered
modulation formats, higher values of SNR at fixed sample
size result with lower variances. This is, also, expected
way of behavior.
Table 2. Cˆ 63, x variance under SNR=20dB
(VT − μ 2 )2
2σ 22
QPSK
,
(11)
where Pr{si } stands for a priori probability of occurrence
of event si , whose statistics is Gaussian with mean μi
and variance σ i2 . Optimal threshold VT can under the
terms of mutually equal a priori probabilities Pr{si } be
calculated from equivalent equation:
−
σ1
=e
σ2
(VT − μ1 ) 2
2σ12
+
(12)
σ 2 μ − σ 2 2 μ1
VT = 1 22
σ1 − σ 22
σ1
+ σ 12σ 2 2 ( μ1 − μ2 ) 2 .
σ2
σ 12 − σ 2 2
±
2.2 ⋅ 10−3
N = 2.000
8.7 ⋅ 10−5
4.2 ⋅ 10−3
4.3 ⋅ 10−3
N = 1.000
3.1 ⋅ 10−4
8.3 ⋅ 10−3
9 ⋅ 10−3
N = 500
1.2 ⋅ 10−3
1.7 ⋅ 10−2
1.9 ⋅ 10−2
N = 250
5.2 ⋅ 10−3
3.8 ⋅ 10−2
4.1 ⋅ 10−2
Table 3. Cˆ 63, x variance under SNR=15dB
which after taking the natural logarithm and solving the
following quadratic equation gives this solution:
2(σ 2 2 − σ 12 )σ 2 2σ 12 ln
2.1 ⋅ 10
64-QAM
−3
2.5 ⋅ 10
QPSK
,
16-QAM
N = 4.000
(VT − μ 2 )2
2σ 22
−5
−5
16-QAM
2.5 ⋅ 10
−3
64-QAM
2.7 ⋅ 10−3
N = 4.000
9.7 ⋅ 10
N = 2.000
2.4 ⋅ 10−4
5 ⋅ 10−3
5.3 ⋅ 10−3
N = 1.000
6.7 ⋅ 10−4
1 ⋅ 10−2
1.1 ⋅ 10−2
N = 500
2 ⋅ 10−3
2 ⋅ 10−2
2 ⋅ 10−2
N = 250
6.4 ⋅ 10−3
4.3 ⋅ 10−2
4.7 ⋅ 10−2
Table 4. Cˆ 63, x variance under SNR=10dB
(13)
QPSK
The sign of square-root term in previous equation should
be chosen in the manner which results with solution
having numerical value from interval ( μ1 , μ 2 ) (under
assumption μ1 < μ 2 ).
Thus, it is of interest to determine overall variance of
normalized sixth-order cumulants, which represents the
product of synergy of several effects: limited numerical
precision, random symbol generation and presence of
noise, simultaneously. Having in mind results described
with equations (9) and (10), it is also of interest to explore
dependence of overall variance on sample size N.
−4
16-QAM
3.8 ⋅ 10
64-QAM
−3
4.3 ⋅ 10−3
N = 4.000
8.3 ⋅ 10
N = 2.000
1.8 ⋅ 10−3
7.9 ⋅ 10−3
8.7 ⋅ 10−3
N = 1.000
3.7 ⋅ 10−3
1.6 ⋅ 10−2
1.7 ⋅ 10−2
N = 500
8.8 ⋅ 10−3
3.3 ⋅ 10−2
3.7 ⋅ 10−2
N = 250
2.1 ⋅ 10−2
6.8 ⋅ 10−2
7.9 ⋅ 10−2
Table 5. Cˆ 63, x variance under SNR=5dB
QPSK
With this goal in mind, a number of computer simulations
is performed, and variances of Cˆ 63, x in AWGN channel,
for modulation formats belonging to the set {QPSK, 16QAM, 64-QAM}, through 2.000 mutually independent
experiments per each modulation format, are calculated
under various values of sample sizes N and various values
of signal-to-noise ratio (SNR). Achieved results are
presented in Tables 2 - 6.
−2
16-QAM
1.5 ⋅ 10
−2
64-QAM
1.5 ⋅ 10−2
N = 4.000
1 ⋅ 10
N = 2.000
2 ⋅ 10−2
3 ⋅ 10−2
3.3 ⋅ 10−2
N = 1.000
4.3 ⋅ 10−2
6 ⋅ 10−2
6.7 ⋅ 10−2
N = 500
8 ⋅ 10−2
1.3 ⋅ 10−1
1.4 ⋅ 10−1
N = 250
1.9 ⋅ 10−1
2.6 ⋅ 10−1
2.9 ⋅ 10−1
Table 6. Cˆ 63, x variance under SNR=0dB
QPSK
Results from Tables 2-6 show that variances of estimates
Cˆ 63, x change with sample size N and SNR values. As it
should be expected, larger values of N result with lower
variances. It is interesting to notice that this change is
been made with approximately following ~ 1/ N
513
−1
16-QAM
2.7 ⋅ 10
64-QAM
−1
2.8 ⋅ 10−1
N = 4.000
2.5 ⋅ 10
N = 2.000
5.1 ⋅ 10−1
5.7 ⋅ 10−1
5.5 ⋅ 10−1
N = 1.000
9.7 ⋅ 10−1
1.1
1.2
N = 500
2.2
2.3
2.4
N = 250
4.8
5.5
5.3
Mutual relations of variances corresponding with particular
constellations represent important result of described tests,
since they provide the information needed for deeper
considerations on optimal threshold values (eq. 13).
may result with improvement in overall complexity and
processing time. Precisely, simulations show that
successful (errorless) classification of QPSK signals can
be achieved even with very small sample sizes. This result
comes from the fact that Cˆ 63, x values for QPSK signals
4. DISCUSSION ON OPTIMAL THRESHOLDS
are distanced from Cˆ 63, x values of QAM signals enough to
From Tables 2 – 6 it can be noticed that variances of 16QAM and 64-QAM signals are mutually very close in
numerical values, for all sample sizes and signal-to-noise
ratio values. This means that optimal thresholds for
classification between 16-QAM and 64-QAM signals lay
at positions very close to the middle of interval between
theoretical values of Ĉ63 for these signal formats (i.e. at
3.04 exactly), for every N and SNR. Numerous
simulations of AMC algorithm on the basis of sixth-order
cumulants, which target these constellations (among
others), show that mutual distinguishing of signals
between 16-QAM and 64-QAM formats has the most
significant impact on overall AMC performance [4,5].
provide appropriate decision-making process even under
significant variances of sixth-order cumulants’ estimates.
On the other hand, mutual distinguishing of 16-QAM and
64-QAM signals requires adopting higher values of N.
Thus, as a strategy defined to provide lower algorithm
processing time (and relax memory requirements), twolevel AMC procedure can be defined in following
manner:
− First, estimate Cˆ 63, x value, according to eq. (8), by using
some relatively low number of samples N1 , and
compare it with “middle of the interval” threshold
between QPSK and 16-QAM signal’s cumulants (equal
to 3.04). If signal is within this step recognized as
QPSK, AMC procedure is over;
Further, it can be concluded that variance of QPSK signal
is significantly lower than variances of 16-QAM and 64QAM signals, for higher SNR values, and becoming
asymptotically equal with them as SNR decreases. So, it
is of interest to consider possibility of improving the
performance by adopting optimal thresholds (eq. (13)) for
the purpose of distinguishing QPSK from QAM signals.
We have tested this possibility in detail and simulations
show that, since they have been located at significant
− If estimated Cˆ 63, x value corresponds with QAM signals’
values, repeat the procedure from eq. (8) with higher
number of samples N 2 in order to provide necessary
precision in classification of QAM signals, and
compare it with “middle of the interval” threshold
between 64-QAM and 16-QAM signal’s cumulants.
distance from Cˆ 63, x values of QAM signals, Cˆ 63, x of
QPSK signals are successfully recognized with 100%
success at all SNR values from 5dB and above, with
“middle of interval” threshold setting. For SNR values
lower than 5dB ambiguity in classification evidently
occurs, but it is mainly caused by the noise, whose strong
presence makes variances of all considered modulation
formats to be approximately equal (Table 6). Thus, it can
be concluded that using optimal thresholds (in the
meaning of eq. (13)) does not lead to performance
improvement with QPSK signals neither, since the impact
of noise which causes overlapping in Cˆ 63, x values makes
these optimal values to be very near to the middle of
interval between theoretical values of cumulants.
Analysis of achieved results and given discussion on this
subject, clear the question of optimal threshold setting in
AMC algorithm on the basis of normalized sixth-order
cumulants. It can be concluded that threshold manipulations
due to variations in variance values do not improve
algorithm performance, and setting thresholds at the “middle
positions” between expected values is the most adequate
solution. We have confirmed given conclusion through large
number of Monte-Carlo experiments, which showed that no
significant improvement can be made by using threshold
values different than middle positions of theoretical intervals
shown in Table 1.
In order to evaluate the procedure given above, we have
simulated this procedure through 2.000 Monte-Carlo
experiments, under the terms of probability of correct
classification ( PCC ) versus signal-to-noise ratio (SNR).
Simulated modulation formats are taken from the set
{QPSK, 16-QAM, 64-QAM}. The simulation of proposed
two-level procedure is realized by using the values of
N1 = 500 for sample size in coarse classification of QPSK
signals in the first step, and N 2 = 2.000 samples for fine
classification of 16-QAM and 64-QAM signals in the
second step. Achieved values of PCC are presented in
Picture 1, along with values of PCC achieved by using the
fixed value of N = 2.000 for the number of processed
samples.
As it can be noticed from the Picture 1, proposed twolevel procedure truly follows the performance of
“classical” AMC approach very closely, at all SNR values
higher than, approximately, 5dB. Thus, it can be
concluded that proposed procedure should be expanded
with a pre-processing step for estimation of received
signal power (which is equal to second-order cumulant
C21, y ) and measurement of noise power σ g2 , and further
continuing with two-level classification if estimated SNR
is larger than 5dB, or using fixed higher value of N
otherwise. Since both estimation of C21, y and mesurement
of σ g2
5. MANIPULATIONS WITH SAMPLE SIZE
are already included in considered AMC
algorithm, described pre-processing step does not involve
any additional complexity in classification process.
Analysis of impact of sample size N on AMC algorithm
performance leads to some interesting conclusions, which
514
In scenario where values N1 = 500 and N 2 = 2.000 are
adopted for two-level classification, in comparison with
approach based on fixed value of N = 2.000 samples,
approximately 25% of processing time is saved at higher
SNR values, in average. Two-level classification with
N1 = 500 and N 2 = 4.000 samples, in comparison with
classical procedure with N = 4.000 sample size, results
with approximately 29.17% savings in processing time, in
average, as practically expected at values of SNR higher
than 5dB.
6. CONCLUSION
In this paper the overall variances of sample estimates of
normalized sixth order cumulants, for complex signal
constellations, under various sample sizes and signal-tonoise ratios, are presented. From empirical analysis it is
concluded that classical “middle of the interval”
thresholds should be used in implementation of
considered algorithm. New two-stage classification
procedure, based on manipulations with sizes of samples
being included in processing, is proposed, and its
appropriateness for adopting in considered algorithm is
verified through simulations. Selection of sample sizes
which are commonly considered in AMC literature, for
proposed two-level classification procedure, results with
significant savings in amount of time necessary for
numerical calculations. Future research in this field
should include multipath channel model, as being much
more realistic than channel with AWGN only.
Picture 1. Probability of correct classification of signals
from the set {QPSK, 16-QAM, 64-QAM} versus SNR: i)
new two-level procedure with N1 = 500 , N 2 = 2.000
(yellow), and ii) classical procedure with fixed value of
N = 2.000 (blue).
It is also interesting to estimate the savings in processing
time which should be expected with proposed two-level
procedure; implementation of proposed procedure can be
organized in two different ways, so we calculate expected
savings for both implementation methods. First, two-level
classification can be realized with mutually independent
calculations of formula from eq. (8) with N1 samples at
the first stage and, (if needed) with N 2 samples at the
second stage. For example, if values N1 = 500 and
N 2 = 2.000 are adopted, this means that statistically
33.3% of calculations is done by using only N1 samples,
while 66.7% of calculations require N1 + N 2 samples.
Comparison should be made with approach of using fixed
value of N = 2.000 samples for processing. If number of
calculations in two-level procedure is marked with CTL ,
and number of calculations in classical procedure with
fixed-number of samples is given by CC , we have:
CTL 1 N1 2 N1 + N 2 11
=
+
=
,
CC 3 N 3 N
12
References
[1] Dobre, O. A., Abdi, A., Bar-Ness, Y., Su, W.: A
Survey of Automatic Modulation Classification
Techniques: Classical Approach and New Trends,
IET Commun., Vol. 1, No. 2, pp. 137-156, April
2007.
[2] Wu, H.-C., Saquib, M., Yun, Z.: Novel Automatic
Modulation Classification Using Cumulant Features
for Communications via Multipath Channels, IEEE
Trans. Wireless Commun., Vol. 7, No. 8, pp. 30983105, August 2008.
[3] Swami, A., Sadler, B. M.:“Hierarchical Digital
Modulation Classification Using Cumulants, IEEE
Trans. Commun., Vol. 48, pp. 416-429, 2000.
[4] Orlic, V. D., Dukic, M. L.: Automatic Modulation
Classification Algorithm Using Higher-Order
Cumulants Under Real-World Channel Conditions,”
IEEE Commun. Lett., Vol. 13, Issue 12, pp. 917-919,
December 2009.
[5] Orlic, V. D., Dukic, M. L.: Properties of an
Algorithm for Automatic Modulation Classification
Based on Sixth-Order Cumulants, Proc. of ICEST
2009 Conf., Veliko Tarnovo, Bulgaria, 2009.
[6] Carlson, A. B., Crilly, P. B., Rutledge, J. C.:
Communication Systems, 4th edition, McGraw-Hill,
2002.
(14)
meaning that in average 8.33% savings in processing time
(number of calculations) can be expected from using twolevel classification procedure. In case when the same
method with N1 = 500 and N 2 = 4.000 parameters is
compared with classical method using fixed value of
N = 4.000 samples, achieved savings in processing time
rise up to 20.83% in average.
Also, two-level classification can be organized in such a
manner, that samples used within the first stage of
classification are also used within the second stage (if
needed), i.e. N1 samples from QPSK recognition are
included in N 2 samples for QAM signals recognition.
Since the number of calculations in eq. (8) exceeds all
other calculations in algorithm by far, ratio described in
eq. (14) becomes:
CTL 1 N1 2 N 2
≈
+
.
CC 3 N 3 N
(15)
515