Open Access

A novel pre-processing and adaptive statistical threshold for sphere detection in MIMO systems

EURASIP Journal on Wireless Communications and Networking20132013:275

Received: 10 May 2012

Accepted: 12 November 2013

Published: 5 December 2013


In this paper, an efficient sphere detection (SD) scheme for multiple-input multiple-output (MIMO) systems is presented. The proposed SD scheme can substantially reduce computational complexity by introducing a pre-processing scheme and an adaptive threshold. In particular, our proposed pre-processing scheme is based on an iterative successive interference cancelation (SIC). The proposed adaptive threshold is used to judge whether a traversed path is correct or not and obtained by analyzing the distributions of the correct and erroneous estimates which have the minimum error pattern among all the SD searching paths. Simulation results show that our proposed scheme offers a substantial complexity reduction in terms of the number of visited nodes while keeping a significantly small performance degradation compared to conventional approaches.

1 Introduction

The maximum likelihood (ML) detection is considered as an optimal solution for achieving the minimum error probability of multiple-input multiple-output (MIMO) systems. However, the ML detection, also a well-known non-deterministic polynomial-time (NP)-complete problem, comes at an expense of computational complexity. To overcome this shortcoming, several suboptimal detection algorithms, such as zero forcing (ZF), minimum mean-square error (MMSE), and successive interference cancelation (SIC), have been proposed. Although these algorithms have lower complexity than ML, the achievable performance is far below the performance of the ML receiver.

Sphere detection (SD) algorithm is a tree search strategy that can achieve a similar performance to that of ML and is widely considered to be the most promising approach for MIMO detection. It has polynomial expected complexity for problems with moderate size and can be efficiently used for moderate signal-to-noise ratios (SNRs) [1, 2]. However, the expected complexity is exponential in the number of antennas and constellation size for fixed SNRs [2], and it still requires a large amount of computations compared to MMSE- and ZF-based algorithms, especially for high numbers of antennas and large constellation sizes. Hence, many efforts have been made to modify the SD algorithm to further reduce the computational complexity such as the Schnorr-Euchner (SE) [3] strategy, channel matrix pre-processing (ordering scheme) [46], modification of the lattice representation [7], probabilistic tree pruning strategy [8], and k-best algorithm [9]. All the schemes can improve SD efficiency to some degrees while still maintaining a large amount of computations.

Motivated by all of the above, in this paper, we propose an efficient ordering scheme and adaptive threshold to decrease SD complexity while keeping the system performance satisfactory. An ordering scheme based on SIC which the authors call sorted SIC (SSIC) ordering is proposed. Then, an adaptive threshold which can be seen as the extension of the threshold used in fixed-complexity SD (FSD) [10] is developed. Simulation results show that the proposed scheme combining SSIC ordering with adaptive threshold results in almost minimum complexity (in terms of the number of visited nodes) compared to other existing approaches with neglected performance degradation.

Throughout this paper, the following notations are adopted. Bold uppercase and lowercase letters stand for matrices and vectors, respectively. () T and ()−1 describe the transpose and inverse of a matrix, respectively. Furthermore, and || denote the norm and absolute operator, respectively, and E [ ] is the expectation operator. In addition, () i means the i th row for a matrix or the i th element of a vector.

2 System model and sphere detection

Consider a MIMO system with N transmit and M receive antennas where MN. The input-output relationship can be expressed as
y = H s + w ,
where s = s 1 , s 2 , , s N C N is an N-dimensional transmitted complex vector whose entries s i ,i=1,2,…,N, are drawn from some quadrature amplitude modulation (QAM) constellation with the real and imaginary parts being integers; y = y 1 , y 2 , , y M C M is an M-dimensional received complex vector; H C M × N denotes the M×N MIMO channel matrix; and w C N is the additive white Gaussian noise (AWGN) vector. The matrix H is defined by H= [ h i j ] for 1≤iM and 1≤jN, where h i j , the i th row and j th column entry of H, is assumed to be an independent and identically distributed (i.i.d.) circular symmetric complex Gaussian random variable with 0 mean and 1 variance. The tuples of N-dimensional Gaussian noise vector w are i.i.d. circular symmetric complex Gaussian random variables with 0 mean and σ2 variance. Herein, the SNR at the receiver is defined as
SNR = E s 2 E w 2 = E s 2 N σ 2 .
For simplicity, the complex-valued model given in (1) is usually transformed into a real-valued model, where the dimension is doubled such as m=2M and n=2N. As such, (1) can be rewritten as
y I y = H I H I H H s I s + w I w ,
where () and I() denote the real and imaginary parts of complex numbers, respectively. Since all the complex form of (1) can be written as (3), we assume that (1) is a real-valued model without loss of generality in the following. Suppose perfect channel knowledge is available at the receiver, the ML detection problem can be formulated as
s ML = arg min s ̂ D y H s ̂ 2 ,
where denotes the Euclidean norm and D = A n and A is the signal constellation set, e.g., A = { 3 , 1 , 1 , 3 } for 16QAM. For a general H, this problem is known to be NP-hard. Sphere decoding solves (4) by searching only over those points that satisfy a constraint of the form
y H s ̂ 2 d 0 2 .
In other words, the SD algorithm only searches the lattice points inside the hypersphere centered at the received vector with radius d0 instead of searching all lattice points. By applying QR decomposition to H, (5) can be written as
y Q R 0 m n × n s ̂ 2 = Q 1 T y R s ̂ 2 + Q 2 T y 2 d 0 2 ,
where R is an n×n upper triangular matrix, Q= [ Q 1 Q 2 ] is an m×m orthogonal matrix, and Q 1 and Q 2 represent the first n and last mn orthonormal columns of Q, respectively. Suppose radius d0 is sufficiently large so that the ML solution lies in the hypersphere, and let d 2 = d 0 2 Q 2 T y 2 and z = Q 1 T y = Rs + Q 1 T w ; thus, sphere decoding solves (4) by
s ML = arg min s ̂ D z R s ̂ 2 = arg min ŝ j A i = n 1 z i j = i n R ij ŝ j 2 d 2 ,
where R i j denotes the i th row and j th column tuple of matrix R. From (7), we can see that sML can be solved recursively by following a similar fashion to that in the SIC technique. Specifically, we use a constrained tree search through multiple levels, starting from ŝ n down to ŝ 1 , as shown in Figure 1.
Figure 1

Tree search example for a 16QAM system.

3 Proposed scheme

In this section, we propose the SSIC ordering scheme to significantly improve the ML performance using the SD algorithm. In addition, we further reduce the SD complexity by developing a new adaptive threshold to judge whether an estimate is correct or not, attempting to output the good estimate as early as possible.

3.1 Proposed pre-processing

It has been shown that the SD complexity is highly sensitive to the order of the columns of the channel matrix, which is dependent on both the channel matrix and the received signal. Reordering the channel matrix columns is a simple and efficient pre-processing to reduce the SD complexity while maintaining reliability of the system performance. Some channel matrix reordering approaches, such as V-BLAST ordering [4], balanced sorted QR decomposition (BSQRD) ordering [5, 9], and gradient-based (GB) ordering [6], have been proposed to improve the SD efficiency. Good ordering schemes should exploit properties of the channel matrix and the received signal as fully as possible. Among these ordering approaches, ZF is the only simple and efficient reference for ordering. However, the reliability of ZF is highly sensitive to the noise component, which is amplified by the row norm of the pseudo-inverse of the channel matrix as
H z = s + H v ,
where H =(H T H)−1H T is the pseudo-inverse of channel matrix H, and v = Q 1 T w is an i.i.d. noise vector with the same distribution as w. Therefore, the i th tuple of ZF solutions is given by
H z i = s i + H i v , i = 1 , 2 , , n ,
where (H ) i denotes the i th row of matrix H . It is shown that each component of the ZF solution vector is detected out of various noise levels which is modified by (H ) i . Motivated by the observation, we define the signal reliability:
reliability s i = H z i s i H i ,
where s i A . Based on (10) and due to the similar operation between SD and SIC, the proposed SSIC ordering is described as follows (for i=0,1,…,n−1):
  1. 1.

    Set H 0=H, z 0=z, and N = 1 , 2 , , n .

  2. 2.

    Calculate the matrix H i , where H i is the pseudo-inverse of the i th iteration channel matrix H i .

  3. 3.

    Calculate the ZF solution vector s i zf = s i zf 1 , s i zf 2 , , s i zf n i T = H i z i , where z i is the i th iteration received signal, and for each element s i zf j of s i zf , find s j ( 1 ) A such that s j ( 1 ) s i zf j s j ( ) s i zf j holds for any =2,3,…,L, where s j ( ) A and L is the constellation size.

  4. 4.

    For each element s i zf j , find out k ̂ i = arg max j = 1 , , n i s j ( 1 ) s i zf j H i j , and k i = N k ̂ i , the k ̂ i th element in set , is the corresponding column index of channel matrix H, and let N = N k ̂ i , where denotes the set subtraction.

  5. 5.

    Cross the k ̂ i th column out of H i to be H i+1, let z i + 1 = z i h k i s k i 1 , i=i+1, and repeat steps 2 to 5 until i=n−1, where h k i , i = 0 , 1 , , n 1 is k i th column of H.

  6. 6.

    The columns of the channel matrix are reordered as H = h k n 1 , h k n 2 , , h k 1 , h k 0 .


3.2 Proposed adaptive threshold

In the tree searching, each full traversed path metric is calculated by
C = z R s ̂ 2 = R s s ̂ + v 2 .
When the estimate is correct, i.e., s ̂ = s , C=v2 is a central chi-square-distributed random variable with n degrees of freedom denoted as χ n 2 . In contrast, the metric for the case of s ̂ s is the non-central chi-square-distributed random variable with n degrees of freedom. Given a certain error pattern, if the Hamming distance between signal vector s and estimated vector s ̂ is K, then s and s ̂ differ at K places denoted as i1,i2,…,i K . In this case, the metric C now turns to be C given by
C = R s s ̂ + v 2 = k = 1 K r i k s i k ŝ i k + v 2 ,
where r i k is the i k th column of R. Obviously, k = 1 K r i k s i k ŝ i k + v is a Gaussian vector with the mean vector k = 1 K r i k s i k ŝ i k and the same covariance matrix as v. Thus, C is the non-central chi-square-distributed random variable with n degrees of freedom denoted as χ n,γ22, where the non-central parameter γ2 is shown as
γ 2 = k = 1 K r i k s i k ŝ i k 2 .
Since the probability density function (PDF) of the metric for correct path is different from that of the metric for erroneous path, the intersection point for PDFs of χ n 2 and χ n,γ22 can be set as the optimum threshold whether an estimate is correct or not. Among all the error patterns, the minimum one is that there exists one path which has Hamming distance 1 to the correct path, say, they differ at the site, and the corresponding column of R exactly has the minimum norm. Therefore, we select the optimum threshold to be the intersection point for PDFs of χ n 2 and χ n,γ22, which gives the minimum non-central parameter as
γ min 2 = min i = 1 , 2 , , n r i 2 × d min 2 ,
where d min 2 = min s , ŝ A , s ŝ s ŝ 2 is the minimum squared distance of two constellation points. Based on the non-central parameter, the threshold can be approximately linearly calculated by [10]
T = α × γ 2 + n × σ 2 ,
where α is a constant coefficient depending on the degrees of freedom, which can be calculated offline by numeric computation since the intersections for PDFs of χ n 2 and χ n,γ22 almost lie on a line if the non-central parameters are not very large, as shown in Figure 2 which indicates the PDF for central and non-central χ2 of 8 degrees of freedom. In this way, α can be obtained, for instance, as 0.3465, 0.366, and 0.4082 for the case where the degrees of freedom are 4, 8, and 16, respectively. The proposed threshold based on minimum error pattern (MEP-T) is applied when the SD searching reaches level n (leaf nodes). Based on our SSIC ordering scheme and proposed threshold, our SD search process can be described as follows:
  1. 1.

    Initialize radius d.

Figure 2

PDF for central and non-central distribution. PDF for central and non-central χ2 of 8 degrees of freedom.

  1. 2.

    Perform SSIC precessing algorithm to reorder the columns of channel matrix H.

  2. 3.

    Apply QR decomposition to the reordered channel matrix, i.e., H=Q R.

  3. 4.

    Calculate threshold T by (15).

  4. 5.

    Normally run a depth-first constrained tree search through n levels.

  5. 6.

    Compute the whole accumulated metric C through n levels by (11), and compare C with threshold T. Terminate the SD decoding process and output the solution if CT holds.

  6. 7.

    Otherwise, compare C with radius d. If Cd, update radius d as d=C and repeat steps from 5 to 7 until a path, which is the first one such that CT holds or has the minimum accumulated metric C, is found.


4 Simulation results

In this section, we display the comparisons of system performance in terms of symbol error rate (SER) and the computational complexity in terms of the average numbers of visited nodes for 4×4 and 6×6 MIMO systems with 16QAM and 64QAM modulation. As can be observed from Figure 3, SD with our SSIC ordering and MEP-T show very small SER degradation compared to the original SD. However, a considerable complexity reduction can be observed from Figures 4, 5, 6, and 7. It is clearly shown that our SSIC ordering itself is much more efficient than other conventional approaches such as BSQR, GB, and V-BLAST ordering. Moreover, combining SSIC ordering with MEP-T can further reduce the complexity and offer almost fixed complexity in average. Specifically, for SNR = 5 dB, our proposed scheme yields a little bit more than 8 visited nodes for a 4×4 system with 16QAM (Figure 4) and 64QAM (Figure 5) and a little more than 12 visited nodes for 6×6 16QAM (Figure 6) and 64QAM (Figure 7), whereas other approaches require at least 25 visited nodes or even higher.
Figure 3

SER performance comparison for 16QAM and 64QAM systems.

Figure 4

Complexity comparison for a 4×4 16QAM system.

Figure 5

Complexity comparison for a 4×4 64QAM system.

Figure 6

Complexity comparison for a 6×6 16QAM system.

Figure 7

Complexity comparison for a 6×6 64QAM system.

Intuitively, the reason why it has such a low complexity in average can be investigated through the behavior of the SSIC ordering and MEP-T threshold. The SSIC ordering leads to the case that the lattice point which has the most possible to be the solution has the higher privilege to be searched. Moreover, the threshold is set properly and can be considered that it constrains a hypersphere area in which all the lattice points (corresponding to the full path in the searching tree) are the ‘ML solution.’ Since the real ML solution has the minimum metric, it certainly lies in the area. Once any one lattice point in the area has been searched, the searching process terminates. As the SNR increases, the number of lattice points in the area goes down, and the probability of obtaining the real ML solution goes larger and larger. That is why the complexity is low while having a negligible performance loss at the high-SNR region under our scheme. At the low-SNR region, the area contains more lattice points, which leads to an incorrect estimate with a larger probability. However, the errors given in this way do not affect the performance significantly because even the real ML solution yielded by the exhaustive search at the low-SNR region also gives the wrong estimate. Therefore, in our SSIC with MEP-T scheme, the correct and incorrect estimates are well distinguished at the high-SNR region, and the complexity reduction is preferred at the low-SNR region instead of improving a little performance while increasing enormous complexity.

5 Conclusions

In this paper, we propose an efficient SD algorithm by jointly combining the SSIC ordering scheme and the adaptive threshold MEP-T. In particular, the SSIC ordering scheme is based on the iterative SIC and MEP-T is obtained by analyzing the PDFs of the correct and erroneous estimates, which have the minimum error pattern among all the SD searching paths. Simulation results show that our SSIC ordering scheme is more efficient than BSQR, GB, and V-BLAST by reducing the computational complexity in terms of the number of visited nodes. More importantly, combining SSIC ordering with MEP-T can further reduce the complexity and offer almost a fixed average complexity at an expense of an unnoticeably small performance degradation compared to conventional methods.



The authors would like to thank the Associate Editor and the reviewers for their comments and suggestions which help improve the quality and clarity of this paper. This work was supported by Zhejiang Natural Science Foundation (Y1090935) and National Natural Science Foundation of China (61379027).

Authors’ Affiliations

College of Communication Engineering, Hangzhou Dianzi University
School of Information Engineering, Hangzhou Dianzi University
College of Information Engineering, China Jiliang University


  1. Jalden J, Ottersten B: An exponential lower bound on the expected complexity of sphere decoding. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Montreal: IEEE; 2004:393-396.Google Scholar
  2. Jalden J, Ottersten B: On the complexity of sphere decoding in digital communications. IEEE Trans. Sig. Proc 2005, 53(4):1474-1484.MathSciNetView ArticleGoogle Scholar
  3. Schnorr CP, Euchner M: Lattice basis reduction: improved practical algorithms and solving subset sum problems. Math. Program 1994, 66(2):181-191.MathSciNetView ArticleGoogle Scholar
  4. Wolniansky PW, Foschini GJ, Golden GD, Valenzuela RA: V-BLAST: an architecture for realizing very high data-rates over the rich-scattering wireless channel. In Proceedings of the URSI International Symposium on Signals, Systems, and Electronics. Piscataway: IEEE; 1998:295-300.Google Scholar
  5. Dai Y, Yan Z: Memory-constrained tree search detection and new ordering schemes. IEEE J. Sel. Topic Sig. Process 2009, 3(6):1026-1037.View ArticleGoogle Scholar
  6. Trujillo R, Garcia VM, Vidal A, Roger S, Gonzalez A: Gradient-based ordering for MIMO decoding. In Proceedings of the IEEE International Symposium on Signal Processing and Information Technology (ISSPIT). Ajman: IEEE; 2009:5-8.Google Scholar
  7. Azzam L, Ayanoglu E: Reduced complexity sphere decoding via a reordered lattice representation. IEEE Trans. Commun 2009, 57(9):2564-2569.View ArticleGoogle Scholar
  8. Lei S, Xiong C, Zhang X, Yang D: Adaptive control of surviving branches for fixed-complexity sphere decoder. In Proceedings of the IEEE 71st Vehicular Technology Conference. Taipei: IEEE; 2010:1-5.Google Scholar
  9. Wu X, Dai Y, Yan Z, Wang Y: Improving the reliability of the k-best algorithm for MIMO detection with ordering. In Proceedings of the Wireless and Optical Communications Conference (WOCC). Shanghai: IEEE; 2010:1-5.Google Scholar
  10. Wu X, Thompson J: Accelerated sphere decoding for multiple-input multiple-output systems using an adaptive statistical threshold. IET Signal Process 2009, 3(6):433-444. 10.1049/iet-spr.2009.0028MathSciNetView ArticleGoogle Scholar


© Cao et al.; licensee Springer. 2013

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.