System value-based optimum spreading sequence selection for high-speed downlink packet access (HSDPA) MIMO
EURASIP Journal on Wireless Communications and Networking, volume 2013, Article number: 74 (2013)
This article proposes the use of system value-based optimization with a symbol-level minimum mean square error equalizer and successive interference cancellation, which achieves a system value upper bound (UB) close to the Gaussian UB for the high-speed downlink packet access system without incurring any significant additional computational cost. It is shown that by removing multi-code channels with low gains, the available energy is used more efficiently, and a higher system throughput, close to the system value UB, is observed. The performance of the developed method is comparable to that of the orthogonal frequency division multiplexing-based long-term evolution scheme, without the need to build any additional infrastructure. Hence, it reduces the cost of the system to both operators and consumers without sacrificing quality.
Wireless communication systems known as multiple-input multiple-output (MIMO) systems, which have multiple transmit and receive antennas, can be used to exploit the diversity and the multiplexing gains of wireless channels to increase their spectral efficiency. As an extension to Shannon’s capacity , the MIMO channel capacity bound was obtained by Foschini and Gans  and Telatar  independently. Assuming that perfect channel state information (CSI) is available at the transmitter, the MIMO system capacity upper bound (UB) can be obtained using the eigen modes of the MIMO channel matrix by performing water-filling (WF) over the spatial sub-channels. An important MIMO system design consideration is to operate the system close to its capacity UB. The objective of this article is to show how the high-speed downlink packet access (HSDPA) MIMO system can operate close to its capacity UB.
The third generation partnership project (3GPP) has developed the HSDPA system, given in the Release 5 specification  of the Universal Mobile Telecommunications System, as a multi-code wide-band code division multiple access (CDMA) system. To further increase the data rate, the HSDPA system introduced new features  such as adaptive modulation and coding and fast scheduling. The standardization of the Dual Stream Transmit Diversity (D-TxAA) HSDPA MIMO system for a single-user in 3GPP Release 7  further improved the downlink throughput without requiring a new spectrum or any additional bandwidth.
In , measurements are carried out to evaluate the performance of the standardized 3G HSDPA MIMO system with a CDMA transmission. It is shown that the current systems are utilizing only about 40% of the available downlink capacity. The capacity curve is approximately 10 dB away from the capacity UB  at high signal to noise ratios. There is an opportunity to improve the HSDPA system capacity, when operating over frequency selective channels, by enhancing the HSDPA MIMO standard of the equal energy allocation scheme as is specified in .
The frequency selectivity problem, which causes a large drop in throughput for the HSDPA due to the inter-symbol interference (ISI) problem, is not a major problem for the orthogonal frequency division multiplexing (OFDM)-based systems [long-term evolution (LTE) advanced and WiMAX] as they use a guard period to deal with the ISI problem. If the throughput reduction problem is not solved in the HSDPA system, the OFDM-based systems will have the upper hand over the HSDPA system in urban environments. The HSDPA single-input single-output (SISO) system has been the main focus of the study in , which provides tools to combat frequency selectivity, bringing the HSDPA SISO performance close to that of the OFDM-based systems. Should the ISI problem be solved for the HSDPA MIMO-based systems by using throughput optimization methods, the current HSDPA MIMO system would achieve throughputs close to those of LTE advanced without the need to change the whole infrastructure. This is the focus of the current investigation.
2 Current investigation and related work
The downlink throughput optimization for the HSDPA multi-code CDMA system  considers the signature sequence and the power allocation for downlink users. 3GPP standardized an approach to spread the transmission symbols by using a given fixed set size of orthogonal variable spreading factor (OVSF) signature sequences. A MIMO system requires a signature sequence set size higher than the given single set of OVSF signature sequences available for a SISO system. 3GPP standardized a method to increase the OVSF set size by multiplying the given set with precoding weights and then concatenating the weighted sets of the spreading sequences. Each concatenated spreading sequence is used to transmit one symbol and is orthogonal to the remaining set of spreading sequences available at the transmitter for the transmission of other symbols. However, the spreading sequences’ orthogonality is lost at the receiving end after transmission over frequency selective multipath channels. In [11, 12], it is proposed that a linear minimum mean square error (MMSE) equalizer followed by a de-spreader could be used to restore partial orthogonality between the receiver de-spreading and the matched filter sequences in the detection process after receiving signals transmitted over a multipath channel. Recent developments have shown that linear MMSE equalizers suffer from a self-interference (SI) problem caused by ISI and multiple access interference, when operating over multipath channels.
SI reduces the system throughput performance, but good receiver design will minimize the degradation caused by the SI. When encountering SI, various versions of interference cancellers could be used in conjunction with non-optimal receivers to improve the system throughput for the HSDPA system over frequency selective parallel channels. In , it is shown that a successive interference cancellation (SIC) scheme performs better than a parallel interference cancellation scheme, when the signal-to-noise ratio (SNR) differs over each frequency selective parallel channel. The works reported in [14–16] focus on the use of linear MMSE equalizers and SIC to reduce the overall SI.
A two-stage SIC detection scheme with transmitter power optimization is examined in [16, 17] to improve the throughput performance for multi-code downlink transmission. In , the power at the transmitter and a two-stage SIC receiver are jointly and iteratively optimized for a multi-code MIMO system. However, at each iteration of the SIC, the equalizer coefficient and the power allocation calculations require an inversion of a covariance matrix for the received signal. The dimension of the covariance matrix is usually large and, as such, the iterative power allocation, the linear MMSE equalizer and the SIC implementations at the receiver become computationally expensive.
The focus of the article is on an HSDPA MIMO-based radio downlink system, which has a number of parallel SISO or MIMO frequency selective channels over which data are transmitted. The data are represented by a number of data symbols, which are spread by a group of spreading sequences when using the HSDPA system either with or without a SIC scheme. A set of signature sequences generated from the OVSF codes with precoding, as specified in the 3GPP Release 7, will be considered. A receiver with a symbol-level linear MMSE equalizer will be examined to jointly optimize the transmission energy allocation and the receiver for a single user system either with or without a SIC.
At the receiver each spreading sequence has a system value λ k , which is associated with the SNR γ k at the output of each de-spreading unit. The system value λ k for each spreading sequence depends on the transmission multipath channel and also on the availability of the SIC scheme. The implementation of each HSDPA MIMO system can be a non-SIC scheme as shown in Figure 1 or a SIC-based receiver scheme as shown in Figure 2. The non-SIC and the SIC-based receivers have different ways of determining the system values for a given number of spreading sequences, a given frequency selective multipath channel and a total transmission energy. This article will outline MIMO transceiver structures for the non-SIC and SIC-based receivers to introduce the system value concept. The system value UB for the HSDPA MIMO system will be presented. This system value capacity UB is close to the capacity of an additive white Gaussian noise channel. It will also be shown how the system values will be used to determine the total transmission rates for both the non-SIC and the SIC-based SISO and MIMO systems to maximize the total transmission rate.
The objective of the total transmission rate maximization for a given total number of spreading sequences will be to bring the downlink throughput close to the system value UB. This will be achieved by retaining the spreading sequences with the highest system values for a given total received SNR corresponding to a given total transmission energy E T . A given number of sequences will be ordered so that the corresponding system values are used at the transmitter in ascending order. The optimum number K∗ of signature sequences will be determined to select the first K∗-ordered signature sequences, which maximize the transmission throughput. The receivers will operate in a sequence, where the detection is ordered in the descending order of the corresponding system values for the SIC-based systems.
As shown in , the WF optimization is generally used for parallel channels with different sub-channel gains to provide optimum sub-channel selection, energy distribution, and also channel ordering. The iterative WF sum-capacity optimization is extensively examined in [20–23] and is proven to converge to the sum capacity UB of the multiple-access channel  to provide an UB for non-discrete rates. Other sub-channel removal methods have been studied in [24–26] to determine the number of active data streams. In , the eigen decomposition of the covariance matrix is used to isolate the “bad” data streams so that the sum MSE is minimized. In , it is suggested that low signal-to-interference and noise ratio (SINR) streams will be switched off to focus the available power on the remaining streams during the iterative power allocation process. In , the removal of sub-channels is proposed to improve the capacity when the rounding of the discrete rate does not improve the system throughput. The WF and channel removal schemes do not use the system value concept for signature sequence selection nor use rate adjustment to maximize the total throughput.
In this article, three bit rate adjustment methods will be considered with the appropriate energy allocation schemes. These methods will be applicable to both SIC and non-SIC-based receivers, when using discrete and non-discrete rates. Initially, an iterative WF algorithm will be proposed with a sub-channel removal for the selection of signature sequences. The system values will be used to maximize the throughput for non-discrete rate allocation by accounting for the channel SINRs corresponding to the received signature sequences instead of using only the channel gains to find the water levels. When using discrete rates the signature sequence selection scheme will be further extended to optimize the total rate for the HSDPA system downlink. The system values will be used to select an optimum number of spreading signature sequences from a given total number of sequences without any prior energy allocation. The chosen optimum number of sequences will be loaded with discrete rates using both the equal SINR allocation methods proposed in this article and the equal energy allocation schemes as specified in the current HSDPA standard. The equal SINR and energy loading schemes will use the mean and the minimum of system values for a given total energy to transmit the symbols at the required discrete rates. These three methods will be named as the iterative WF-based continuous bit loading method, the mean system value-based discrete bit loading method, and the minimum system value-based discrete bit loading method.
The mean and minimum system value-based methods will require different and equal transmission energy allocations, respectively. The iterative energy allocation methods will be described for the mean system value-based discrete bit loading systems.
The link throughput improvements for these three methods will be described, when considering the receiver design, power control, and signature sequence selection algorithms. A complexity reduction method will be presented for covariance matrix inversions. The results show that the HSDPA MIMO system, using the optimization methods proposed in this article, achieve a system throughput close to the system value capacity UB for the frequency selective channels. The results are then comparable with the LTE system, without incurring the cost of building new infrastructures.
In Section 3, two HSDPA MIMO system models will be described for receivers with the non-SIC and the SIC-based MMSE de-spreading units. In Section 4, the system value formulation will be presented and the MMSE filter coefficient calculations will be given. The system value UB concept for both the non-SIC and the SIC-based receivers will be presented in Section 5. The formulation of a simplified iterative covariance matrix for use in the design of the SIC-based receivers with MMSE equalizers will be described in Appendix 2 to support the material presented in Section 5. The system value-based sum capacity/throughput maximization methods for optimum signature sequence selection, energy allocation, and rate maximization will be described in Section 6. These schemes will be based on the iterative WF and the mean and the minimum system value optimization methods. Finally, the results will be described in Section 7 before the conclusions are given in Section 8.
3 System model
a is a scalar, $\vec{a}$ is a column vector, and A is a matrix. The identity matrix with dimension L is given as I L .
3.2 Transmitter and a non-SIC-Based receiver model
The HSDPA MIMO system model used in the following sections will be briefly described in this section for both the non-SIC and the SIC-based receivers. Initially, a non-SIC-based multi-code CDMA MIMO downlink transmission system will be considered with N T transmit antennas and N R receive antennas, with their respective indices represented by $n_t$ and $n_r$. Given the spreading factor N of the system, the maximum number K of spreading sequences satisfies the relationship $K \le \min(N_T, N_R)\,N$, where each spreading sequence index is represented by k. When selecting the optimum number K∗ of spreading sequences, weak channels corresponding to a specific set of signature sequences will be excluded to maximize the total rate. The system under consideration will operate with the selected optimum number K∗ of spreading sequences. Each spreading sequence will transmit a symbol operating at a discrete rate chosen from a set of rates according to the CSI updated at regular transmission time intervals (TTIs). In the system model, each parallel binary bit packet for k=1,…,K∗ of length N U will be encoded to produce a length-B vector and mapped to quadrature amplitude modulation (QAM) symbols, each carrying $b = \log_2 M$ bits, where M is the chosen constellation size. The encoding rate will be used to obtain a realizable discrete rate of $b_p = r_{\mathrm{code}} \times \log_2 M$ bits per symbol, where p=1,…,P are the different discrete bit indices available. The bit rate for each spreading sequence is represented by $b_{p_k}$ for k=1,…,K∗.
A non-SIC-based system model is shown in Figure 1. In the CDMA system, the number of symbols transmitted per packet is given by $N^{(x)} = \mathrm{TTI}/(N T_c)$, where $T_c$ is the chip period and $N T_c$ is the symbol period. In each parallel channel, the mapped packet of symbols over 1 TTI is represented by an $N^{(x)}$-long vector for k=1,…,K∗, where each symbol carries unity average energy. The symbols over the K∗ parallel channels are stored in an $N^{(x)} \times K^*$-dimensional matrix, which can equally be read row by row, where each length-K∗ row vector contains the symbols of one symbol period ρ=1,…,$N^{(x)}$.
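As a quick numerical check of the symbols-per-packet relationship, the sketch below evaluates TTI/(N T c). The 2 ms TTI, 3.84 Mchip/s chip rate and spreading factor 16 are assumed here as typical HSDPA values; they are not stated in this section, and the function name is hypothetical.

```python
def symbols_per_tti(tti_s, chip_rate_hz, spreading_factor):
    """Number of symbols per packet: TTI / (N * T_c), with T_c = 1 / chip_rate."""
    chips_per_tti = round(tti_s * chip_rate_hz)
    if chips_per_tti % spreading_factor != 0:
        raise ValueError("TTI must contain a whole number of symbols")
    return chips_per_tti // spreading_factor


# Assumed HSDPA-like parameters: 2 ms TTI, 3.84 Mchip/s, N = 16.
n_x = symbols_per_tti(2e-3, 3.84e6, 16)
print(n_x)  # 480
```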
Each spreading sequence will have an energy $E_k$ allocated, where the assigned energies are stored in a K∗×K∗-dimensional amplitude matrix $A = \mathrm{diag}\left(\sqrt{E_1}, \ldots, \sqrt{E_{K^*}}\right)$. The energy-weighted symbols will then be spread by signature sequences (spreading codes), which are represented by an (N T N)×K∗ signature sequence matrix

$$S = \left[ S_1^T, \ldots, S_{N_T}^T \right]^T,$$

where $S_{n_t}$ is the N×K∗ spreading sequence matrix of the $n_t$-th antenna. The length-N transmit signal vector at antenna $n_t$ for the symbol period ρ is obtained by applying $S_{n_t} A$ to the length-K∗ symbol vector of that period. This vector will then be fed to a pulse shaping filter at integer multiples of T c before up-converting to the desired carrier frequency. The length-(N N T) MIMO transmit signal vector is formed by stacking the N T per-antenna vectors, that is, by applying $S A$ to the same symbol vector.
Assuming the clocks at the transmitter and the receiver are fully synchronized, the signals arriving at the receive antennas will first be down-converted to the baseband before being sampled at every T c at the output of the receiver chip matched filter.
The receiver matched filtered signal vector for each symbol period will be represented by an N R (N+L−1) long vector , where L is the number of resolvable paths in a multipath wireless channel. The samples at the output of the chip match filter of the N R th antenna are represented by an (N+L−1)-length vector . The N R (N+L−1)×N(x)-dimensional matched filter matrix R is formed by taking as its ρ th-column such that . With and the N R (N+L−1)×K∗-dimensional MMSE linear de-spreading filter matrix containing de-spreading filter coefficients each of which is calculated using (10) for k=1,…,K∗. The estimate of the transmitted symbol can be found as follows:
The vector is used to form the N(x)×K-dimensional de-spread signal matrix or alternatively by using to de-spread the received signal vector of the k th channel. The de-spread signals pass through the decision device, where the signals are quantized, de-mapped and decoded to form binary data vectors for k=1,…,K.
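At the output of each receiver, the mean square error (MSE) between the transmitted symbol $y_k(\rho)$ and the estimated symbol $\hat{y}_k(\rho)$ is given as $\epsilon_k = E\left(\left|\hat{y}_k(\rho) - y_k(\rho)\right|^2\right)$. When the MSE is minimized, it has a relationship with the SINR $\gamma_k$ and the system value $\lambda_k$ as $\epsilon_k = \frac{1}{1+\gamma_k} = 1 - \lambda_k$. Therefore, the system value is given by

$$\lambda_k = 1 - \epsilon_k = \frac{\gamma_k}{1+\gamma_k}.$$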
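The mapping between the minimized MSE, the SINR and the system value can be illustrated with a few helper functions. This is a minimal sketch assuming the standard MMSE identities, MSE = 1/(1 + SINR) and system value = 1 − MSE; the function names are hypothetical.

```python
def system_value_from_sinr(gamma):
    """lambda = gamma / (1 + gamma); lies in [0, 1) for gamma >= 0."""
    return gamma / (1.0 + gamma)


def sinr_from_system_value(lam):
    """Inverse mapping: gamma = lambda / (1 - lambda), valid for lambda < 1."""
    return lam / (1.0 - lam)


def mmse_from_sinr(gamma):
    """Minimized mean square error: 1 / (1 + gamma) = 1 - lambda."""
    return 1.0 / (1.0 + gamma)


gamma = 3.0                      # SINR of 3 (about 4.8 dB)
lam = system_value_from_sinr(gamma)
print(lam, mmse_from_sinr(gamma))  # 0.75 0.25
```

Because the system value is bounded between 0 and 1, it is a convenient quantity for ordering and comparing spreading sequences.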
3.3 The SIC-based receiver model
Figure 2 illustrates the system model for a SIC-based receiver, which collects the received signals to formulate the received signal vector. The receiver processes and cancels the signals channel by channel to ensure that the SI is minimized. Starting from channel K∗ and by setting the received signal matrix , the N R (N+L−1)×N(x)-dimensional reduced data matrix Rk−1 will iteratively be calculated using from k=K∗ to k=1 where is the allocated energy of the k th channel. The matrix Φ k of dimension N R (N+L−1)×N(x) will be constructed as , where the length N(x) vectors , and are the detected stream of the current symbol period, the detected streams with ISI symbols received in the previous and the next symbol periods, respectively. The N R (N+L−1)-dimensional receiver matched filter sequences and are given in (8), (31), and (32) for the current, previous, and next symbols, respectively.
At each k th channel, the estimated symbol vector is generated by using each MMSE de-spreading vector from (15) to yield a de-spread signal vector of and an estimated bit stream The decoded bit vector is re-coded at the receiver and re-modulated to regenerate the transmit symbol vector at the output of the decision device. The vector is used to form Φ k which is required to generate Rk−1 for the next channel. This process of cancelling the detected symbols continues from k=K∗ to k=1. The next section will introduce the system value and the de-spreading filter coefficient calculations for both the SIC and the non-SIC-based systems.
4 System value and MMSE de-spreading filter coefficient formulations
In this section, the system values and the corresponding MMSE de-spreading filter coefficients are expressed in terms of the received signal vector .
4.1 System values for a non-SIC-based receiver
The received signal vector over the symbol period ρ is given in terms of the transmitted signal vector as
and the received signal matrix is given by . The N R (N+L−1)-dimensional vector contains the concatenated noise samples at the output of the receiver chip matched filters. The N R (N+L−1)×N T N matrix H represents the overall MIMO channel convolution matrix formed as follows:
The channel convolution matrix between the pair of antennas is determined by their channel impulse response of dimension L×1. It is assumed that the signals from each N T th transmit antenna to each N R th receive antenna undergo the same channel condition for the packet duration with L resolvable paths, and the channel conditions obtained from the feedback of pilot signals. The corresponding channel convolution matrix between the pair of antennas is formed as
The spatiotemporal MIMO channel matrix for the previous symbol block and the next symbol block are given as
where J is a vector shifting matrix. The notation $J_{N+L-1}$ is the shift matrix of dimension (N+L−1)×(N+L−1), defined as $J_{N+L-1} = \begin{bmatrix} \vec{0}_{N+L-2}^{\,T} & 0 \\ I_{N+L-2} & \vec{0}_{N+L-2} \end{bmatrix}$. When multiplied with a matrix, $(J^T)^N$ shifts the columns of the matrix up by N chips and fills the empty contents with zeros, while $J^N$ shifts the columns of the matrix down by N chips and fills the empty contents with zeros.
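The channel convolution matrix and the shift operators can be sketched numerically as follows. The Toeplitz construction (columns are delayed copies of the impulse response) and the sub-diagonal shift matrix are assumptions chosen to match the dimensions stated in the text; the helper names and the example taps are hypothetical.

```python
import numpy as np


def convolution_matrix(h, n):
    """(n + L - 1) x n matrix whose columns are delayed copies of h (length L)."""
    h = np.asarray(h)
    num_paths = h.size
    H = np.zeros((n + num_paths - 1, n), dtype=h.dtype)
    for col in range(n):
        H[col:col + num_paths, col] = h
    return H


def shift_matrix(size):
    """Ones on the first sub-diagonal: J @ x moves the entries of x down by one."""
    return np.eye(size, k=-1)


N = 4                                  # spreading factor (toy value)
h = np.array([1.0, 0.5, 0.25])         # L = 3 resolvable paths (toy taps)
H = convolution_matrix(h, N)           # shape (N + L - 1, N) = (6, 4)

J = shift_matrix(N + h.size - 1)
J_down = np.linalg.matrix_power(J, N)      # shift down by N chips
J_up = np.linalg.matrix_power(J.T, N)      # shift up by N chips

x = np.arange(1.0, 7.0)                # [1, 2, 3, 4, 5, 6]
print(J_down @ x)                      # [0. 0. 0. 0. 1. 2.]
print(J_up @ x)                        # [5. 6. 0. 0. 0. 0.]
```

Multiplying H by a chip sequence is equivalent to linear convolution with the impulse response, which is why the received vector spans N+L−1 chips and overlaps with the neighbouring symbol periods.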
The N R (N+L−1)×K∗-dimensional receiver matched filter signature sequence matrix Q is calculated as follows:
The system value for the spread spectrum system based on a receiver without the SIC scheme is given by

$$\lambda_k = E_k\, \vec{q}_k^{\,H} C^{-1} \vec{q}_k, \qquad (9)$$

where $\vec{q}_k$ is the k th column of Q and $C$ is the N R (N+L−1)×N R (N+L−1)-dimensional covariance matrix of the received signal vector. In (9), the covariance matrix C is calculated using (30) in terms of Q and the noise covariance matrix, as shown in Appendix 1.
The normalized MMSE de-spreading coefficients $\vec{w}_k$ for k=1,…,K∗, when the MSE per channel is minimized, can be formed in terms of C, as shown below:

$$\vec{w}_k = \frac{C^{-1}\vec{q}_k}{\vec{q}_k^{\,H} C^{-1} \vec{q}_k}. \qquad (10)$$
These coefficients are then stored in a matrix of dimension N R (N+L−1)×K∗.
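A small numerical sketch of the non-SIC system values and normalized MMSE de-spreading filters is given below. It assumes the quadratic form "energy times q^H C^-1 q" for the system value and, as a simplification, builds the covariance matrix from the current-symbol sequences and white noise only, omitting the ISI terms of the full expression in (30). The function name and the `noise_var` parameter are hypothetical.

```python
import numpy as np


def non_sic_system_values(Q, energies, noise_var):
    """Return (system values, filter matrix W) for a receiver without SIC.

    Q         : (dim x K) receiver matched filter signature sequences.
    energies  : length-K energy allocation.
    noise_var : noise variance per received sample (assumed white).
    ISI contributions are omitted in this simplified covariance.
    """
    Q = np.asarray(Q, dtype=complex)
    energies = np.asarray(energies, dtype=float)
    dim = Q.shape[0]
    # C = Q diag(E) Q^H + noise_var * I
    C = (Q * energies) @ Q.conj().T + noise_var * np.eye(dim)
    C_inv_Q = np.linalg.solve(C, Q)                        # C^-1 q_k per column
    quad = np.real(np.sum(Q.conj() * C_inv_Q, axis=0))     # q_k^H C^-1 q_k
    lam = energies * quad
    W = C_inv_Q / quad                                     # normalized filters
    return lam, W


# Two orthonormal sequences: lambda_k reduces to E_k / (E_k + noise_var).
Q = np.eye(4)[:, :2]
lam, W = non_sic_system_values(Q, [1.0, 3.0], noise_var=1.0)
print(lam)  # [0.5  0.75]
```

The normalization makes each filter satisfy w_k^H q_k = 1, so the wanted symbol passes through the de-spreader with unit gain.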
4.2 System values and MMSE de-spreading filter coefficients for a SIC-based receiver
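Similar to the received signal vector constructed in (4), a SIC-based received signal vector is formed to improve the SINR at the output of each receiver. For the SIC scheme, the system value λ k for k=1,…,K∗ is determined using the following equation:

$$\lambda_k = E_k\, \vec{q}_k^{\,H} C_k^{-1} \vec{q}_k, \qquad (11)$$

where $\vec{q}_k$ is the k th column of Q in (8). The covariance matrix C k is initialized with $C_0$ equal to the noise covariance matrix and then iteratively constructed for k=1,…,K∗ using the following relationship:

$$C_k = D_k + E_k\, \vec{q}_k \vec{q}_k^{\,H}, \qquad (12)$$

$$D_k = C_{k-1} + E_k\, \vec{q}_{k,1} \vec{q}_{k,1}^{\,H} + E_k\, \vec{q}_{k,2} \vec{q}_{k,2}^{\,H}, \qquad (13)$$

where $\vec{q}_{k,1}$ and $\vec{q}_{k,2}$ are the k th columns of Q1 and Q2, the matched filter sequences for the previous and the next symbols. After all iterations k=1,…,K∗ have been completed, the covariance matrix given in (30) is set to be $C = C_{K^*}$.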
When calculating the system values for the SIC scheme, each system value λ k in (11) for k=1,…,K∗ involves one matrix inversion $C_k^{-1}$, which requires high computational complexity. By applying the matrix inversion lemma $(A+UBV)^{-1} = A^{-1} - A^{-1}U\left(B^{-1}+VA^{-1}U\right)^{-1}VA^{-1}$ on D k in (13) and C k in (12), an iterative covariance matrix inversion method is formed by constructing the inverse matrices $D_k^{-1}$ and $C_k^{-1}$ using (33) and (34), respectively, as a function of $C_{k-1}^{-1}$, as shown in Appendix 2, so that the total number of matrix inversions required to obtain λ k for k=1,…,K∗ reduces to 1.
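The matrix inversion lemma can be checked numerically. The sketch below is a generic illustration, not the article's expressions (33) and (34): it applies the lemma to a low-rank update of a matrix whose inverse is already known, so that only a small r×r system has to be inverted. The function name and the random test matrices are hypothetical.

```python
import numpy as np


def woodbury_inverse(A_inv, U, B, V):
    """(A + U B V)^-1 = A^-1 - A^-1 U (B^-1 + V A^-1 U)^-1 V A^-1."""
    small = np.linalg.inv(B) + V @ A_inv @ U      # only an r x r inversion
    return A_inv - A_inv @ U @ np.linalg.inv(small) @ V @ A_inv


rng = np.random.default_rng(0)
n, r = 6, 2
A = np.diag(rng.uniform(1.0, 2.0, n))             # easy-to-invert base matrix
U = rng.standard_normal((n, r))
B = np.diag(rng.uniform(0.5, 1.5, r))
V = U.T                                           # symmetric rank-r update

A_inv = np.diag(1.0 / np.diag(A))
direct = np.linalg.inv(A + U @ B @ V)
via_lemma = woodbury_inverse(A_inv, U, B, V)
print(np.allclose(direct, via_lemma))             # True
```

In the SIC setting each update adds at most a few rank-one terms, so the inverse can be propagated from one channel to the next without a new full-size inversion.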
The inverse matrices and the corresponding system values, λ k , are calculated iteratively so that the system value λ k given in (11) is reorganized using (34) to simplify the SINR γ k at the output of the k th SIC receiver to the following form

$$\gamma_k = \frac{\lambda_k}{1-\lambda_k} = E_k\, \vec{q}_k^{\,H} D_k^{-1} \vec{q}_k, \qquad (14)$$

using the steps in (35) to (38) given in Appendix 2. Therefore, γ k can be calculated when $D_k^{-1}$ is obtained using (33).
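The SIC recursion can be sketched as below. This is an illustrative reading of the scheme under stated assumptions: the covariance starts from white noise, each channel adds its ISI terms to form an interference-only matrix D and then its own rank-one term to form C, the system value is taken from C and the SINR from D. For clarity the code solves linear systems directly instead of using the reduced-complexity inverse updates of Appendix 2; all names are hypothetical.

```python
import numpy as np


def sic_system_values(Q, Q1, Q2, energies, noise_var):
    """Return (system values, SINRs) for k = 1..K with a SIC receiver.

    Q, Q1, Q2 : matched filter sequences for current, previous, next symbols.
    energies  : length-K energy allocation.
    noise_var : noise variance per sample (assumed white).
    """
    dim, num_seq = Q.shape
    C_prev = noise_var * np.eye(dim, dtype=complex)       # C_0
    lam = np.zeros(num_seq)
    gamma = np.zeros(num_seq)
    for k in range(num_seq):
        q, q1, q2 = Q[:, k], Q1[:, k], Q2[:, k]
        e_k = energies[k]
        # Interference-plus-noise seen by channel k (ISI of channel k included).
        D = C_prev + e_k * (np.outer(q1, q1.conj()) + np.outer(q2, q2.conj()))
        # Adding the wanted-signal term gives the covariance for channel k.
        C = D + e_k * np.outer(q, q.conj())
        gamma[k] = e_k * np.real(q.conj() @ np.linalg.solve(D, q))
        lam[k] = e_k * np.real(q.conj() @ np.linalg.solve(C, q))
        C_prev = C
    return lam, gamma


rng = np.random.default_rng(1)
shape = (8, 3)
Q = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
Q1 = 0.3 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
Q2 = 0.3 * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
lam, gamma = sic_system_values(Q, Q1, Q2, np.array([1.0, 0.5, 2.0]), 0.1)
print(np.allclose(lam, gamma / (1.0 + gamma)))            # True
```

The final check confirms that the two quadratic forms are linked by the same system value and SINR relationship used for the non-SIC receiver.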
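The MMSE linear equalizer de-spreading filter coefficients for the k th SIC receiver, corresponding to (10) for the non-SIC case, are expressed in terms of C k as

$$\vec{w}_k = \frac{C_k^{-1}\vec{q}_k}{\vec{q}_k^{\,H} C_k^{-1} \vec{q}_k}. \qquad (15)$$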
5 Sum capacity optimization using system values
The main focus of this article is to find the optimum number K∗ of spreading sequences, which maximizes the total rate, where K∗ is a subset of the total number K of spreading sequences used for transmission. The total rate is maximized by minimizing the total MSE where is the number of bits allocated to each spreading sequence symbol for k=1,…,K∗. The total MSE minimization criterion has been studied in [24, 27, 28] and can be expressed in terms of the Lagrangian dual objective function:
where λ is the Lagrangian multiplier. The minimization of the total MSE using the above equation provides solutions for E k and the Lagrangian multiplier λ, subject to the energy constraint . Since is expressed as a function of ϵ k and E k , will be determined only after energy allocation, which could be computationally expensive, when an iterative energy calculation is required. Therefore, this article uses the system value optimization originally presented in , where the system value λ k of the k th channel is calculated using (9) and (11) for the non-SIC and the SIC-based receivers, respectively. Differing from , in this article a method is proposed to calculate the discrete rate for each spreading sequence using the mean system value λmean prior to allocating the energy for each sequence.
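The mean system value λmean is calculated by allocating energies equally such that $E_k = E_T/K^*$ and then obtaining the system value λ k from (9) for the non-SIC receiver or (11) for the SIC receiver, using the following equation

$$\lambda_{\mathrm{mean}} = \frac{1}{K^*}\sum_{k=1}^{K^*} \lambda_k.$$

The total system capacities for the MMSE receivers for both the SIC and the non-SIC-based receivers are then given as

$$C_T = \sum_{k=1}^{K^*} \log_2\left(1 + \frac{\gamma_k}{\Gamma}\right) = \sum_{k=1}^{K^*} \log_2\left(1 + \frac{\lambda_k}{\Gamma\,(1-\lambda_k)}\right),$$

where Γ is the gap value. To relate the system values to discrete bit rate optimization, one can use the discrete bit rate and its SINR relationship $b_{p_k} = \log_2\left(1 + \gamma_k/\Gamma\right)$. Thus, the target SINR can be expressed as a function of the discrete rate as follows:

$$\gamma_k^*(b_{p_k}) = \Gamma\left(2^{b_{p_k}} - 1\right),$$

and the corresponding target system value expressed as a function of $b_{p_k}$ can be obtained using

$$\lambda_k^*(b_{p_k}) = \frac{\gamma_k^*(b_{p_k})}{1 + \gamma_k^*(b_{p_k})} = \frac{\Gamma\left(2^{b_{p_k}} - 1\right)}{1 + \Gamma\left(2^{b_{p_k}} - 1\right)}.$$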
The next section will provide a detailed description of the system value based throughput optimization methods for both the non-SIC and the SIC-based spread spectrum MIMO systems.
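As a numerical companion to the relationships in this section, the sketch below converts a discrete rate into a target SINR and a target system value, and evaluates the gap-adjusted sum rate from a set of system values. It assumes the usual gap approximation, rate = log2(1 + SINR/gap); the function names and the example numbers are hypothetical.

```python
import math


def target_sinr(bits, gap):
    """SINR needed to carry `bits` per symbol with gap value `gap` (linear)."""
    return gap * (2.0 ** bits - 1.0)


def target_system_value(bits, gap):
    """Target system value: gamma* / (1 + gamma*)."""
    g = target_sinr(bits, gap)
    return g / (1.0 + g)


def total_capacity(system_values, gap):
    """Sum of log2(1 + lambda / (gap * (1 - lambda))) over the sequences."""
    return sum(math.log2(1.0 + lam / (gap * (1.0 - lam)))
               for lam in system_values)


print(target_sinr(2, gap=1.0))            # 3.0
print(target_system_value(2, gap=1.0))    # 0.75
print(total_capacity([0.75, 0.5], 1.0))   # 3.0
```

A sequence can carry a given discrete rate only if its system value reaches the corresponding target, which is what allows rates to be chosen before any energy is allocated.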
6 System value-based discrete and WF algorithm-based non-discrete bit loading
In this section, an iterative WF algorithm and two discrete bit loading algorithms will be presented using the system value approach. These methods operate with a given total energy E T when implemented with or without the proposed SIC receiver. First, the iterative WF algorithms will be presented for continuous bit loading. Two iterative discrete bit loading methods will then be proposed to maximize the total rate without the need for any prior energy allocation. These discrete bit loading methods maximize the total rate by jointly allocating the discrete rate and then selecting the optimum number K∗ of ordered spreading sequences. The first discrete bit loading algorithm will use the mean system value λmean to determine the optimum number K∗ of spreading sequences and to select the sequences prior to allocating the energy for each sequence. The second discrete bit loading method will use the minimum system value λmin to select the optimum number of sequences.
The system values will be ordered in an ascending order for all combinations of Kopt=K,…,1 for both discrete bit loading methods prior to selecting the optimum number of signature sequences. The temporary number Kopt of optimum spreading sequences is used as an initial value for each loop in an iterative sequence number optimization process.
For the discrete bit loading methods with λmean and λmin, margin adaptive (MA) loading (equal rate) algorithms will be considered initially so that all spreading sequences have the same rate for k=1,…,K∗ by using the target system values identified in (19) in terms of the available discrete rates. The total transmission rate is R T =K∗b p . Then, the two-group (TG) rate adaptive optimization will be described for both cases to use the wasted (residual) energy caused by quantization loss, by loading a certain number of channels, m, with the next discrete rate bp+1 to further increase the total rate to RT,T G=(K∗−m)b p +m bp+1.
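The two total-rate expressions can be compared with a short worked example. The values K∗ = 12, b p = 2, b p+1 = 3 and m = 5 are hypothetical and chosen only for illustration; how m is chosen from the residual energy is not modelled here.

```python
def ma_total_rate(k_star, b_p):
    """Margin adaptive (equal rate) loading: every sequence carries b_p bits."""
    return k_star * b_p


def tg_total_rate(k_star, m, b_p, b_next):
    """Two-group loading: m sequences are raised to the next rate b_next."""
    if not 0 <= m <= k_star:
        raise ValueError("m must lie between 0 and k_star")
    return (k_star - m) * b_p + m * b_next


# Hypothetical numbers: 12 sequences, rates of 2 and 3 bits per symbol.
print(ma_total_rate(12, 2))          # 24
print(tg_total_rate(12, 5, 2, 3))    # 29
```

With m = 0 the two-group rate reduces to the equal-rate total, so the two-group scheme can only add to the margin adaptive result.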
6.1 Iterative WF-based continuous bit loading
The iterative WF was originally developed to remove sub-channels, which contain negative energies, and to maximize the total rate. This section describes the iterative WF optimization, which finds the optimum sub-channels using the system values for continuous unequal bit loading. This iterative WF algorithm can also be applied to the HSDPA system with and without SIC by using the system value λ k formulation given in (9) and (11), respectively. The algorithm first allocates energies to the channels before the rates and the optimum number of channels are determined. The iterative WF starts with Kopt=K, where Kopt is the temporary optimum number of codes. In each Koptth iteration, the WF calculates the channel SINR per energy unit vector and assigns energies E k for k=1,…,Kopt. The signature sequences are reordered starting with those signature sequences which have the lowest channel SINR. The first sub-channel is removed if it was assigned a negative energy. When there are no more sub-channels with negative energies, energies are allocated iteratively until they converge. This continues unless a channel with negative energy is detected during the process. In the latter case, the corresponding sub-channel will be removed and the energies are recalculated as before. The algorithm will return the optimum number of coded channels K∗, their respective allocated energies and signature sequences, the covariance matrix C or C k and the MMSE receiver coefficients.
The iterative WF algorithm initializes Kopt=K and the procedure is summarized as follows:
1. Initialize the loop counter as I=1. The number of energies E k is K opt, and the associated vectors are of length K opt.
2. Perform energy allocation:
(a) Calculate the channel SINR per energy unit vector by finding λ k from (9) for non-SIC or (11) when using the SIC receiver.
(b) Determine the WF constant and the energy allocations E k for k=1,…,K opt.
3. Perform the signature sequence reordering procedure:
(a) Find the term c k , the index of the k th smallest element of the channel SINR per energy unit vector, and store it in an index vector.
(b) Reorder the receiver signature sequence vectors as well as the energies E k for k=1,…,K opt.
4. Carry out the channel removal process: If E 1< 0, remove this channel by setting K opt=K opt−1, update the retained signature sequences accordingly, form Q e =[Q , Q 1,Q 2], and repeat the process from step 1. Otherwise, set K ∗=K opt and the counter I=I+1, and repeat the process from step 2 until I=I max is reached.
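The channel removal logic of the procedure above can be sketched with classical water-filling. This is a simplified illustration, not the article's algorithm: it assumes the channel SINR per energy unit of each sequence is fixed, whereas in the text it is recomputed from the covariance matrix after every energy update. The closed-form water level used here is the standard water-filling solution with a gap value; the names are hypothetical.

```python
import numpy as np


def water_filling(gains, total_energy, gap=1.0):
    """Allocate energy over channels, removing those with negative energy.

    gains        : channel SINR per unit energy for each sequence (fixed here).
    total_energy : total energy E_T to distribute.
    gap          : gap value (linear).
    Returns (energies in original order, number of active channels).
    """
    gains = np.asarray(gains, dtype=float)
    active = list(np.argsort(gains))          # weakest channel first
    while True:
        inverse = gap / gains[active]
        level = (total_energy + inverse.sum()) / len(active)   # WF constant
        energies = level - inverse
        if energies[0] < 0 and len(active) > 1:
            active.pop(0)                     # remove the weakest channel
            continue
        break
    allocation = np.zeros_like(gains)
    allocation[active] = energies
    return allocation, len(active)


energies, k_star = water_filling([0.1, 1.0, 2.0], total_energy=1.0)
print(energies, k_star)   # [0.   0.25 0.75] 2
```

In this toy case the weakest channel would receive negative energy, so it is dropped and the total energy is shared between the remaining two channels.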
6.2 System value-based signature sequence ordering for discrete loading
This section will describe the use of system values for ordering the signature sequences to maximize the system capacity by determining and selecting the optimum number of signature sequences for receivers with and without the SIC scheme. The signature sequence ordering process starts by setting Kopt=K and continues by iteratively adjusting Kopt=Kopt−1 until Kopt=1 is reached. In each iteration, the system values are calculated, then the signature sequences (or coded channels) are ordered, and the signature sequence with the smallest system value is removed. This generates a new set of selected and ordered signature sequences for each Koptth iteration.
By allocating energies equally to all selected spreading sequences k=1,…,Kopt for that iteration, the system values are obtained from (9) or (11) for the non-SIC and SIC cases, respectively. These system values are stored in a Kopt length vector . The mean system values λmean and the minimum system value λmin for each Kopt iteration are stored in the K-length vectors and respectively where and are initialized as .
The system values given in are sorted in an ascending order for the current Kopt iteration and are stored in the Koptth column of the K×K matrix λstore, i.e., in . The indices of the ordered system values are stored in a Kopt length vector , where indices range from 1 to Kopt.
The next step is to find the Kopt-length vector which contains the indices of the selected subset of the signature sequences used in the current Kopt iteration. These are also ordered according to the ascending order of the system values using . Next will be stored in the Koptth column of the K×K upper triangular matrix Kseq such that . The vector is initialized as and Kseq is initialized as 0K×K.
Defining Qorig, Qorig1, and Qorig2 as the original unmodified receiver signature sequence matrices of Q, Q1, and Q2, whose column order is equivalent to that of S, the reordering procedure is carried out by setting , and . The signature sequence removal is completed by removing the first element of so that the vector length is reduced to Kopt−1, and by removing the first columns of Q, Q1, and Q2 so that the received signature sequence matrix dimension becomes N R (N+L−1)×(Kopt−1). This reduced matrix is used to calculate the system values, and to order and remove the spreading sequence with the smallest system value for the next Kopt iteration, by setting Kopt=Kopt−1 and repeating the process until Kopt=1.
The procedure can be summarized as below.
Find all system values corresponding to each K opt from K opt=K to K opt= 1 by using the following steps.
Allocate energy equally for each signature sequence such that , for k=1,…,K opt. Form the amplitude matrix A.
Find λ k for k=1,…,K opt using (9) and C from (30) for non-SIC, or λ k from (11) and C k from (12) for SIC. Store .
Store the minimum system value and the mean system value .
Reorder the signature sequences and remove the signature sequence with the minimum λ k for each K opt iteration:
Find the indices of the k th smallest elements for k=1,…,K opt of , store it in .
Store the system values in in ascending order.
Find the vector which contains the indices of the selected subset of the signature sequences and with ordering according to . Store the reordered sequence index a k in .
Use a k to reorder , , and for k=1,…,K opt.
If K opt>1, set K opt=K opt−1. Set for k=1,…,K. Form Q e =[Q , Q 1,Q 2] and repeat steps 1 and 2. Otherwise, the optimum signature sequence identification for the discrete loading schemes will be performed, as described in the next two sections.
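The ordering and removal procedure summarized above can be sketched as follows. The sketch rests on two assumptions that are not taken from this section: the system value is computed as λ k = E k q k^H C^{-1} q k, and the covariance matrix contains only the desired-symbol term and noise, so the ISI matrices Q 1 and Q 2 are omitted. All function and variable names are illustrative.

```python
import numpy as np

def system_values(Q, energies, noise_var):
    """Assumed non-SIC system values: lambda_k = E_k q_k^H C^{-1} q_k,
    with C = Q diag(E) Q^H + noise_var * I (ISI terms omitted)."""
    C = (Q * energies) @ Q.conj().T + noise_var * np.eye(Q.shape[0])
    quad = np.real(np.sum(Q.conj() * np.linalg.solve(C, Q), axis=0))
    return energies * quad

def order_and_prune(Q_orig, total_energy, noise_var):
    """For K_opt = K, ..., 1: allocate equal energy, order the active
    sequences by ascending system value, record mean and minimum, and
    drop the weakest sequence."""
    K = Q_orig.shape[1]
    active = np.arange(K)                 # column indices into Q_orig
    lam_mean = np.zeros(K)
    lam_min = np.zeros(K)
    k_seq = {}                            # K_opt -> ordered column indices
    for k_opt in range(K, 0, -1):
        E = np.full(k_opt, total_energy / k_opt)
        lam = system_values(Q_orig[:, active], E, noise_var)
        active = active[np.argsort(lam)]  # ascending system value
        lam_mean[k_opt - 1] = lam.mean()
        lam_min[k_opt - 1] = lam.min()
        k_seq[k_opt] = active.copy()
        active = active[1:]               # remove the weakest sequence
    return lam_mean, lam_min, k_seq
```

The dictionary `k_seq` plays the role of the columns of Kseq, and the two returned vectors correspond to the stored mean and minimum system values for each Kopt.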
6.3 Mean system value-based discrete bit loading algorithm
To achieve the same SINR distribution at the output of each de-spreading unit so that a higher b p is selected for equal rate loading, transmission energies need to be adjusted to achieve a target (fixed) SINR at each receiver. The discrete transmission rate will be identified using the mean of the system value λmean. This method will operate with an energy constraint to identify the optimum number K∗ of signature sequences and select the transmission signature sequences to maximize the total transmission rate RT,mean.
With the relationship of the target system value λ∗ and the bit rates b p in (19), a set of target system values stored in the P-length vector corresponding to all bit rates b p will be generated. In Section 6.2, the ordered signature sequences for different numbers of signature sequences are given in Kseq for all combinations of Kopt=K,…,1. For these values, the rate to be transmitted will be identified by comparing to the target system values in for all Kopt combinations. The optimum number of codes, K∗, will be selected from the Kopt combination which gives the highest total rate RT,mean=K∗b p . This algorithm returns the total rate RT,mean, the optimum number of codes K∗, and the selected and ordered signature sequence matrix S(mean). The algorithm is described below:
For the set of bit rates , find the corresponding target system value for p=1,…,P.
Find that satisfies
Store the total rate for K opt=1,…,K.
Select the optimum signature sequences satisfying . The total rate .
Construct the signature sequence matrix by setting where for .
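A minimal sketch of the rate selection in the steps above is given here. It assumes that (19) has the form λ∗ = g/(1+g) with g = Γ(2^b −1), and that the selected rate for each Kopt is the largest b p whose target does not exceed the mean system value. Both assumptions, and all names used, are illustrative rather than taken from the article.

```python
import numpy as np

def target_system_values(bit_rates, gap=1.0):
    """Assumed form of (19): lambda* = g / (1 + g), g = gap * (2**b - 1)."""
    g = gap * (2.0 ** np.asarray(bit_rates, dtype=float) - 1.0)
    return g / (1.0 + g)

def select_by_mean(lam_mean, bit_rates, gap=1.0):
    """lam_mean[k-1] is the mean system value when k sequences are used.
    bit_rates must be sorted in ascending order.
    Returns (total rate, optimum number of codes, rate per code)."""
    targets = target_system_values(bit_rates, gap)
    best_rate, best_k, best_b = 0.0, 0, 0.0
    for k_opt, lam in enumerate(lam_mean, start=1):
        # index of the largest rate whose target is not above lam
        p = np.searchsorted(targets, lam, side="right") - 1
        if p < 0:
            continue                      # no rate is supported
        total = k_opt * bit_rates[p]
        if total > best_rate:
            best_rate, best_k, best_b = total, k_opt, bit_rates[p]
    return best_rate, best_k, best_b

# Rates of 0.5 to 6 bits per symbol, as used in the experiments.
rates = np.arange(0.5, 6.5, 0.5)
result = select_by_mean([0.9, 0.8, 0.7], rates)
```

For the hypothetical mean values in the example, three codes at 1.5 bits per symbol give a higher total rate than one code at 3 bits per symbol or two codes at 2 bits per symbol.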
The TG optimization can be used to further maximize the total rate by loading m channels with bp+1 so that the total rate becomes . For the mean system value based optimization method, the number of channels mmean which loads the next discrete rate bp+1,mean will be obtained by finding the maximum mmean that satisfies the following inequality
6.3.1 Energy allocation for non-SIC
This section describes the energy allocation schemes for the mean system value-based discrete bit loading allocation for both the non-SIC receiver and the SIC receiver with equal rate or TG allocation. When allocating equal rate, the bit rates of each channel are equal, i.e., for k=1,…,K∗; while bit rates are allocated as for and for when using the TG allocation.
With K∗, and obtained in Section 6.3 the transmission energies for the non-SIC scheme can be iteratively calculated as shown below:
where i is the iteration number. The term is calculated by inverting Ci−1 given in (30), which is a function of Ek,(i−1) for with initialized for all channels. The iteration continues until the energies converge to fixed values or the maximum number of iterations, Imax is reached.
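The iteration described above can be sketched as a fixed-point update. The update rule used here, E k,i = λ∗ / (q k^H C i−1^{-1} q k), is an assumption standing in for the equation that is not reproduced in this text, and the covariance matrix again omits the ISI terms. The stopping rule follows the 1% criterion quoted later in the results section. All names are illustrative.

```python
import numpy as np

def equal_sinr_energies(Q, target_lambda, total_energy, noise_var,
                        tol=0.01, max_iter=100):
    """Iterate the energies until every channel reaches the target
    system value, or until max_iter is reached.

    Assumed update: E_k <- target_lambda / (q_k^H C^{-1} q_k), where
    C = Q diag(E) Q^H + noise_var * I uses the previous energies.
    """
    n, k = Q.shape
    E = np.full(k, total_energy / k)          # equal-energy start
    for _ in range(max_iter):
        C = (Q * E) @ Q.conj().T + noise_var * np.eye(n)
        quad = np.real(np.sum(Q.conj() * np.linalg.solve(C, Q), axis=0))
        E_new = target_lambda / quad
        converged = np.sum(np.abs(E_new - E)) < tol * total_energy
        E = E_new
        if converged:
            break
    return E
```

Each pass inverts (here, solves with) the full covariance matrix once, which is the cost that the SIC formulation of the next subsection is designed to avoid.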
6.3.2 Energy allocation for SIC
As the iterative calculation of energy Ek,i depends on which requires energies Ek,(i−1) for k=1,…,K∗ for each iteration i, the SIC-based energy allocation method was developed to simplify the calculation of energy so that Ek,i depends only on Ek,(i−1) and the stored covariance matrix inverse which is a function of . The inverse covariance matrix will be calculated once per spreading sequence after having obtained the energy .
The energies for the SIC-based receiver can be iteratively calculated from E1 to without any need to invert a matrix for each energy iteration by rearranging (14) as follows:
By using (33), the energy calculation given in (23) can be simplified to
where the weighting factors ξ, ξ1, ξ2, ξ3, ξ4,ξ5, and ξ6 are constructed from , and using (36) and the covariance matrix, used for the calculation of E1, by initializing as . The terms ζ1,(i−1) and ζ2,(i−1) are calculated using (37) as a function of Ek,(i−1); while is the target SINR calculated as a function of using (20). The iterations of Ek,i continue until the energy converges to a fixed value or Imax is reached. Then, is calculated in terms of using (34). This process is repeated for all selected transmission channels for . Once the energies are allocated, the transmitter provides the receiver with the allocated energies. The next section will describe the minimum system value-based discrete bit loading schemes.
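The structure of the SIC energy calculation can be illustrated with a reduced sketch that ignores the ISI terms. Under that assumption each energy follows in closed form from the previous inverse covariance matrix, E k = γ∗ / (q k^H C k−1^{-1} q k), and the inverse is then updated with a rank-one correction, so no per-iteration matrix inversion is needed. With the ISI terms of the article included, the energy of each channel is instead refined iteratively as described above. The names below are illustrative.

```python
import numpy as np

def sic_energies(Q, target_lambda, noise_var):
    """Sequential SIC energy allocation without ISI terms (assumption).

    Channel k sees interference only from channels 1..k-1, so
    E_k = gamma* / (q_k^H C_{k-1}^{-1} q_k) with gamma* = lam/(1-lam).
    """
    n, k = Q.shape
    gamma = target_lambda / (1.0 - target_lambda)   # target SINR
    C_inv = np.eye(n) / noise_var                   # C_0^{-1}: noise only
    E = np.zeros(k)
    for j in range(k):
        q = Q[:, j]
        d = C_inv @ q                               # C_{k-1}^{-1} q_k
        xi = np.real(q.conj() @ d)                  # q_k^H C_{k-1}^{-1} q_k
        E[j] = gamma / xi
        # matrix inversion lemma: inverse of C_{k-1} + E_k q_k q_k^H
        C_inv = C_inv - E[j] * np.outer(d, d.conj()) / (1.0 + E[j] * xi)
    return E
```

The stored inverse is computed once per spreading sequence, after its energy is known, which mirrors the description of the stored covariance matrix inverse above.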
6.4 Minimum system value-based discrete bit loading algorithm
An equal energy loading method is adopted for the current HSDPA standards to load a discrete rate to each spreading sequence. Equal energy allocation produces varying SINRs at the receivers, but makes it simpler to allocate energies than the equal SINR loading scheme. As the channel with the minimum SINR is chosen as the target SINR to guarantee the quality of the service, this will also be referred to as the minimum system value-based discrete bit loading method. This section will describe how to select the optimum number and the corresponding signature sequences to maximize the total rate for the HSDPA downlink. For the minimum system value-based discrete bit loading, the transmission energies are allocated equally and there is no iterative energy adjustment. Differing from the mean system value-based discrete bit loading, the minimum system value λmin will be used to determine the transmission rate for each spreading sequence. With for all Kopt combinations and the ordering of the signature sequences given in Kseq as described in Section 6.2, the bit rate b p will be selected in a similar way to the mean system value-based loading, except that λmin is compared with the target system value. The algorithm will return the optimum number of codes , the total rate and the ordered signature sequence matrix S(min). The minimum system value-based loading is summarized below:
For the set of bit rates , find the corresponding target system value for p=1,…,P.
Find that satisfies
Store the total rate for K opt=1,…,K.
Select the optimum number of signature sequences by using . The total rate becomes .
Construct the signature sequence matrix by setting where for .
Again, a TG allocation can be performed to further increase the total rate. For the equal energy allocation, the channels that have system values where p corresponds to the index of bp,mean will be loaded with the next discrete rate bp+1,mean. The total rate for the minimum system value TG allocation will be .
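The two-group step for equal energy loading can be sketched as below. The sketch assumes that the base rate b p is the largest rate supported by the minimum system value, that a channel is promoted when its system value reaches the target of b p+1, and that the target system value has the form λ∗ = g/(1+g) with g = Γ(2^b −1). These are assumptions made for illustration, as are all names.

```python
import numpy as np

def min_value_two_group(lam, bit_rates, gap=1.0):
    """Equal-energy two-group loading for the selected channels.

    lam: system values of the selected channels under equal energy.
    bit_rates: available rates in ascending order.
    Returns (total rate, number of channels loaded with the next rate).
    """
    lam = np.asarray(lam, dtype=float)
    g = gap * (2.0 ** np.asarray(bit_rates, dtype=float) - 1.0)
    targets = g / (1.0 + g)               # assumed form of (19)
    p = np.searchsorted(targets, lam.min(), side="right") - 1
    if p < 0:
        return 0.0, 0                     # weakest channel supports no rate
    if p + 1 < len(bit_rates):
        m = int(np.sum(lam >= targets[p + 1]))
        total = (lam.size - m) * bit_rates[p] + m * bit_rates[p + 1]
    else:
        m, total = 0, lam.size * bit_rates[p]
    return float(total), m

rates = np.arange(0.5, 6.5, 0.5)
total, m = min_value_two_group([0.52, 0.6, 0.7, 0.8], rates)
```

In the example, the minimum value 0.52 supports 1 bit per symbol, and the two channels whose values exceed the 1.5-bit target are promoted, giving 2×1 + 2×1.5 = 5 bits per symbol in total.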
The next section will provide the results obtained from the simulations and the discussions about the performance of the different loading algorithms.
Two separate experimental setup systems were developed using the Matlab and the National Instruments (NI) LabVIEW platforms with the parameters as listed in Table 1. The proposed system value optimization methods both with and without the SIC implementation were tested using the Matlab and LabVIEW simulation packages with the parameters: a spreading factor of N=16, the full number of spreading sequences K f =2N, an additive white noise variance of σ2=0.02, and a gap value of Γ=0 dB. A set of discrete rates which range from 0.5 to 6 bits per symbol with intervals of 0.5, was considered for transmission over a 2×2 MIMO HSDPA system. The OVSF codes, which are precoded according to 3GPP Release 7 given in , were used as spreading sequences.
The objective of using the two experimental platforms is to cross check the system performance obtained from the Matlab simulation environment and the LabVIEW environment. A real-time channel emulator was implemented by modifying the National Instruments FPGA channel emulation software. This emulator is fed with the vectors containing the channel impulse response samples which are externally generated from power delay profiles (PDP) as specified by the standardization organizations such as ITU and 3GPP. Two industry standard profiles, known as the pedestrians A and B PDP, shown in Tables 2 and 3, were adopted in this article as specified  by the ITU organization.
The pedestrians A and B PDP correspond to the channel impulse responses taken at non-regular intervals with a resolution of 10 ns. The PDP given in the ITU specification as shown in Tables 2 and 3 can be written as
where P i is the linear power (not the logarithmic scale) at delay τ i . This PDP is sampled with a sampling rate of where T c =260 ns is the chip period. The new PDP is given as
where is the power component at the l th chip period and L is the length of the sampled PDP. is given as the sum of all power in P(t) in the time interval and such as
The pedestrians A and B channels shown in Tables 2 and 3 are re-sampled at the chip period intervals as shown in Table 4. After sampling, power is normalized so that the PDP has a unity power gain. This produces the normalized square root PDP given in a vector form as where
Two PDP sampled at chip period intervals for the pedestrians A and B channels were produced as: and at regular chip period of T c =260 ns, which corresponds to the HSDPA system operating at 3.84 Mchips/s. The pedestrian A channel has a short delay spread of 3 chip periods and the pedestrian B PDP corresponds to a delay spread of 15 chip periods. The channel impulse response samples taken at the regular chip period intervals of T c =260 ns were used in the Matlab and the LabVIEW test environments. The pedestrians A and B PDP were specifically chosen to have channel impulses, which result in short and long ISI in the detection processes. In Table 4, the pedestrians A and B PDP taken at chip period intervals are listed to generate individual impulse responses by applying complex Gaussian random variables to each coefficient of the square root of the PDP.
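The resampling and normalization steps can be sketched as follows. The sketch assumes that each path is assigned to the nearest chip instant, since the exact interval bounds are not recoverable from the text, and the delay and power values in the example are the commonly quoted ITU pedestrian A and B profiles, entered here as an assumption; Tables 2 and 3 remain the authoritative values. The function name is illustrative.

```python
import numpy as np

CHIP_NS = 260.0   # chip period used in the article

def chip_sampled_sqrt_pdp(delays_ns, powers_db, chip_ns=CHIP_NS):
    """Resample a PDP at chip-period intervals and return the
    normalized square-root PDP (unit total power)."""
    delays = np.asarray(delays_ns, dtype=float)
    linear = 10.0 ** (np.asarray(powers_db, dtype=float) / 10.0)  # dB -> linear
    taps = np.floor(delays / chip_ns + 0.5).astype(int)  # nearest chip index
    pdp = np.zeros(taps.max() + 1)
    np.add.at(pdp, taps, linear)          # sum the power falling in each chip
    pdp /= pdp.sum()                      # unity power gain
    return np.sqrt(pdp)

# Assumed ITU pedestrian A and B profiles (delay in ns, power in dB).
ped_a = chip_sampled_sqrt_pdp([0, 110, 190, 410], [0, -9.7, -19.2, -22.8])
ped_b = chip_sampled_sqrt_pdp([0, 200, 800, 1200, 2300, 3700],
                              [0, -0.9, -4.9, -8.0, -7.8, -23.9])
```

Under the nearest-chip assumption the resulting vectors have lengths 3 and 15, which agrees with the delay spreads stated above for the pedestrian A and B channels.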
Each entry in columns 2 and 3 of Table 4 corresponds to the non-zero square-root PDP coefficient for the pedestrian channel impulse response vectors and The entries and in Table 4 identify the square-root PDP coefficients for the non-zero elements of vectors and with index l+1. The PDP given in Table 4 were used to generate six sets of distinct channel impulse responses. The channel impulse response coefficients with Rayleigh distribution, corresponding to the transmissions from the MIMO transmitter i to the MIMO receiver j, are generated using two vectors and and also the relationship
and the response is then normalized. Each coefficient a l and b l for l=0,…,L−1 is drawn from a normal distribution with zero mean and unit variance. Tables 5 and 6 list six sets of MIMO impulse responses generated from the pedestrians A and B PDP, respectively, to produce results for the experimental systems. The entries in Tables 5 and 6 identify the PDP amplitudes for the non-zero elements of vectors with index l+1. These responses were used in the Matlab and LabVIEW environments to obtain a set of mean total throughput versus signal-to-noise ratio curves for the pedestrians A and B channels. It was observed that both the Matlab and LabVIEW experimental setup environments produced almost identical results.
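One way to generate such responses is sketched below. It assumes that each tap is the square-root PDP coefficient multiplied by a l + j b l, and that the normalization scales each response to unit energy; neither equation is reproduced in this text, so both forms are assumptions, and the function names are illustrative.

```python
import numpy as np

def rayleigh_impulse_response(sqrt_pdp, rng):
    """One channel realization: complex Gaussian taps shaped by the
    square-root PDP, scaled to unit energy (assumed normalization)."""
    sqrt_pdp = np.asarray(sqrt_pdp, dtype=float)
    a = rng.standard_normal(sqrt_pdp.size)    # zero mean, unit variance
    b = rng.standard_normal(sqrt_pdp.size)
    h = sqrt_pdp * (a + 1j * b)
    return h / np.linalg.norm(h)

def mimo_responses(sqrt_pdp, n_tx=2, n_rx=2, seed=0):
    """Independent responses for every transmitter i and receiver j."""
    rng = np.random.default_rng(seed)
    return {(i, j): rayleigh_impulse_response(sqrt_pdp, rng)
            for i in range(1, n_tx + 1)
            for j in range(1, n_rx + 1)}

# Example with a hypothetical three-tap square-root PDP.
H = mimo_responses([0.9, 0.0, 0.4])
```

Taps that are zero in the square-root PDP stay zero in every realization, so the sparse structure of the resampled profile is preserved.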
Results were produced for the throughput UBs and different optimization strategies for discrete rates in terms of system throughput in bits per symbol against the total SNR per symbol period per receiver antenna for 2×2 MIMO. The total received SNR is expressed in dB by using where N R =2 is the total number of receiver antennas.
For the UB throughput examination, the system value and the iterative WF UBs were simulated using the methods described in Sections 5 and 6.1, respectively. The corresponding curves for the water-filling and the system value UBs, both with and without the SIC scheme, are labeled SIC WF UB, SIC SV UB, WF UB, and SV UB. Figure 3 shows the results for the WF UBs and system value UBs for both the non-SIC and the SIC schemes for the pedestrian A channel. The proposed system value UB achieves the same system capacity as the iterative WF for the systems with and without SIC. The system value UB is therefore a good alternative to the WF UB, owing to its simplicity and its shorter processing time for calculating the system capacity. The same figure shows that the SIC UB achieves a much higher sum capacity than the non-SIC UB, especially at a high input SNR, where the total available energy is greater and the energy per channel is higher. Without SIC, a higher interference is introduced to the other parallel channels above a given total SNR, and the system capacity saturates at an asymptotic value. To improve the sum capacity, the SIC-based receiver cancels the interference corresponding to the detected symbols, starting from those which have the highest system value. As the SIC UB achieves a much higher sum capacity than the non-SIC system, it will be used as the ultimate UB when comparing the performance and improvements obtained through different optimization strategies for the rest of this section.
Discrete bit rate allocation methods based on the use of the mean and the minimum system values for the equal energy and SNR cases were simulated as described in Section 6. The corresponding curves in various figures have been labeled using SIC TG ES, SIC TG EE, TG ES, and TG EE for the systems with and without SIC. The term ES refers to the equal SNR loading case and the term EE refers to the equal energy loading case. These labels were appended with either FULL or OPT for the configurations corresponding to the systems with the full and optimum number of spreading sequences. The signature sequence ordering for a given set of total receiver SNRs was implemented using the algorithm described in Section 6.2. The optimum number of spreading sequences and also the data rates to be transmitted for the mean and minimum system value-based algorithms were calculated using the methods described in Sections 6.3 and 6.4, respectively.
The mean system value-based rate allocation requires iterative energy calculations, which were produced using the methods described for the non-SIC and the SIC-based systems, respectively, in Sections 6.3.1 and 6.3.2. Iterative energy allocation methods were used to achieve equal SINR levels at the output of the de-spreading units. For the non-SIC receiver with the equal SNR (ES)-based transmission energy allocation, the iterative power allocation stops, either when the sum difference between the current energy and the previous energy in the energy iteration loop is less than 1% of the total energy, i.e., or when the maximum number of iteration Imax is reached. The energy for each coded channel E k for the SIC ES allocation iterates until .
The processes described above were repeated for various total signal to noise ratios at the output of the de-spreading units for channels with pedestrians A and B channel PDP.
In Figure 4, the results are shown for the two-group equal SINR allocation using an optimum sub-channel selection and SIC optimization strategies, when transmitting spread signals over pedestrian A channel. The improved system for the equal SINR allocation with SIC achieves system throughputs corresponding to the curves SIC TG ES OPT, SIC TG ES FULL, and these achieved throughputs are very close to the SIC UB. It is not necessary for the SIC-based receiver to determine the optimum number of spreading sequences, when allocating equal SINR as the SIC scheme reduces these interferences. The SIC TG ES OPT scheme provides a 3-dB improvement over the transmission system with the TG ES FULL strategy. The TG ES OPT scheme, on the other hand, provides a 1.5-dB enhancement over the TG ES FULL scheme, when the total SNR is 35 dB.
Figure 5 shows the pedestrian A results for a system with the optimum number of ordered spreading sequences, the SIC receiver and the discrete bit loading method based on minimum system value. It is shown that the SIC TG EE OPT scheme has a 4.5-dB improvement over the TG EE FULL-based system before the system throughput saturates at the total SNR value of 35 dB. The use of an optimum number of ordered signature sequences at the total SNR of 35 dB results in the TG EE OPT scheme having a 2.5-dB improvement over the TG EE FULL scheme. The performance of the receiver with the SIC TG EE FULL scheme is enhanced by 3 dB over the TG EE FULL scheme using the full number of spreading sequences. It is observed that the system with the TG equal energy (EE) allocation, SIC and the optimum number of spreading sequences approaches the non-SIC system value UB. It is further noted that at the total SNR value of 35-dB a 3-dB difference is observed compared with the SIC UB before the system throughput diverges.
Figure 6 shows the simulation results corresponding to data transmitted over the pedestrian B channel. The system throughput saturates for the TG ES FULL scheme at a lower total SNR (at 30 dB) compared to the pedestrian A channel. At the total discrete data rate of 100 bps, the SIC TG ES OPT provides 7 and 4 dB improvements, respectively, over the systems with TG ES FULL and TG ES OPT schemes. At the total discrete rate of 120 bps, more than 10-dB improvement is observed when using the SIC TG EE OPT scheme with the optimum number of spreading sequences over the TG EE FULL scheme. An 8-dB improvement is achieved by using the optimum number of ordered spreading sequences. Around the total SNR value of 30 dB the SIC TG EE OPT receiver with the optimum number of channels produces a 3-dB improvement over the TG EE OPT scheme without the SIC receiver. For the pedestrian B channel, the SIC TG EE OPT scheme for the TG discrete bit loading method produces a throughput, which exceeds the throughput of the TG method TG ES OPT with the optimum number of spreading sequences. The collaborative use of the SIC scheme with the optimum number of signature sequence selection scheme achieves a system throughput close to the system value UB.
The results extracted from Figures 3, 4, 5, and 6 are tabulated for the pedestrians A and B channels as shown in Tables 7 and 8, respectively. The entries in Tables 7 and 8 express the SNRs for specific data rates together with the total discrete rates at specific signal-to-noise ratios. The SIC scheme provides higher throughputs for both pedestrians A and B channels at an SNR of 35 dB. Specific entries as shown in Table 9 are extracted from Tables 7 and 8 for achievable data rates at the SNR of 35 dB for pedestrians A and B channels. The performances for all three SIC TG ES OPT, SIC TG EE OPT, and SIC TG EE FULL schemes for the pedestrian A channel are very close to each other. The TG EE FULL scheme achieves 29.7% of the SIC TG ES OPT performance and 37.4% of the SIC TG EE OPT performance for the pedestrian B channel. On the other hand, the corresponding figures for the TG EE FULL scheme are 82% of the SIC TG ES performance and 85.8% of the SIC TG EE OPT performance for the pedestrian A channel (Table 9).
The reason the TG EE FULL scheme achieves 82 and 29.7% of the SIC TG ES performances for pedestrians A and B channels, respectively, is that the PDP lengths or delay spreads for the pedestrians A and B channels are 3 and 15 chip periods, respectively. The HSDPA system which uses the equal energy discrete bit loading method without the optimum number of spreading sequences suffers from a reduction in the total throughput compared with an HSDPA MIMO system with the optimum number of ordered spreading sequences when encountering multipath channels with PDP lengths approaching the processing gain, N, of the system. The proposed method of finding the optimum number of ordered signature sequences improves the performance of equal energy loading systems.
This article has developed and proposed algorithms which maximize the system throughput while reducing the computational cost. Reduced-complexity system value UBs are proposed, which achieve the same sum capacity as iterative WF. In terms of complexity reduction, the use of system values proposed in this article finds the rates and provides the optimum sub-channel selection before power allocation is performed. This eliminates the requirement to undertake iterative searches for the optimum bit rates combined with computationally intensive iterative power allocation for the equal SNR (ES) allocation. The optimum number of signature sequences can produce the maximum system throughput close to the system value capacity UB. The proposed SIC not only increases the system throughput, but also simplifies the covariance matrix inversion process required for both the EE and the ES allocations. The computational reduction is especially significant for the ES allocation, where iterative energy allocation is required.
It is shown that a system throughput improvement is achieved close to the SIC UB for both the pedestrians A and B channels by using the SIC-based receivers for the ES allocation. The SIC schemes with the full and optimum number of channels produce identical total rate results, when plotted against the total signal to noise ratio. It was observed that the signature sequence ordering was not essential for the equal SNR discrete bit loading algorithm. The identification of the optimum number of signature sequences for the equal energy allocation scheme significantly improves the total system throughput. The resultant scheme with the equal energy allocation, when using an SIC-based receiver with the ordered optimum number of signature sequences achieves a system throughput close to the non-SIC UB.
The mobile radio channels with a longer channel impulse response length, measured in terms of the number of chip period intervals, suffer severe sum capacity degradations relative to the system value UBs for equal energy loading HSDPA MIMO systems without the optimum number of ordered spreading sequences. The influence of the Doppler frequency on the performance of the proposed HSDPA system is currently under investigation and will be reported in future publications.
The results presented in this article confirm that the proposed optimum signature sequence selection scheme for the SIC receiver provides a significant performance improvement for the HSDPA system, as it is now possible to obtain a system throughput near the UB. The proposed schemes with HSDPA will achieve results comparable to LTE without incurring significant additional cost to modify the existing HSDPA infrastructure.
The receiver matched filter sequences , and will be used to determine the covariance matrix C. The covariance matrix C of the received signal of dimension N R (N+L−1)×N R (N+L−1) is constructed using the following equation
where ⊗ is the Kronecker product, Q e =[Q , Q1,Q2] is the extended Q matrix of dimension N R (N+L−1)×3K∗, Q1 represents the previous symbol period components and Q2 represents ISI from the next symbol period formed as
where and are the ISI components from the previous and next symbol periods; is the additive white Gaussian noise component with and is the noise variance per dimension.
Using the matrix inversion lemma, the inverse matrices and are obtained as shown below:
where the distance vectors and are formed as follows:
the weighting functions ξ,ξ1,ξ2,ξ3,ξ4, ξ5, and ξ6 are produced using
Then, the weighted energy terms ζ,ζ1, and ζ2 can be calculated as follows:
while the interim matrices Z1,Z2,Z3, and Z4 are formed as shown below:
For a given energy allocation E k for k=1,…,K, the parameters , and σ2, the matrices and , as well as the system values λ k are constructed by initializing and starting with k=1 as follows:
Calculate using (33) by finding parameters in to ξ 6, Z 1 to Z 3 and ζ 1, ζ 2 from (36) to (38).
Form in (34) with the given values for , , Z 4 and ζ.
Obtain the system value using as shown in (11).
Repeat this process from steps 1 to 3 for all selected transmission channels from k=1 to k=K∗.
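The recursive construction in steps 1 to 4 can be sketched with repeated rank-one applications of the matrix inversion lemma. The sketch assumes that the SIC covariance matrix grows as C k = C k−1 + E k (q k q k^H + q k,1 q k,1^H + q k,2 q k,2^H), starting from a noise-only matrix, and that the system value is λ k = E k q k^H C k^{-1} q k. It does not reproduce the weighting factors ξ 1 to ξ 6 or the interim matrices Z 1 to Z 4 of (33) to (38); it only applies the same lemma one vector at a time. All names are illustrative.

```python
import numpy as np

def rank_one_update(C_inv, v, energy):
    """Inverse of (C + energy * v v^H) from C^{-1} (matrix inversion lemma)."""
    d = C_inv @ v                                   # distance vector C^{-1} v
    denom = 1.0 + energy * np.real(v.conj() @ d)
    return C_inv - energy * np.outer(d, d.conj()) / denom

def sic_system_values(Q, Q1, Q2, energies, noise_var):
    """System values for k = 1..K under the assumed SIC covariance
    recursion, without any explicit matrix inversion."""
    n, k = Q.shape
    C_inv = np.eye(n, dtype=complex) / noise_var    # noise-only start
    lam = np.zeros(k)
    for j in range(k):
        for M in (Q, Q1, Q2):                       # symbol, previous, next
            C_inv = rank_one_update(C_inv, M[:, j], energies[j])
        q = Q[:, j]
        lam[j] = energies[j] * np.real(q.conj() @ C_inv @ q)
    return lam
```

The result can be checked against a direct inversion of the accumulated covariance matrix, which gives the same values to numerical precision at a higher cost per channel.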
Shannon C: A mathematical theory of communication. Bell Syst. Tech. J 1948, 27: 623-656.
Foschini G, Gans M: On limits of wireless communications in a fading environment when using multiple antennas. Wirel. Personal Commun 1998, 6: 311-335. 10.1023/A:1008889222784
Telatar E: Capacity of multi-antenna gaussian channels. Eur. Trans. Telecommun 1999, 10: 585-595. 10.1002/ett.4460100604
3GPP: Technical specification group radio access network; physical layer procedures (FDD) (Tech. Spec. 25.214 V.10.3.0). TS 25.214, 3rd Generation Partnership Project (3GPP). 2010.http://www.3gpp.org/ftp/Specs/html-info/25214.htm 
Cvitkovic M, Modlic B, Sisul G: High speed downlink packet access principles. In IEEE ELMAR, 2007. NJ: Piscataway; 2007:125-128.
3GPP: Technical specification group radio access network; Multiple Input Multiple Output in UTRA (Tech. Rep. 25.876 V.7.0.0). TR 25.876, 3rd Generation Partnership Project (3GPP). 2007.http://www.3gpp.org/ftp/Specs/html-info/25876.htm 
Mehlführer C, Caban S, Rupp M: Cellular system physical layer throughput: how far off are we from the Shannon bound? IEEE Wirel. Commun 2011, 18(6):54-63.
Mehlführer C, Caban S, Rupp M: Measurement-based performance evaluation of MIMO HSDPA. IEEE Trans. Veh. Technol 2010, 59(9):4354-4367.
Gurcan M, Ma I, Ghani A: The interference-reduced energy loading for multi-code HSDPA systems. EURASIP J. Wirel. Commun. Netw 2012, 127: 1-17.
Concha J, Ulukus S: Optimization of CDMA signature sequences in multipath channels. In IEEE VTS 53rd Vehicular Technology Conference, 2001. VTC 2001 Spring. NJ: Piscataway; 2001:1978-1982.
Visoz R, Gresset N, Berthet A: Advanced transceiver architectures for downlink MIMO CDMA evolution. IEEE Trans. Wirel. Commun 2007, 6(8):3016-3027.
Kim S, Kim S, Shin C, Lee J, Kim Y: A new multicode interference cancellation method for HSDPA system. In IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, 2009, PacRim 2009. NJ: Piscataway; 2009:487-490.
Bastug A, Slock DTM: Downlink WCDMA receivers based on combined chip and symbol level equalization. Eur. Trans. Telecommun 2005, 16: 51-63. 10.1002/ett.1031
Wrulich M, Mehlfuhrer C, Rupp M: Managing the interference structure of MIMO HSDPA: a multi-user interference aware MMSE receiver with moderate complexity. IEEE Trans. Wirel. Commun 2010, 9(4):1472-1482.
Shenoy S, Ghauri I, Slock D: Receiver designs and multi-user extensions to MIMO HSDPA. In HSDPA/HSUPA Handbook. Edited by: Furht B, Ahson SA. Boca Raton: CRC Press; 2010:89-109.
He J, Gu G, Wu Z: MMSE interference suppression in MIMO frequency selective and time-varying fading channels. IEEE Trans. Signal Process 2008, 56(8):3638-3651.
Mehlfuhrer C, Caban S, Wrulich M, Rupp M: Joint throughput optimized CQI and precoding weight calculation for MIMO HSDPA. In IEEE 42nd Asilomar Conference on Signals, Systems and Computers. NJ: Piscataway; 2008:1320-1325.
Cui T, Lu F, Sethuraman V, Goteti A, Rao S, Subrahmanya P: Throughput optimization in high speed downlink packet access (HSDPA). IEEE Trans. Wirel. Commun 2011, 10(2):474-483.
Papandreou N, Antonakopoulos T: Bit and power allocation in constrained multicarrier systems: the single-user case. EURASIP J. Adv. Signal Process 2008., 2008: http://dx.doi.org/10.1155/2008/643081
Yu W, Rhee W, Boyd S, Cioffi J: Iterative water-filling for Gaussian vector multiple-access channels. IEEE Trans. Inf. Theory 2004, 50: 145-152. 10.1109/TIT.2003.821988
Jindal N, Rhee W, Vishwanath S, Jafar S, Goldsmith A: Sum power iterative water-filling for multi-antenna Gaussian broadcast channels. IEEE Trans. Inf. Theory 2005, 51(4):1570-1580. 10.1109/TIT.2005.844082
Kaya O, Ulukus S: Optimum power control for CDMA with deterministic sequences in fading channels. IEEE Trans. Inf. Theory 2004, 50(10):2449-2462. 10.1109/TIT.2004.834747
Kaya O, Ulukus S: Ergodic sum capacity maximization for CDMA: Optimum resource allocation. IEEE Trans. Inf. Theory 2005, 51(5):1831-1836. 10.1109/TIT.2005.846413
Shi S, Schubert M, Boche H: Downlink MMSE transceiver optimization for multiuser MIMO systems: duality and sum-MSE minimization. IEEE Trans. Signal Process 2007, 55(11):5436-5446.
Shi S, Schubert M, Boche H: Rate optimization for multiuser MIMO systems with linear processing. IEEE Trans. Signal Process 2008, 56(8):4020-4030.
Bergman S, Palomar D, Ottersten B: Joint bit allocation and precoding for MIMO systems with decision feedback detection. IEEE Trans. Signal Process 2009, 57(11):4509-4521.
Tenenbaum A, Adve R: Minimizing sum-MSE implies identical downlink and dual uplink power allocations. IEEE Trans. Commun 2011, 59(3):686-688.
Shen H, Li B, Tao M, Wang X: MSE-based transceiver designs for the MIMO interference channel. IEEE Trans. Wirel. Commun 2010, 9(11):3480-3489.
3GPP: 3rd Generation Partnership Project; Technical Specification Group Radio Access Network; High Speed Downlink Packet Access: UE Radio Transmission and Reception (TR 25.890 V1.0.0). TR 25.890, 3rd Generation Partnership Project (3GPP). 2002.http://www.3gpp.org/ftp/Specs/html-info/25890.htm 
During the development of the experimental apparatus, several divisions of National Instruments (NI) in the USA, UK, and Europe loaned NI PXIe-based 2×2 MIMO transceivers to enable Imperial College London to develop the MIMO channel emulator. National Instruments also supplied a copy of their channel emulation software, which was modified by Imperial College London to operate with the experimental apparatus. We acknowledge the technical support given by the National Instruments team, including James Kimery, Robert Morton, Yiannis Pavlou, Ben Lavasani, David Baker, Trang Nguyen, Erik Luther, Jaeweon Kim, Ian Wong, and Ahsan Aziz.
The authors declare that they have no competing interests.