Open Access

Interference alignment for a multi-user SISO interference channel

  • Yasser Fadlallah1Email author,
  • Karine Amis1,
  • Abdeldjalil Aïssa-El-Bey1 and
  • Ramesh Pyndiah1
EURASIP Journal on Wireless Communications and Networking20142014:79

Received: 16 October 2013

Accepted: 13 April 2014

Published: 15 May 2014


Our work addresses the single-input single-output interference channel. The goal is to show that although interference alignment is suboptimal in the finite power region, it is able to achieve a significant overall throughput. We investigate the interference alignment scheme proposed by Choi et al. (IEEE Commun. Lett. 13(11): 847-849, 2009), which achieves a higher multiplexing gain at any given signal dimension than the scheme proposed by Cadambe and Jafar (IEEE Trans. Inform. Theory 54(8), 2008). Then, we try to modify the IA design in order to achieve enhanced sum-rate performance in the practical signal-to-noise ratio (SNR) region. Firstly, we introduce a way to optimize the precoding subspaces at all transmitters, exploiting the fact that channel matrices in the interference model of a single-input single-output channel are diagonal. Secondly, we propose to optimize jointly the set of precoder bases within their associated precoding subspaces. To this end, we combine each precoder with a new combination precoder, and this latter seeks the optimal basis that maximizes the network sum rate. We also introduce an improved closed-form interference alignment scheme that performs close to the other proposed schemes.


Interference channelSingle input single outputOptimization problemPrecodingInterference alignment

1 Introduction

In most existing wireless multi-user communication systems, interference is avoided either by coordinating the users to orthogonalize the channel access or by treating interference from other transmitters as noise. However, until recently, the capacity region of the interference channel (IC) remained unknown, except for some special cases such as strong and very strong interference [1, 2]. In [3], Maddah-ali et al. have proposed a new approach in order to show that the N-antennas MIMO X channels can offer as much as 4 N 3 degrees of freedom (DoF). This new approach of interference management has been named IA.

The key idea of IA is to jointly design all transmitted signals such that interfering signals at each receiver overlap and remain distinct from the desired signal. This approach has been exploited by Cadambe and Jafar in [4]. The authors have shown that the maximum achievable DoF in the K-user time-varying single-input single-output (SISO) IC, in the n dimensional Euclidean space, is K 2 and is achieved, thanks to an IA scheme. Later on, Motahari et al. have addressed the achievable DoF of a quasi-static IC. They have extended the idea of IA from space/time/frequency dimensions to the signal level dimensions and have shown that based on the field of Diophantine approximation in number theory [5], the interference can be aligned in the rational spaces, achieving a maximum DoF of K 2 .

The first IA scheme for SISO transmissions has been proposed in [4] for the time/frequency-varying channel. This scheme has been designed to achieve the asymptotic capacity in the IC, i.e., when both the signal-to-noise (SNR) and the signal dimensions tend to infinity. In contrast, Choi et al. have introduced another IA design that aims to achieve a higher multiplexing gain at any given signal dimension [6]. In this paper, we adopt an IA scheme for SISO transmission, and we try to modify the design in order to achieve higher sum-rate performance in the practical SNR region. Most references, among which [79], deal with IA schemes for MIMO interference channels. However, all mobile communication standards still include a SISO transmission mode as for instance the LTE downlink transmission mode 1. This is the reason why in this paper, we try to define efficient IA schemes for SISO interference channels.

In our contributions, we firstly introduce a way to optimize the precoding subspaces at all transmitters, exploiting the fact that channel matrices in the IA model are diagonal. Two solutions are derived; the first is achieved iteratively using projected gradient descent method, the second is a closed-form solution that avoids the numerical computation, thus, resulting in a very low computational complexity. Secondly, we propose to optimize the precoding vectors at each transmitter within its precoding subspace. To this end, we combine each IA precoder with a new combination precoder. The combination precoder seeks the optimal basis that maximizes the network sum-rate assuming an individual transmit power constraint. However, a closed-form solution does not seem trivial. Therefore, we apply an iterative process based on the simple gradient descent method, which converges to a local maximum due to the non-concavity of the objective function.

This paper is organized as follows. Section 2 describes the system model. Then, Section 3 presents the IA design in SISO IC. In Section 4, we propose to optimize the network sum rate through a diagonal matrix W. The precoding vector optimization within the IA subspaces is presented in Section 5. In Section 6, we present the convergence rate of the proposed iterative algorithms. Section 7 evaluates the sum-rate performance of the proposed optimization. Finally, Section 8 concludes the paper.

Notations: boldface upper case letters and boldface lower case letters denote matrices and vectors, respectively. For the transpose, transpose conjugate, and conjugate matrices, we use (.) T , (.) H , and (.), respectively. I p is the p×p identity matrix, and 1 p is the all-one vector of length p.

2 System model

Let us assume a K-user SISO IC with K transmitter-receiver pairs. A wireless channel links each receiver to each transmitter, but a given transmitter intends to have its signal decoded by a single dedicated receiver only. Let us denote by s j the length d j symbol vector to be transmitted by the j th user. As shown in Figure 1, s j is precoded by applying an N×d j precoding matrix yielding the length N vector x j . We assume a frequency-selective channel, and the inter-symbol interference is perfectly managed using an orthogonal frequency division multiplexing (OFDM) transmission scheme. The number of subcarriers is equal to N, and all components of x j are simultaneously transmitted on a one-component-per-subcarrier basis. The received signal at the k th receiver can be modeled as
Figure 1

Interference channel model.

y k = j = 1 K H kj V j x j + z k , k K ,
where K = { 1 , , K } is the set of all users, H kj â„‚ N × N is the diagonal channel matrix between the j th transmitter and the k th receiver, V j â„‚ N × d j is the precoding matrix of the j th transmitter. The j th transmitted information x j is defined as a d j ×1 vector belonging to a Gaussian continuous constellation. z k is the N×1 circularly symmetric complex Gaussian noise vector at the receiver k, with independent and identically distributed (i.i.d.) components, i.e. z k N c ( 0 , σ 2 I N ) . We also consider the following hypotheses in this paper:
  1. 1.

    Users do not cooperate.

  2. 2.

    Non-precoded user symbols are Gaussian continuously distributed and mutually independent.

  3. 3.

    The set of channel matrices H k j is entirely and perfectly known at all transmitters and all receivers.

  4. 4.

    All diagonal components of H kj k , j K are i.i.d. and continuously distributed, with absolute values upper-bounded with a finite value.

The maximum achievable DoF in the K-user SISO IC is equal to [4, 5]
lim snr C ( snr ) log 2 ( snr ) = K 2 ,

where C(snr) represents the channel capacity.

3 IA design in a SISO interference channel

3.1 Precoding design

The essence of the IA scheme is to design the transmit beamforming matrices in a way that the interference-free stream number at each receiver is maximized. The IA design conditions have been defined as follows:
rank U k H kk V k = d k , U k H kj V j = 0 , j k ,

where U k is the decoding matrix at the k th receiver. In other words, the desired signal belongs to the subspace generated by the vectors of G k =U k H k k V k , while the interference is completely eliminated. The feasibility of the linear system in (3) is conditioned to the following proprieties: (i) the linear system has to be proper, i.e., the number of variables is more than or equal to the number of equations and (ii) the linear system has to be generic [10]. In some particular cases, the genericity is satisfied by providing a channel matrix with random and independent coefficients.

Figure 2 illustrates the IA principle in the case of three users sharing the same resources to communicate. Each transmitter has one symbol to transmit to its dedicated receiver. All channel coefficients are supposed to be known at all transmittersa. In order to achieve the IA linear precoding design, each transmitter k transmits its symbol in the direction of a two-dimensional vector v k , and the precoding vectors at all transmitters are designed in such a way that at all receivers the vectors carrying the two interfering symbols are aligned and linearly independent of the vector that carries the desired symbol. Then, the interference is eliminated by a simple projection on the interference null space.
Figure 2

Three-user SISO interference channel with IA scheme.

One precoding design that provides IA at all receiver nodes and fulfills the conditions in (3) in the SISO interference channel is proposed by Choi et al. as [6]
V 1 = k , l K 1 , k l , ( k , l ) ( 2 , 3 ) T 23 1 T kl n kl | k , l K 1 , k l , ( k , l ) ( 2 , 3 ) n kl m + 1 , V 3 = T 23 1 k , l K 1 , k l , ( k , l ) ( 2 , 3 ) T 23 1 T kl n kl | k , l K 1 , k l , ( k , l ) ( 2 , 3 ) n kl m , V j = H 1 j 1 H 13 V 3 , T kl = H k 1 1 H kl H 1 l 1 H 13 .
where m is any non-negative integer which defines the number of transmitted symbols and the length of the precoding vectors, and T k l is an N×N diagonal matrix. In the IA design described previously, the achievable DoF per user can be obtained using the following combinations:
d 1 = m + M + 1 M and d 3 = m + M M ,

where M is a parameter depending on the user number, M=(K−1)(K−2)−1, d i is the DoF of the i th user, i.e., the number of transmitted symbols, and N the number of symbols in each IA vector is defined as N=d1+d2. In the particular scheme previously, the IA conditions can be satisfied by providing d i = d 3 , d 1 > d 3 , i K 1 , 3 . For example, in a three-user SISO multi-user IC, we have d1=n+1, d2=d3=n,N=2n+1, and n can be any non-negative integer.

3.2 Linear decoding design

In the aforementioned transmission model, the received signal given in (1) can be rewritten as
y k = H ̄ k k s k Desired subspace + j k H ̄ j k s j Interference subspace + z k ,

where H ̄ j k = H kj V j . We assume that the IA conditions are satisfied. Let H ̄ I k â„‚ N × ( N d k ) denotes the Nd k matrix spanning all interference subspaces, i.e., H ̄ j k for j{1,,K}, jk are all spanned by H ̄ I k . Before going further into the description, we introduce the following lemma:

Lemma 1

Let A 1 â„‚ N × n and A 2 â„‚ N × n ( N > n ) , where rank (A1)=n and rank (A2)=m,(mn) and span (A2)span(A1). Then, for every s 2 â„‚ n × 1 , s 1 â„‚ n × 1 such that A1s1=A2s2.

Using Lemma 1 at the receiver k, the interference subspace from the j th transmitter can be expressed in terms of H ̄ I k as
H ̄ j k s j = H ̄ I k s jI , where s jI = H ̄ I k H H ̄ I k 1 H ̄ I k H H ̄ j k s j .
Substituting (6) into (5) yields
y k = H ̄ k k s k + H ̄ I k s ̄ k + z k = H ̄ k k H ̄ I k [ s k T s ̄ k T T + z k = B k s ~ k + z k ,

where s ̄ k = s 1 I + + s ( k 1 ) I + s ( k + 1 ) I + + s KI , B k â„‚ N × N is a full rank matrix that spans the union of the desired and the interference subspaces, and s ~ k is the N×1 vector consisting of the d k desired streams and the Nd k interference streams. Equation 7 gives the mathematical formulation of a linear determined decoding problem, where an N length source data vector s ~ k is mixed by a constant mixing matrix B k to produce a vector y k of N observations. Such a decoding problem can be resolved using classical criteria such as zero-forcing (ZF), minimum mean square error (MMSE), and maximum likelihood (ML).

4 IA precoding subspaces optimization

In this section, we aim to optimize the IA precoding subspaces in the scheme described above. From (3), it can be noted that the modified precoding matrices defined as
V k = W V k IA k K ,

where V k IA is the original matrix derived with respect to the IA conditions and W is any diagonal matrix, which satisfy the IA conditions. That is, the projection of all precoding matrices of the IA scheme on a common diagonal matrix W keeps the IA conditions respected.

The precoding subspaces can be optimized by judiciously selecting the components of W in (8). This diagonal matrix W determines the interference and the desired subspaces design, while maintaining the IA conditions at the receivers. We assume both MMSE- and ZF-based detection schemes, widely used due to the simplicity of their implementation, and we derive two different optimized designs that maximize the network sum rate in both cases.

4.1 MMSE-based decoder - iterative solution

Assuming an MMSE decoder, the mutual information between the k th transmitter and its intended receiver k can be expressed as [11]
R k ( w ) = log 2 I N + p j H kj W V j V j H W H H kj H I N + p j k H kj W V j V j H W H H kj H ,
where p is the user average transmit power over the average noise power assumed equal at all receivers. Using the Sylvester’s determinant theorem [12], the fact that all channel matrices are diagonal, and the definition of B k and A k as
B k = j = 1 , j k K H kj V j V j H H kj H A k = B k + H kk V k V k H H kk H ,
(9) can be reformulated in the following compact form as
R k ( w ~ ) = log 2 I N + p W ~ A k I N + p W ~ B k ,
where W ~ = W H W is a diagonal matrix and its diagonal is defined as the vector w ~ with positive components ( w ~ i i ). One can notice from (10) that matrices A k and B k are written as the sum of semi-definite positive matrices and hence are positive semi-definite. Consequently, the Cholesky decompositionb can be applied [13], and the matrices A k and B k can be rewritten as
A k = L A k H L A k with L A k H = L B k H H kk V k and B k = L B k H L B k .
Substituting (12) into (11) yields the k th user rate
R k ( w ~ ) = log 2 I N + p L A k W ~ L A k H I N + p L B k W ~ L B k H .
Our goal is to seek w ~ that maximizes the total mutual information (i.e., network sum rate) in the IC under the constant total transmit power linear constraint. The maximization problem is then defined as
arg max w ~ 1 N k = 1 K log 2 I N + p L A k W ~ L A k H I N + p L B k W ~ L B k H ,
subject to the total transmit power constraint
k = 1 K tr W ~ V k V k H = K , w ~ i 0 , i 1 , , N .

It is not obvious whether a closed-form solution can be obtained or not; therefore, one can search for the solution iteratively. However, the convergence towards the global maximum is not guaranteed unless the objective function is concave. The proof of the concavity with respect to the variable vector w ~ , requires the objective function to be twice differentiable and its Hessian matrix to be negative semi-definite [14]. Indeed, a similar problem has been treated in [15] for the three-user IA scheme. The authors have demonstrated that a function having the form of (14) is concave if A k and B k are defined as in (12) (see Appendix 1 in [15]). In order to find the solution that achieves the optimum, we propose to use the projected gradient method with an optimized variable step size (details are given in Appendix 2). Other algorithms can also be used such as simple gradient descent method using Lagrange multipliers.

Remark 1

In [15], the diagonal matrix W has been optimized under an MMSE receiver assumption. However, the study has been restricted to the three-user SISO interference channel case. Also, there is no description of the iterative algorithm used for achieving the solution. In this section, we have provided a general design for the K-user SISO interference channel case. We have also described an iterative method (see Appendix 2) to converge towards the solution.

4.2 ZF-based decoder - closed-form solution

In the previous subsection, we have proposed to optimize the precoding subspaces using iterative processing when an MMSE is applied at the receiver. In this section, we apply a ZF criterion at the receiver. Then, we propose a closed-form solution for w that is asymptotically optimal. This solution is obtained from the network sum-rate maximization problem approximation for very high SNR and under the hypothesis of a ZF applied at all receivers. It also avoids the need for a numerical solver that requires a matrix inversion at each iteration and increases the processing time and computational cost.

Assuming a ZF criterion at all receiver nodes and an IA design at all transmitters, the mutual information between the k th transmitter and its intended receiver k is expressed as
R k = log 2 | I N + p U k H kk W V k V k H W H H kk H U k H |
where U k is the interference canceler at the k th receiver. Assuming well-conditioned channel matrices and using Sylvester’s determinant theorem [12], the k th user rate can be approximated for high SNR values by
R k SNR 1 log 2 p H kk W V k V k H W H H kk H U k H U k .
Now, we intend to maximize the sum-rate approximation k R k with respect to w under the total transmit power constraint. Using the following equivalence:
arg max w ~ k = 1 K R k ( w ~ ) arg max w ~ W ~ K k = 1 K H kk V k V k H H kk H U k H U k
and the fact that k = 1 K | H kk V k V k H H kk H | | U k H U k | is positive, the optimization problem in (17) can also be reduced to
arg max w ~ | W ~ | , subject to k tr W ~ V k V k H = KN , w ~ i > 0 i.
We notice that the problem in (18) is independent from the channel matrices, and the objective function is a simple determinant of a diagonal matrix, hence, a concave function. Introducing the Lagrange multiplier λ, the convex dual of this problem is formulated as follows [14]:
arg max w ~ arg min λ L ( w ~ , λ ) .
where the Lagrangian function is defined as
L ( w ~ , λ ) = W ~ λ tr k W ~ V k V k H KN , with tr k W ~ V k V k H = i = 1 N c i w ~ i , and c i = k = 1 K v ki 2 ,
and v k i stands for the i th row of the matrix V k . Since the objective function is concave, the Karush Kuhn-Tucker (KKT) conditions are sufficient to determine the global optimum. The KKT conditions of the problem in (19) are given by
w ~ L ( w ~ , λ ) | w ~ = w ~ = 0 , λ > 0 , i = 1 N c i w ~ i = K N
The solution of the linear problem in (21) is obtained as
w ~ i = K c i , i 1 , , N .

Hence, the components of w are obtained as w i = w ~ i for all i. It is worth noting that beside maximizing the sum-rate, the problem of maximizing the individual rate using the approximation in (16) has the same solution obtained in (22).

A major advantage of the proposed solution is the fact that it has an analytic simple expression making its implementation complexity very low. Indeed, the other algorithms proposed for sum-rate maximization and interference power minimization in SISO and MIMOc transmissions achieve the optimum using singular value decomposition (SVD) [11] and/or an iterative algorithm that requires hundreds to thousands iterations to converge [79, 16].

4.3 Complexity and sum-rate performance

The computational complexity is a major bottleneck of practical implementation that is considered in system designs. In the following, we discuss the complexity of the precoding schemes proposed above.

The first optimized design that maximizes the sum-rate assuming an MMSE detector is obtained using the projected gradient descent method. This iterative method requires at each iteration the computational cost of the first-order derivative of the objective function. Looking at the expression given in (34), one can notice that the derivative is calculated using matrix multiplications and matrix inversions with dimensions N×N. Therefore, the computational complexity at each iteration can be considered of order O N 3 . On the other hand, the design based on a closed-form solution requires the computation of a Frobenius norm and N real division. Thus, the complexity order is O ( N ) .

Figure 3 illustrates the average sum-rate per dimension performance of the non-optimized design proposed by Choi et al. in [6] and the optimized designs W-MMSE and W-ZF, which represent the designs based on the solutions to the problems given in (14) and (18), where we optimize the precoding subspaces of the IA scheme assuming an MMSE and a ZF, respectively. The optimizations are applied to the IA design proposed by Choi et al. It can be observed that for N=3 and N=7, the W-MMSE design outperforms the W-ZF design with the closed-form solution in the low SNR region. However, when the SNR becomes very high, the sum-rate performance for both designs get very close. This is due to the validity of the approximation in (16) for high SNR. On the other hand, the resulting gain of the optimized designs over the nonoptimized one is significant, e.g., at 20dB the gain is between 4 and 5 bits/s/Hz/dim.
Figure 3

Average sum rate per dimension of the two proposed designs for subspace improvement when N =3 and N =7.

It is important to note that the proposed designs result from the optimization of the original designs proposed in [6] and not the optimal IA design that maximizes the sum-rate. It explains why, when we compare the designs for different dimensions N, a higher sum-rate is obtained for N=3 compared to the design for N=7. Furthermore, in order to have a fair comparison between the cases N=3 and N=7, the average sum rate performance is normalized by the dimension N, which means that the sum rate is divided by the number of symbols in one symbol vector as shown in (14).

5 Precoding vectors design within IA subspaces

The previous section has addressed the optimization of the IA precoding subspaces at once using a diagonal matrix W. However, there was no claim for the optimality of the precoding vectors within IA subspaces. In this section, we propose to maintain the IA subspaces design at the transmitters, and we aim to optimize the precoding vectors within each subspace. We consider both cases: MMSE and ZF criterion at the receiver, and we attempt to maximize the network sum-rate in each case.

The precoding matrices defined in (4) are of size N×d k with N>d k , k. Thus, introducing a new combination matrix C k â„‚ d k × d k at each transmitter node as follows
P k = V k C k , k K ,

will modify the basis of V k within its own subspace without modifying the subspace itself. These variables can later be defined taking into account different criteria such as MSE, BER, sum-rate, and average transmit power. Next, we show how to optimize the additional combination matrices so as to maximize the network sum-rate.

5.1 MMSE-based decoder

Assuming an MMSE at all receivers, the mutual information between the k th transmitter and its dedicated receiver k can be written as a function of the combination matrices C k k as follows
R k = log 2 I N + p j = 1 K H kj P j P j H H kj H I N + p j k K H kj P j P j H H kj H .
Now, in order to maximize the sum-rate under the individual transmit power constraint, we propose the following maximization problem
arg max C k , k K 1 N k = 1 K log 2 I N + p j = 1 K H ̄ kj C j H ̄ kj C j H I N + p j k K H ̄ kj C j H ̄ kj C j H subject to tr V k C k C k H V k H = N , k K.
where H ̄ kj = H kj V j . It is well-known that the optimal solution is the one that nullifies the gradient of the sum-rate expression. However, a closed-form solution is not obvious due to a complicated expression of the first-order derivative as shown in (32) in Appendix 1. Therefore, we attempt to get close to the solution iteratively. We use an iterative algorithm that optimizes the cost function with respect to one variable while the others remain fixed. In our reasoning, each variable is considered as one of the precoding matrices. This technique results in a nonconvex optimization due to the dependence between the precoding matrices. At each iteration, the optimization is based on the gradient descent widely used in MIMO multi-user channels. The iterative algorithm for the sum-rate maximization is detailed in Algorithm 1.
In this algorithm the gradient is defined in (32) in Appendix 1, f describes the objective function given in (25), and the precoding matrices are supposed to be of unit Frobenius norm. The step size λ is updated using the backtracking search, which is an effective and quite simple method [14]. Despite the non-convexity of the multi-variable objective function, as long as the variable is steered in the gradient direction, the algorithm converges to a local maximum. In our simulations, the convergence of this iterative algorithm is supposed to be achieved either when
k C k ( l ) R < ε

or when a maximum number of iterations is reached, and ε is defined as a tolerance value. In our simulations, we assume ε=10−2.

5.2 ZF-based decoder

Given the k th user rate, the ZF-based detector uses a matrix U k to cancel the interference, yielding an equivalent d k ×d k MIMO transmission model. Many options exist to find the best family of combination matrices {C j } in order to maximize the sum-rate. The channel model after interference suppression at receiver k is obtained as
y k = U k H kk V k C k x k + U k z k , = H ~ k C k x k + U k z k ,

where U k is the decoding matrix at the k th receiver. It is defined as the d k ×N interference null space. The model defined in (27) is a typical MIMO single user model with channel matrix H ~ k and precoding matrix C k . One optimized form of C k is the one composed of the right singular vector of the new channel matrix H ~ k . Such a precoding scheme achieves the channel capacity as described in [17]. Another form that requires less computational complexity is the one that orthonormalizes the columns of the original precoding matrix V k . In [18], the authors have shown that this last form gets close to the maximum information rate when the SNR becomes high.

5.3 Complexity and sum-rate performance

The algorithm that optimizes the solution iteratively in Subsection 5.1 is based on the gradient descent method. At each iteration, the iterative algorithm requires the gradient of the objective function that needs itself inversion of N×N full rank matrices. Thus, the total computational complexity depends mainly on the number of iterations and on the precoding matrices dimensions. The complexity cost is of order O ( nb i N 3 ) where n b i is the number of iterations.

Figure 4 illustrates the sum-rate per dimension of the proposed design that uses Algorithm 1 and the design OW-ZF that orthonormalizes the precoding vectors. For the subspace optimization of OW-ZF, we use the closed form solution derived in Subsection 4.2. One can observe a sum-rate performance loss in the case of OW-ZF compared to the CW-MMSE for low SNR values, whereas when the SNR increases, both sum rates become very close. However, the OW-ZF requires less operations, the complexity cost is of order O ( N d k 2 ) at each transmitter, and no joint processing is required for the optimization design.
Figure 4

Average sum rate per dimension of the proposed precoding vectors design using the combination matrices. When N=3 and N=7.

Remark 2.

The proposed optimization of IA precoding subspaces cannot directly be extended to MIMO interference channels, as the channel matrices are no longer diagonal in the MIMO system model. However, the proposed optimization of the precoding vectors within each IA subspace can be used for MIMO optimization design since the IA conditions at all receiver are always maintained.

6 Convergence rate of the iterative solutions

In Subsections 4.1 and 5.1, we have proposed two iterative solutions, one aims to optimize the IA subspaces and the other optimizes the precoding vectors within each IA subspace without modifying the subspace itself.

The first iterative solution to the problem in (14) for the IA subspaces optimization is reached using the projected gradient method. We have mentioned that the objective function is concave, thus, the convergence towards the global optimum is guaranteed. On the other hand, the iterative solution proposed for the IA precoding vector optimization is reached using an algorithm based on the gradient descent method for a multi-variable objective function. Thereby, the objective function changes at every iteration yielding a nonconvex optimization problem. However, as long as the iterative method is based on the gradient descent and the variable follows the direction of the gradient using an optimized step size, a convergence towards a local optimum is guaranteed.

The convergence rates of the discussed iterative solutions above are shown in Figure 5. For the projected gradient descent method, the convergence towards either the optimal solution or a neighboring optimal solution requires hundreds of iterations. This slow convergence rate diverts the attention to the closed-form solution obtained in Subsection 4.2. Now, looking at the design for precoding vector optimization within their subspace, the convergence rate seems fast. For example, almost 10 to 15 iterations are required to achieve a near-optimal value at 15 and 25 dB when N=7. This fast convergence implies that for small dimensions and for precoding vector optimization as discussed in Subsection 5.2, the computational complexity is not that far from other proposed designs.
Figure 5

Convergence of the iterative algorithm in Section 5.1and the projected gradient method in Subsection 4.1.

7 Comparison of the proposed optimized designs to the state of art schemes

In this section, we compare the proposed designs to the distributed designs proposed in [7, 8] in terms of sum-rate per dimension. We consider a three-user frequency-selective SISO IC, with the model proposed in Section 2. The total independent stream number from all users is equal to T=3n+1, and the precoding vectors length is N=2n+1 for all users, and n can be any nonnegative number. The transmit constellation is Gaussian continuously distributed, and the channel coefficients are circularly symmetric complex Gaussian distributed with zero mean and unit variance. The following abbreviations are used for the compared designs:
  • ? OW-ZF : the proposed IA design with the closed-form solution derived in Subsection 4.2 that uses orthogonal precoding vectors

  • ? CW-MMSE: the IA design with the two iterative proposed optimization in Subsections 4.1 and 5.1

  • ? IA-Iter: the IA design obtained with the distributed algorithm proposed in [8]

  • ? Max-SINR: the beamforming design proposed in [7] that maximizes the signal-to-interference-and-noise ratio (SINR) of all streams

Figures 6, 7 and 8 illustrate the average sum-rate per dimension performance of the OW-ZF, the CW-MMSE, the IA-Iter, and the Max-SINR for N=3, N=5, and N=7, respectively. Compared to the CW-MMSE, the OW-ZF performs similarly for all configurations with a slight loss for low and medium SNR values. This is due to the fact that a ZF criterion becomes equivalent to an MMSE when the SNR becomes very high, and that the OW-ZF design is based on an approximation for high SNR. On the other hand, an important gain is obtained over the IA-Iter design over the whole SNR region when N=3 and N=5, e.g., at 20 dB a gain of about 2.2 to 2.5 bits/s/Hz and 1.7 to 2 bits/s/Hz is obtained for N=3 and N=5, respectively. It is worth noting that in addition to this gain, the OW-ZF design is a closed-form, thus, it exhibits a much less computational complexity than the other designs. It also does not require any iterative processing to achieve the solution, which can sustain the complexity order when N increases. Now, considering the beamforming optimization design that maximizes the SINR referred to as Max-SINR, this latter outperforms the proposed designs in the low and medium SNR region. However, this resulting gain decreases as the SNR increases in the medium to high SNR region. For example, the OW-ZF design and the Max-SINR design reach the same sum-rate value of about 10.4 bits/s/Hz at 30 dB when N=3. This result can show that in some particular cases, the proposed designs are very close to one of the most efficient designs when the SNR is high enough while keeping a low complexity level such as the OW-ZF design.
Figure 6

Comparison of the average sum rate per dimension for different precoding designs when N =3.

Figure 7

Comparison of the average sum rate per dimension for different precoding designs when N =5.

Figure 8

Comparison of the average sum rate per dimension for different precoding designs when N =7.

On the other side, compared to the orthogonal interference mitigation techniques such as time division multiple access (TDMA), the optimized IA scheme (i.e., CW-MMSE) achieves similar performance in the low SNR region. However, when the SNR increases, i.e., beyond 15 dB, the gain between both optimized IA designs (i.e., OW-ZF and CW-MMSE) and the TDMA starts to get wider, and achieves 2 bits/s/Hz/dim over the TDMA at 30 dB. This is due to the suboptimality of the IA design in the low SNR region and its tendency to optimality in the high SNR region [4].

Next, Figure 9 evaluates the performance of the following designs: OW-ZF, IA-Iter, and Max-SINR, as a function of the precoding vectors sizes. At 15 and 30 dB, the OW-ZF outperforms the IA-Iter for N≤9 and N≤11. On the other hand, it can be observed that the two iterative designs IA-Iter and Max-SINR result in an increasing sum-rate with the vector sizes. However, the closed-form design OW-ZF results in a decreasing sum-rate with the vectors sizes. This means that OW-ZF is close to the optimal for small precoding dimensions and starts moving away when N increases.
Figure 9

Evolution of the average sum rate per dimension with the precoding vector length.

8 Conclusions

In this paper, we have introduced three optimized designs for the IA scheme in a K-user SISO IC. The first and the second try to optimize the precoding subspaces at the IA transmitters through a common diagonal matrix assuming an MMSE and ZF linear detector, respectively. The third assumes an MMSE linear detector and seeks the optimal precoding vectors within a predefined subspace at each transmitter. The first and the third designs referred to as W-MMSE and C-MMSE, respectively, require iterative algorithms to converge to their optimum, whereas the second design referred to as W-ZF is obtained from a closed-form solution. Comparing to other IA distributed designs, the proposed designs show a significant sum-rate performance improvement and much less computational complexity when the closed-form solution is applied. To enhance the sum-rate performance, we have introduced an orthogonalization of the precoding vectors in the W-ZF design, which enables to achieve a trade-off between complexity and data rate.


a This hypothesis is very optimistic, but it is taken by many research works in the literature.

b It is important to note that the Cholesky decomposition, originally defined for a positive definite matrix, can be extended to the positive semi-definite case.

c The IA schemes proposed for MIMO transmission can also be used in SISO systems.

Appendix 1

Using the k th information rate expression in (24), the sum rate can be written as
R k = 1 K log 2 X k log 2 Y k
X k = I + p j = 1 K H ̄ kj C j H ̄ kj C j H , and Y k = I + p j k K H ̄ kj C j H ̄ kj C j H
Since the sum-rate is real valued function and C k k are complex variables, the gradient of the sum-rate can be calculated using the differential with respect to C k . It is known to be d R = 2 ∂R / C k . Details are given in [19]. Using the differential of log2|X k | computed as
d log 2 X k = trace X k 1 d X k , and d X k = p H ̄ kj C j d C j H H ̄ kj H ,
Using the following properties: trace (A dB H )=t r a c e(A T dB), d [trace(A)]=trace(dA), vec(d A)=dvec(X), and trace (A T B)= vec(A) T )vec(B), and referring to [19] that describes the first-order differential and the Jacobian matrix properties, we obtain
d log 2 X k = 2 p ln 2 vec H ̄ kk H X k 1 H ̄ kk C k T . vec d C k .
Thus, the gradient of R w.r.t. C k is obtained as follows:
C k ( l ) R = 2 p ln 2 i = 1 K H ̄ ik H X i 1 H ̄ ik C k 2 p ln 2 i k K H ̄ ik H Y i 1 H ̄ ik C k .

Appendix 2

The projected gradient algorithm requires firstly the computation of the gradient with respect to w ~
∂R ( w ~ ) w ~ i = k = 1 K X ki Y ki ,
where X k i and Y k i are defined as
X ki = p N l Aki I N + p L Ak W ~ L Ak H 1 l Aki H
Y ki = p N l Bki I N + p L Bk W ~ L Bk H 1 l Bki H
with l A k i and l B k i are the i th rows of the matrices L A k and L B k , respectively. The constraint, defined in (5), can be formulated as
k = 1 K tr V k W ~ V k H = i = 1 N w i ~ c i ,
with c i is the i th component of the vector c, c i = k | | v ki | | 2 , and v k i is the i th row of the matrix V k .Equation 35 defines the set of w ~ that satisfies the constraint, thus, given the gradient, we project it on the constraint hyperplane and update w ~ by
w ~ l + 1 = w ~ l + μ. p proj w ~ l ,
where μ is a variable step size and p ( w ~ ) is the projected gradient defined as
p proj w ~ l = w R w ~ l c t w R w ~ l . c c 2 .
The convergence towards the steady state is achieved either when
p w ~ l < ε ,

with ε is the tolerance factor for stopping the iterations or a maximum number of iterations is attained. In this algorithm, the step size μ is a determining factor to ensure a faster convergence, thus, it must be judiciously selected. In [14], two line search methods are proposed: exact line search and inexact line search methods. In practice, most line searches are inexact, and many methods have been proposed. One is the backtracking method, which is employed for our design. It is very simple to implement and quite effective. Besides, the step size is updated at each iteration to satisfy w ~ i > 0 for all i.



The authors would like to thank the ‘Institut Carnot Télécom et Société Numérique’ for funding the work of this paper.

Authors’ Affiliations

Institut Télécom; Télécom Bretagne; UMR CNRS 6285 Lab-STICC, Brest, France


  1. Sato H: On degraded Gaussian two-user channels. IEEE Trans. Inform. Theory 1978, IT-24: 637-640.View ArticleGoogle Scholar
  2. Carleial A: Interference channels. IEEE Trans. Inform. Theory 1978, 24: 60-70. 10.1109/TIT.1978.1055812MathSciNetView ArticleMATHGoogle Scholar
  3. Maddah-Ali M, Motahari AS, Khandani AK: Communication over MIMO X channels: interference alignment, decomposition, and performance analysis. IEEE Trans. Inform. Theory 2008, 54(8):3457-3470.MathSciNetView ArticleMATHGoogle Scholar
  4. Cadambe VR, Jafar SA: Interference alignment and degrees of freedom of the K-user interference channel. IEEE Trans. Inform. Theory 2008, 54(8):3425-3441.MathSciNetView ArticleMATHGoogle Scholar
  5. Motahari AS, Gharan SO, Maddah-Ali MA, Khandani AK: Real interference alignment: exploiting the potential of single antenna systems. arXiv:0908.2282Google Scholar
  6. Choi SW, Jafar SA, Chung S-Y: On the beamforming design for interference alignment. IEEE Commun. Lett 2009, 13(11):847-849.View ArticleGoogle Scholar
  7. Gomadam K, Cadambe VR, Jafar SA: Approaching the capacity of wireless networks through distributed interference alignment. In Proc. of IEEE Global Communications Conference, GLOBECOM. IEEE, New Orleans, USA; 2008.Google Scholar
  8. Peters SW, Heath RW: Interference alignment via alternating minimization. Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, 2445-2448.Google Scholar
  9. Santamaria I, Gonzalez O, Heath R, Peters S: Maximum sum-rate interference alignment algorithms for MIMO channels. In Proc. of IEEE Global Communications Conference, GLOBECOM. IEEE, Miami, USA; 2010.Google Scholar
  10. Yetis C, Gou T, Jafar SA, Kayran AH: On feasibility of interference alignment in MIMO interference networks. IEEE Trans. Signal Process 2010, 58(9):4771-4782.MathSciNetView ArticleGoogle Scholar
  11. Sung H, Park S, Lee K, Lee I: Linear precoder designs for K-user interference channels. IEEE Trans. Wireless Commun 2010, 9(1):291-300.View ArticleGoogle Scholar
  12. Serre D: Matrices: Theory and Applications, Second Edition. Springer, New York; 2010.View ArticleGoogle Scholar
  13. Higham NJ: Analysis of the Cholesky Decomposition of a Semi-Definite Matrix. Oxford University Press; 1990.MATHGoogle Scholar
  14. Boyd S, Vandenberghe L: Convex Optimization. Cambridge University Press, New York; 2004.View ArticleMATHGoogle Scholar
  15. Kim D, Torlak M: Optimization of interference alignment beamforming vectors. IEEE J. Selected Areas Commun 2010, 28(9):1425-1434.View ArticleGoogle Scholar
  16. Shen H, Li B, Tao M, Luo Y: The new interference alignment scheme for the MIMO interference channel. In Wireless Communications and Networking Conference (WCNC), 2010 IEEE. Sydney, Australia; 2010:1-6.Google Scholar
  17. Wolniansky PW, Foschini GJ, Golden G, Valenzuela RA: V-blast: an architecture for realizing very high data rates over the rich-scattering wireless channel. In Signals, Systems, and Electronics, 1998. ISSSE 98. 1998 URSI International Symposium On. IEEE, Pisa; 1998:295-300.Google Scholar
  18. Shen M, Host-Madsen A, Vidal J: An improved interference alignment scheme for frequency selective channels. In Proc. of IEEE International Symposium on Information Theory, ISIT. Toronto, ON; 2008:6-11.Google Scholar
  19. Magnus J, Neudecker H: Matrix Differential Calculus with Applications in Statistics and Econometrics. Wiley, Chichester; revised version 2007.MATHGoogle Scholar


© Fadlallah et al.; licensee Springer. 2014

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.