# Performance study of multiuser interference mitigation schemes for hybrid broadband multibeam satellite architectures

- Jesús Arnau
^{1}Email author, - Bertrand Devillers
^{2}, - Carlos Mosquera
^{1}and - Ana Pérez-Neira
^{2, 3}

**2012**:132

https://doi.org/10.1186/1687-1499-2012-132

© Arnau et al; licensee Springer. 2012

**Received: **15 November 2011

**Accepted: **5 April 2012

**Published: **5 April 2012

## Abstract

As the demand for higher throughput satellites increases, multibeam architectures with smaller beam spots are becoming common place. If the same frequency is strongly reused, the resulting interference when serving simultaneously many users requires some sort of pre or post-cancelation process. This article focuses on precoding and multiuser detection schemes for multibeam satellites, comparing hybrid on-board on-ground beamforming techniques with fully ground-based beamforming. Both techniques rely on the exchange of radiating element signals between the satellite and the corresponding gateway but, in the latter case, the interference mitigation process acts on all the radiating signals instead of the user beams directly, with the corresponding extra degrees of freedom for those cases for which the number of radiating elements is higher than the number of user beams. The analysis carried out in this study has shown that the potential advantage of ground-based beamforming may exceed 20% of the total throughput.

## Keywords

## 1 Introduction

The use of multiple spot beams in modern broadband satellites has increased during the last few years in an effort to serve higher throughput demands with a scalable cost, for which frequency reuse among users is required [1]. Thus, the same frequency band is shared by different beams to provide an overall higher throughput as long as the multiuser interference can be kept under control. This interference occurs due to the non-null side lobes of the beams radiation patterns, and is related to the degree of reuse of the spectrum. A partial frequency reuse would exclude adjacent beams from using the same portion of spectrum (or *color*). However, more aggressive frequency reuse strategies [2, 3] can push forward the overall spectral efficiency provided the resulting interference can be efficiently managed. On the other side, the increase in the number of spot beams, in the desired capacity per beam, and in the frequency reuse might convert feeder links in a bottleneck. Higher frequency bands such as Q/V, optical communications or multigateway architectures need to be addressed to accommodate the required capacity.

Conventional beamforming techniques are space-based (or on-board) architectures including analog or digital beamforming networks. On-board volume and calibration requirements are perhaps their main drawbacks. In an attempt to shift the complexity to the ground segment, more recent ground-based beamforming (GBBF) techniques rely on the exchange of radiating element signals between the satellite and the gateway. The forming of beams is realized on-ground with all the flexibility offered by on-ground digital signal processing [4]. Again, at the cost of a higher feeder link bandwidth demand for those cases with more feeds or radiating elements than number of beams, more sophisticated and power consuming techniques can be implemented. Flexibility is preserved, and changes in shape, traffic and pointing direction can be accommodated. Multiuser interference mitigation schemes such as precoding or multiuser detection can be jointly designed with the beamforming process at the gateway station: this joint optimization process is expected to provide some gain in terms of capacity as we will show in this study. The interference mitigation process would act on all the radiating signals instead of the user beams directly, with the corresponding extra degrees of freedom.

More specifically, the forward link from the gateway to the users can be recast as a multiple-input multiple-output (MIMO) broadcast channel [5] from an information theoretic perspective, whereas the return link corresponds to the multiple access channel (MAC) [6]. In the forward link, the sum rate capacity is known to coincide with the rate region on the non-linear dirty paper coding (DPC) [7]. However, linear alternatives are more attractive in terms of complexity. For instance, zero-forcing (ZF) linear precoding performs close to DPC in case many users are available and optimal user scheduling is performed [8]. By relaxing the zero interference constraint at each user, it has been proved that the so-called regularized channel inversion (a MMSE-like precoder) can significantly improve the performance [9].

As for the return link, the maximum sum-rate is known to be achieved via successive interference cancelation with minimum mean-squared error filtering at each stage (MMSE-SIC) [10]. However, much simpler, linear alternatives, such as the zero-forcing (ZF) receiver or the plain MMSE receiver ([11, 12]) are also popular because of their lower computational complexity.

Recently, the performance of the return link of a full on-ground architecture was investigated in [13], featuring an adaptive coding and modulation (ACM) enhanced DVB-RCS physical layer. Results showed an increase in throughput at the cost of some loss in availability when linear MMSE was applied; with MMSE-SIC, a significant improvement in throughput and availability was reported. An equivalent analysis for the forward link was reported in [3], where again a throughput increase was achieved at the expense of a loss in availability.

Within the SatNEx III (Satellite Network of Experts) framework, funded by the European Space Agency, and building on previous results such as those above, the use of multiuser interference mitigation schemes together with GBBF was analyzed to evaluate its potential improvement with respect to more classic onboard beamforming settings. The main achievements of this research, which have been partly presented in [14, 15], are reported in detail in this article. It will be shown that full on-ground architectures tend to outperform hybrid architectures with fixed on-board weights and on-ground multiuser interference mitigation. This conclusion will be analytically supported under perfect channel state information (CSI), and a sufficient condition for the two architectures to be equivalent will be given. Moreover, detailed simulations will provide insight on the behavior of both architectures when imperfect channel knowledge is assumed, and also when the analog beamforming suffers from miscalibration issues.

The rest of the article is structured as follows: Section 2 describes the system model, Section 3 evaluates the performance of some processing techniques when the gateway has perfect CSI, Section 4 describes the modeling of non-perfect CSI, Section 5 reports simulation results and, finally, conclusions are summarized in Section 6.

**Notation**: Boldface uppercase letters denote matrices and boldface lowercase letters refer to column vectors. We denote by (.) ^{
H
} the Hermitian transpose. The *N* × *N* identity matrix is denoted by **I** _{
N
}, and diag(**a**) builds a diagonal matrix from the elements of the vector **a**. Nonboldface lowercase letters are used to refer to the entries of a matrix: the (*k, l*)th entry of the matrix **W** is denoted by *w*_{
kl
} .

## 2 System description

*K*beam spots; a single user link is active at a given time and carrier block at each beam. The satellite in Figure 1 uses a fed reflector antenna array with

*N*feeds to exchange signals with the users. In the absence of on-board beamforming, all these signals will be relayed through a feeder link with the gateway station (GW) on Earth. In the sequel, we will assume a single gateway and neglect the possible impairments caused by the feeder link. As a more conventional option, if on-board beamforming is applied, beamforming weights will be assumed to be fixed and

*K*signals, one per beam, will be synthesized from the combination of the

*N > K*feed signals, with the corresponding reduction in the feeder link required capacity. For the radiation pattern which will be considered, only a small subset of the feeds will be involved in the conformation of each individual beam. As a limit case with practical application in some cases, for

*N*=

*K*each radiating element feeds a different beam, and the two options considered in this article collapse to the same case.

We will refer to the full on-ground processing as *feed processing*, whereas the hybrid architecture with on-board fixed beamforming will be often quoted as *beam processing*.

### 2.1 Return link

*K*be the number of users on Earth and

*N*is the number of on-board feeds. At the feed level, the mathematical model of the return link reads

**y**is an

*N*× 1 vector that contains the symbols received at each feed,

**s**is a

*K*× 1 stack of the symbols transmitted by each user (see Figure 2),

**n**is the

*N*× 1 vector of zero-mean complex white Gaussian noise, such that $\text{E}\left\{\text{n}{\text{n}}^{H}\right\}={N}_{o}\cdot {\text{I}}_{N}$, and

**H**represents the

*N*×

*K*channel matrix. The channel flat-frequency response is parameterized as

where each matrix is described in the following paragraphs.

*Feed radiation pattern and path losses*. **G** is assumed to be an *N* × *K* matrix that accounts for the gains of the feed radiation pattern, the on-board attenuation and the free space losses; recall that the feeder link is considered transparent. Matrix **G** is not deterministic given the random positions of the users within their corresponding beam spots.

*Atmospheric fading*. The attenuation due to atmospheric phenomena--specially the rain--can be significant in bands such as the Ka-band. In this study, the empirical probability density function (pdf) of this attenuation, obtained for the city of Rome, was used [16]. This pdf would define the statistics of the marginal distribution of each link's attenuation, but it reports no information about the possible spatial correlation.

This spatial correlation is of great importance for the design of multi-satellite systems as the one addressed in [17], and its influence in a general random MIMO channel has been covered in [18] but, as far as the authors know, little has been said about rain correlation in multibeam satellite systems when there is only one user per beam. In [19], the concept of correlated area (CA) was introduced, defined as a spatial region in which Earth stations experiment highly correlated rain attenuation; the correlation with stations out of the CA would be considered negligible. The shape and length of a CA depends on many environmental factors, but diameters between 30 and 50 km are quoted to be frequent.

Based on these figures, it seems reasonable to assume that each beamspot belongs to a different CA when the radius of the spots is large enough. This was done, for instance, in [20], where beamspots have a diameter of 250 km and therefore users from different cells are assumed to be uncorrelated in terms of rain fading. In our case, the beam radius is also large (100 beams to cover Europe), and correlation values have been proven to be negligible, as shown in Appendix 1. As a consequence, matrix **L** is assumed to be diagonal and its entries are considered independent.

Further refinements of the channel model were not considered since this simple characterization has been deemed to be useful for the intended comparisons.

**B**the

*K*×

*N*beamforming matrix and define ${\text{H}}_{b}\doteq \text{B}\text{H}$ and ${\text{n}}_{b}\doteq \text{B}\text{n},$ then the received signal becomes

Note that $\mathrm{\Sigma}\doteq \text{E}\left\{{\mathbf{n}}_{b}{\mathbf{n}}_{b}^{H}\right\}={N}_{o}\cdot \mathbf{B}{\mathbf{B}}^{H}.$

### 2.2 Forward link

where **r** (resp. **w**) is a *K* × 1 vector containing the stack of the received signals (resp. noise components) at each user. Similarly as before, we assume that $\text{E}\left\{\mathbf{w}{\mathbf{w}}^{H}\right\}={N}_{o}\cdot {\mathbf{I}}_{K}$. The *N* × 1 vector **x** is the stack of the transmitted signals at all feeds. The forward link channel matrix is simply the transpose of that of the return link: **H**^{
T
} is of size *K* × *N*. Let us stress that this reciprocity is strictly limited to the mathematical formalism of both links. In fact, the reciprocity between the forward and return links does not hold in practice, since they typically involve different frequency bands.

where *P*_{
T
} denotes the total transmit power.

**x**=

**B**

^{ T }

**x**

_{ b }, where

**x**

_{ b }is the stack of the on-ground transmitted signal in the beam space, keeping in mind that a perfectly calibrated and noiseless feeder link is assumed. The signal model (4) becomes

where ${\text{H}}_{b}\doteq \text{B}\text{H}$ was defined above and expresses the principle of beamforming: the effect of the matrix **B** is essentially the linear combination of the radiation pattern of all *N* feeds to generate *K* beam radiation patterns.

## 3 Perfect CSI at the gateway

As a first step, let us assume that the gateway has perfect knowledge of the channel state, when acting either as transmitter or receiver. Throughout this section, we will establish measures of performance and show that, under linear combining, feed processing outperforms beam processing.

### 3.1 Return link

A key objective of this study is to compare the performance of these two approaches. To accomplish this task, we will make use of the mean-squared error (MSE) after combining, which is defined as $\text{E}\left\{|\mathbf{s}-\widehat{\mathbf{s}}{|}^{2}\right\}$.

**Q**

_{ f }its covariance matrix, then it would read

with **P** = **B**^{
H
} (**BB**^{
H
} )^{-1}**B**.

*i*th user is given by 1/

**Q**

_{ ii }-1[11], it makes sense to use the total MSE, given by ${\sum}_{i=1}^{K}{\mathbf{Q}}_{ii}=\text{trace}\left\{\mathbf{Q}\right\}$, as a performance metric. Then, it can be shown that

where *λ*_{
i
} (**H**^{
H
}**H**) denotes the *i* th largest eigenvalue of **H**^{
H
}**H**. We have that (11) is an immediate consequence of the following, stronger result.

**Theorem 1** *Let* **H** *and* **B**^{
H
} *be two tall matrices of the same size with full column rank. Let* **P** *be a projection matrix of* **B**, *that is*, **P** = **B**^{
H
} (**BB**^{
H
} )^{-1}**B**. *Then, it holds that σ*_{
i
} (**H**) ≥ *σ*_{
i
} (**PH**), *with σ*_{
i
} (**A**) *denoting the ith largest singular value of matrix* **A**.

For the proof, see Appendix 2.

A sufficient condition for the traces to be equal is that **PH** = **H**. Since **P** is a projection matrix, this will happen whenever range(**B**^{
H
} ) = range(**H**). However, since **B** is fixed and **H** time-varying, it seems not possible to meet such condition. Recall that, even if the fading is negligible, the users are assumed to be located randomly into their beam spots. Thus, assuming constant **H** would require the feed pattern to be constant over each cell; it would also require users near the border of their cells to experiment almost the same interference as if they were located close to the center. On account of all these facts, ensuring range(**B**^{
H
} ) = range(**H**) is not realistic.

### 3.2 Forward link

**F**the

*N ×*

*K*precoding matrix, and

**s**the

*K*× 1 symbol vector. The

*k*th entry of

**s**is the unit energy constellation symbol destined to the

*k*th user. To comply with the transmit power constraint (5), the precoding matrix

**F**has to satisfy

**F**

_{ b }is the

*K*×

*K*precoding matrix, in terms of which the transmit power constraint (5) becomes

*β*and

*β*

_{ b }has to be chosen such to comply with (15) and (17), respectively. Note that these particular versions of the ZF linear precoders are such that they equalize the signal to noise ratio (SNR) among users. The resulting SNR is given by

for the feed and beam processing, respectively. We prove here that the SNR achieved by the feed processing is always greater than or equal to that associated with the beam processing: SNR _{
f
} ≥ SNR_{
b
}. This is a direct consequence of the following property.

**Theorem 2**

*Let*

**H**

*and*

**B**

^{ H }

*be two tall matrices of the same size with full column rank. Then, the following inequality holds:*

*The equality is reached if* **H** *and* **B** *share the same left and right singular vectors, respectively*.

For the proof, see Appendix 3.

As stated in the theorem, a sufficient condition for the beam processing not to suffer any performance loss with respect to the feed processing is that the matrices **H** and **B** share the same left and right singular vectors, respectively. However, in a similar way as for the return link, this condition is not likely to be met in practice since **B** is a fixed (non channel-adaptive) beamforming matrix while **H** is time-varying (due to the random characteristic of the users positions).

## 4 Non-perfect CSI at the gateway

In a realistic scenario, the gateway does not know the actual values in the channel matrix **H**, but has only an estimate of them. The type and quality of these estimates, commonly based on the use of training sequences, will have an effect on the ultimate performance of the system. In this section, the estimation of the matrix **H** will be introduced. Some degree of uncertainty on the beamforming matrix **B** will also be discussed, while the noise power *N*_{0} will be assumed perfectly known.

### 4.1 Channel estimation by training sequences

For the estimation of the channel in the return link, each user employs a distinct training sequence, known as its unique word (UW). The gateway, upon reception of all the sequences, estimates the values in the channel matrix.

If perfect symbol synchronism can be assumed, then Walsh-Hadamard sequences can be used as UWs, and it is possible to apply the pseudoinverse procedure [13] to estimate the channel matrix. However, due to the nature of the communication in the return link, symbol synchronism cannot be assumed.

In spite of this fact, the pseudoinverse procedure can still be used as long as good timing and frequency recovery is applied. Channel estimation under these circumstances would require the use of pseudorandom sequences with good cross-correlation properties, rather than Walsh-Hadamard sequences. Moreover, under accurate synchronization, pseudorandom sequences have been reported [13] to produce negligible correlation between the estimation errors of the different elements of the matrix.

*L*is the training sequence length and

**W**is a matrix with independent zero-mean unit-variance Gaussian entries. For the case with beamforming on-board, it would read

In the forward link, the symbol synchronicity is not an issue anymore. We consider that the precoder design is now based on a feed channel estimate ${\widehat{\mathbf{H}}}^{T}={\mathbf{H}}^{T}+\mathbf{E}$ (or ${\widehat{\mathbf{H}}}_{b}^{T}={\mathbf{H}}_{b}^{T}+{\mathbf{E}}_{b}$ in the beam space). Each row of ${\widehat{\mathbf{H}}}^{T}$(or ${\widehat{\mathbf{H}}}_{b}^{T}$) is based on a channel estimation which is carried out separately at each user terminal and then reported to the gateway via a return channel (assumed ideal). Note that the reporting of ${\widehat{\mathbf{H}}}^{T}$ to the gateway implies feeding back (*N*-*K* )*K* more channel samples than for reporting ${\widehat{\mathbf{H}}}_{b}^{T}$. We assume *L*-length orthogonal training sequences, such that the entries of **E** (or **E**_{
b
} ) are i.i.d. zero mean complex circular symmetric Gaussian random variables with variance inversely proportional to *L*.

### 4.2 Analog miscalibration

where the entries in *Δ* **B** are of the form *Δ* **B**_{
ij
} = *b*_{
ij
} *r*_{
ij
} and *r*_{
ij
} are independent, real, zero-mean Gaussian random variables. This models a variation, both in the real and imaginary parts of the weights, that is random with given variance, but proportional to the original value.

## 5 Simulation results

*K*= 100 beams covering the whole Europe area. The satellite antenna pattern was provided by ESA, and corresponds to an array fed reflector antenna with

*N*= 155 feeds. Matrix

**B**was also provided by ESA, as a typical beamforming matrix of current systems. It was designed such as to limit the level of interference among users in a conventional system (without interference mitigation technique but with adequate frequency reuse pattern).

Simulation parameters

Simulation parameters | |
---|---|

Atmospheric fading | City of Rome [16] |

UTs location distribution | Uniformly distributed |

Feed and beam gain patterns | Generated from data provided by ESA |

FL simulation parameters | |

UT antenna gain ${G}_{R}^{2}$ | 41.7 dBi |

UT clear sky ${G}_{R}^{2}/T$ ClearSky | 17.68 dB/K |

UT rain delta temperature | 221.83 K |

RL simulation parameters | |

Training sequence length | 128 symbols |

Receiver noise figure | 2.5 dB |

Total receiver noise temperature | 517 K |

The user link has a total available bandwidth of 500 MHz; color schemes with frequency reuse factor equal to 3 and 1 were studied, corresponding to 166 and 500 MHz available bandwidth per beam, respectively. The reference scenario consists in a frequency reuse factor equal to 3, fixed beamforming and no processing at the gateway. Simulation results have been extracted for a number of interference mitigation techniques, both for the forward link and the return link. The purpose of this is to compare the performance of both architectures in as many different situations as possible. For illustrations purposes simulation results cover a large range of transmit powers, although it is important to stress that the most extreme values do not correspond to practical cases.

Results have been averaged for a total of 1,000 channel realizations, with the exception of those showing the average probability of non-availability, which required 10,000 iterations to yield a reasonable confidence interval. Apart from the fading, the randomness of the channel is due to the position of the users, which are assumed to be uniformly distributed within each spot. For each realization, the SINR for each user after interference mitigation is computed, and its throughput is then inferred according to the preliminary specifications or DVB-RCS2 in the RL and DVB-S2 in the FL.

### 5.1 Return link

The user link operates at 30 GHz (Ka-band), and is based on the DVB-RCS2 standard [22]. The baudrate is 4 Msymb/s and the guardbands amount to the 11% of the carrier bandwidth [23]. Apart from the MMSE receiver presented in Section 3.1, the MMSE-SIC receiver has also been simulated, since it is known to be capacity achieving under ideal conditions, and therefore provides an upper bound on the achievable performance.

*t*

_{ f }and

*ρ*

_{ f }be the average throughput and average availability, respectively, of the overall system when full on-ground feed processing is employed, and define

*tb*and

*ρb*as the corresponding counterparts in the beam processing case. The feed combining gain is defined as

*Δ*

**B**={

*b*

_{ ij }

*r*

_{ ij }} and let

*β*be the variance of the random variables

*r*

_{ ij }. Following this model, Figure 7 depicts the evolution of the feed combining gain for different values of

*β*, that is, for different degrees of uncertainty on the analog beamforming weights. Results account for more significant feed combining gains when some degree of uncertainty is present.

### 5.2 Forward link

Again, simulation results will use the average total throughput and availability as performance measures, but this time the total transmit power *P*_{
T
} will be used as a parameter rather than the EIRP since the directivity of the feeds is part of the feed radiation pattern data provided by ESA.

*P*

_{ T }= 30 dBW the regularized channel inversion in the feed space generates a 111% relative throughput increase with respect to the reference scenario, and 17% with respect to the same processing in the beam space. However, a slight decrease in system availability can still be observed with respect to the reference scenario. Figures 10 and 11 consider additionally the UpConst MMSE precoder, and illustrate again the benefit of the full on-ground architecture. It can be seen that the comparison between the regularized channel inversion and UpConst MMSE precoders depends on the value of the transmit power: in terms of availability, the regularized channel inversion outperforms the UpConst MMSE precoder for high values of the transmit power, and viceversa at low values of the transmit power.

*L*= 256. The full on-ground architecture still appears beneficial, especially throughput wise. Moreover, from comparing Figures 11 and 13, it can be noticed that the regularized channel inversion appears to be more robust to imperfect CSI than the precoder based on the uplink-downlink duality.

*L*= 1024, and

*L*= 256. We can observe the robustness of the feed processing with regularized channel inversion. In fact, quite surprisingly, for moderate to high values of the transmit power the relative gain generated by the full on-ground architecture for this precoder increases as the degree of CSI decreases, reaching 24% with

*P*

_{ T }= 30 dBW and

*L*= 256 symbols.

## 6 Conclusions

The results obtained in the previous sections show that feed-level techniques tend to outperform beam processing ones, a fact that has been analytically proved for the case of perfect CSI. In the particular case of the return link, this gain may be rather small when the channel estimation errors are noticeable but the estimation of the on-board analog beamforming is accurate. On the contrary, if the uncertainty on the beamforming is high, then the advantage for using all the information from the feeds seems to be much higher: with the greatest level of uncertainty simulated, the feed combining gain reaches 20% at an EIRP 40 dBW. In what refers to the forward link, the uncertainty about the analog beamforming is not relevant, and the feed processing gain amounts to 24% with *P*_{
T
} = 30 dBW and a realistic level of CSI. It is to be noticed, however, that working with the 155 feed signals would require rather more bandwidth in the feeder link. Therefore, there exists a tradeoff between performance and feeder link requirements, and the choice of the most suitable processing architecture would need to take into account all these considerations.

## Appendix 1: Spatial correlation of rain fading

*A*

_{1}and

*A*

_{2}, is lognormal and presents the following correlation factor [25]

*d*is the horizontal distance between both points on Earth, ${\sigma}_{i}^{2}$ is the variance of the marginal distribution of

*A*

_{ i }and

*c*(

*d*) is the correlation factor between the rain rates. According to [26], this factor may be accurately modeled as

*d*

_{0}is the distance at which

*c*(

*d*) = 1/

*e*(usually called

*decorrelation distance*) and

*s*

_{0}is a shape parameter. Although values for both variables must be set according to detailed environmental data, in [26] it has been shown that, for the

*instantaneous*correlation between two points,

*s*

_{0}≈ 1, yielding

As for *d*_{0}, different values have been proposed in the literature and, in any case, we must note that these will be highly dependent on the geographical area. For instance, in articles devoted to terrestrial communications like [27, 28], values around *d*_{0} = 0.46 km are reported, while [25] uses *d*_{0} = 1.844 km; [26] even reports values up to *d*_{0} = 7 km for the State of Oklahoma, although this value was obtained by averaging over a very long period of time.

*r*with respect to

*d*for different values of

*d*

_{0}. Since, as stated above, we have assumed the same statistics for the marginal attenuations over each path, Equation (27) simplifies to

*σ*

^{2}= 1.58. As we can see, the correlation factor quickly runs close to zero as the distance increases. In Figure 16, the same data is represented in log scale, in order to interpret such small values; the range of distances has also been multiplied by four. We can see that, even for the (far) most pessimistic case, the correlation values are very small, if not close to zero, after

*d*= 60 km.

## Appendix 2: Proof of Theorem 1

with *σ*_{
i
} (**A**) representing the *i* th largest singular value of matrix **A**.

**B**=

**U**Σ

**V**

^{ H }, the singular value decomposition (SVD) of

**B**, then

**V**is a unitary matrix, then it holds that

**A**≐

**VH**. On account of the previous statements, proving (31) is equivalent to proving

**A**is larger or equal to that of the same matrix after setting some rows to zero. To prove this fact, we will make use of the following property: let

**A**be in general any tall matrix such that

**A**= [

**a**

_{1}

**a**

_{2}...

**a**

_{ k }] and define

**A**

_{ r }= [

**a**

_{1}

**a**

_{2}...

**a**

_{ r }], then for all

*r*from 1 to

*k*- 1 it holds that [29]

**A**loses rows and not columns. Recall now that matrix

**A**is of size

*N*×

*N*. If we write

*K*×

*N*, and

and concludes the proof. □

## Appendix 3: Proof of Theorem 2

**H**=

**V**Σ

_{ H }

**U**

^{ H }, the left-hand side in (42) can be simplified as

where ${\sigma}_{k}^{2}\left(\mathbf{H}\right)$ denotes the *k* th largest singular value of **H**.

**B**=

**W**Σ

_{ B }

**Q**

^{ H }, the right-hand side in (42) can easily be worked out as

**Z**

_{1}is of size

*K*×

*K*, submatrices

**Z**

_{2}and ${\mathbf{Z}}_{3}^{H}$ are

*K*× (

*N-K*), and submatrix

**Z**

_{4}is (

*N*-

*K*) × (

*N*-

*K*). We denote the

*k*th largest eigenvalue of ${\mathbf{Z}}_{1}^{H}{\mathbf{Z}}_{1}$ by ${\lambda}_{k}\left({\mathbf{Z}}_{1}^{H}{\mathbf{Z}}_{1}\right)$. Let us first realize that

**Z**is a unitary matrix, we have that

**Z**

^{ H }

**Z**=

**I**

_{ N }, which implies that

Hence, the matrix ${\mathbf{M}}^{H}{\mathbf{Z}}_{3}^{H}{\mathbf{Z}}_{3}\mathbf{M}$ has to be diagonal. Moreover, it has to have positive elements on the diagonal since ${\mathbf{Z}}_{3}^{H}{\mathbf{Z}}_{3}$ is semi positive definite, which proves (47).

where (51) follows from (47), and concludes the proof.

Note that the inequality becomes an equality if **Z** = **I**_{
N
} , that is, **Q**^{
H
}**V** = **I**_{
N
}. In others words, the equality is reached if **H** and **B** share the same left and right singular vectors, respectively. □

## Declarations

### Acknowledgements

The authors would like to acknowledge the work of the University of Bologna and Technical University of Graz teams, in particular, M. Bergmann, W. Gappmair, E. A. Candreva, G.E. Corazza, F. Lombardo, I. Thibault, and A. Vanelli-Coralli. We are also in debt with Riccardo de Gaudenzi for his valuable comments. Research supported by ESA contract 23089/10/NL/CLP "SatNEx Network of Experts", the European Regional Development Fund (ERDF) and the Spanish Government under projects DYNACS (TEC2010-21245-C02-02/TCM) and COMONSENS (CONSOLIDER-INGENIO 2010 CSD2008-00010), and the Galician Regional Government under projects "Consolidation of Research Units" 2009/62 and 2010/85. The work of B. Devillers was also partially supported by the Spanish Government under project TEC2010-17816 (JUNTOS). The work of A. Pérez-Neira had been supported by the Spanish Government under project TEC2008-06327-C03-01 and the Catalan Government under the grant 22009SGR0891. Some preliminary results of this study were presented at Asilomar and Globecom 2011 conferences.

## Authors’ Affiliations

## References

- Brandel D, Watson W, Weinberg A: Nasa's advanced tracking and data relay satellite system for the years 2000 and beyond.
*Proc IEEE*1990, 78: 1141-1151. 10.1109/5.56928View ArticleGoogle Scholar - Caire G, Debbah M, Cottatellucci L, De Gaudenzi R, Rinaldo R, Mueller R, Gallinaro G: Perspectives of adopting interference mitigation techniques in the context of broadband multimedia satellite systems.
*ICSSC 2005, 23rd AIAA International Communications Satellite Systems Conference, Rome, Italy*2005, 1: 1-5.Google Scholar - Cottatellucci L, Debbah M, Gallinaro G, Mueller R, Neri M, Rinaldo R: Interference mitigation techniques for broadband satellite systems.
*Proc 24th AIAA Int Commun Satell Systems Conf, ICSSC, San Diego CA*2006, 1: 1-13.Google Scholar - Angeletti P, Alagha N: Space/ground beamforming techniques for emerging hybrid satellite terrestrial networks.
*27th International Communications Satellite Systems Conference (ICSSC 2009), Edimburgh, UK*2009, 1: 1-6.Google Scholar - Vishwanath S, Jindal N, Goldsmith A: Duality, achievable rates, and sum-rate capacity of gaussian mimo broadcast channels.
*IEEE Trans Inf Theory*2003, 49: 2658-2668. 10.1109/TIT.2003.817421MathSciNetView ArticleMATHGoogle Scholar - Somekh O, Shamai S: Shannon-theoretic approach to a gaussian cellular multiple-access channel with fading.
*IEEE Trans Inf Theory*2000, 46: 1401-1425. 10.1109/18.850679MathSciNetView ArticleMATHGoogle Scholar - Caire G, Shitz S: On the achievable throughput of a multi-antenna gaussian broadcast channel.
*IEEE Trans Inf Theory*2003, 49: 1691-1707. 10.1109/TIT.2003.813523View ArticleMATHGoogle Scholar - Yoo T, Goldsmith A: On the optimality of multiantenna broadcast scheduling using zero-forcing beamforming.
*IEEE J Sel Areas Commun*2006, 24: 528-541.View ArticleGoogle Scholar - Peel C, Hochwald B, Swindlehurst A: A vector-perturbation technique for near-capacity multiantenna multiuser communication. Part I: channel inversion and regularization.
*IEEE Trans Commun*2005, 53: 195-202. 10.1109/TCOMM.2004.840638View ArticleGoogle Scholar - Tse D, Viswanath P:
*Fundamentals of Wireless Communication*. Cambridge University Press, New York, NY, USA; 2005.View ArticleMATHGoogle Scholar - Choi J:
*Optimal Combining and Detection: Statistical Signal Processing for Communications*. 1st edition. Cambridge University Press, New York, NY, USA; 2010.View ArticleMATHGoogle Scholar - Paulraj A, Nabar R, Gore D:
*Introduction to Space-Time Wireless Communications*. 1st edition. Cambridge University Press, New York, NY, USA; 2008.Google Scholar - Gallinaro G, Debbah M, Müller R, Rinaldo R, Vernucci A: Interference mitigation for the reverse-link of interactive satellite networks.
*9th International Workshop on Signal Processing for Space Communications, Noordwijk, The Netherlands*2006, 1: 1-7.Google Scholar - Arnau-Yanez J, Bergmann M, Candreva E, Corazza G, de Gaudenzi R, Devillers B, Gappmair W, Lombardo F, Mosquera C, Perez-Neira A, Thibault I, Vanelli-Coralli A: Hybrid space-ground processing for high-capacity multi-beam satellite systems.
*IEEE Global Telecommunications Conference (GLOBECOM 2011), Houston TX*2011, 1: 1-6.Google Scholar - Devillers B, Pérez-Neira A, Mosquera C: Joint linear precoding and beamforming for the forward link of multi-beam broadband satellite Systems.
*Proc IEEE Global Communications Conference, GLOBECOM, Houston, Texas*2011, 1: 1-6.Google Scholar - Zorba N, Realp M, Perez-Neira A: An improved partial CSIT random beamforming for multibeam satellite systems.
*10th International Workshop on Signal Processing for Space Communications, 2008 SPSC, Rhodes Island, Greece*2008, 1: 1-8.Google Scholar - Liolis KP, Panagopoulos AD, Cottis PG: Multi-satellite MIMO communications at ku-band and above: investigations on spatial multiplexing for capacity improvement and selection diversity for interference mitigation.
*EURASIP J Wirel Commun Netw*2007, 2007: 16-16.View ArticleGoogle Scholar - Ishimaru A, Ritcey J, Jaruwatanadilok S, Kuga Y: A MIMO propagation channel model in a random medium.
*IEEE Trans Antennas and Propagation*2010, 58: 178-186.View ArticleGoogle Scholar - Castro M, Granados G: Cross-layer packet scheduler design of a multibeam broadband satellite system with adaptive coding and modulation.
*IEEE Trans Wirel Commun*2007, 6: 248-258.View ArticleGoogle Scholar - Chatzinotas S, Zheng G, Ottersten B: Joint precoding with flexible power constraints in multibeam satellite systems.
*IEEE Global Telecommunications Conference (GLOBECOM), Houston TX*2011, 1: 1-5.Google Scholar - Kay SM:
*Fundamentals of Statistical Signal Processing: Estimation Theory*. Prentice-Hall, Inc., Upper Saddle River, NJ, USA; 1993.MATHGoogle Scholar - DVB.org, Digital Video Broadcasting (DVB); Second Generation DVB Interactive Satellite System; Part 2: Lower Layers for Satellite standard 2011.Google Scholar
- Brandt H, Lücke O, Boussemart V, Párraga-Niebla C, Flo T, Kissling C, Schweikert R: Resources Management using adaptive fade mitigation techniques in DVB-S2/RCS multi-beam systems.
*Proccedings of the 25th International Communications Satellite Systems Conference (ICSSC 2007), Seoul, South Korea*2007, 1: 1-13.Google Scholar - Viswanath P: DNC Tse, Sum capacity of the vector Gaussian broadcast channel and uplink-downlink duality.
*IEEE Trans Inf Theory*2003, 49: 1912-1921. 10.1109/TIT.2003.814483MathSciNetView ArticleMATHGoogle Scholar - Gremont B, Filip M: Simulation of a high frequency satellite link with a fade countermeasure.
*IEE National Conference on Antennas and Propagation, York, UK*1999, 1: 164-168.View ArticleGoogle Scholar - Ciach GJ, Krajewski WF: Analysis and modeling of spatial correlation structure in small-scale rainfall in central oklahoma.
*Adv Water Resour*2006, 29(10):1450-1463. 10.1016/j.advwatres.2005.11.003View ArticleGoogle Scholar - Arapoglou P-DM, Kartsakli E, Chatzarakis GE, Cottis PG: Cell-site diversity performance of lmds systems operating in heavy rain climatic regions.
*Int J Infrared Millimeter Waves*2004, 25: 1345-1359. doi:10.1023/B:IJIM.0000045144.01224.bfView ArticleGoogle Scholar - Cheffena M, Braten L, Ekman T: On the space-time variations of rain attenuation.
*IEEE Trans Antennas Propag*2009, 57: 1771-1782.View ArticleGoogle Scholar - Golub GH, Van Loan CF:
*Matrix Computations (Johns Hopkins Studies in Mathematical Sciences)*. 3rd edition. The Johns Hopkins University Press, Baltimore, MD; 1996.Google Scholar - Marshall AW, Olkin I:
*Inequalities: Theory of Majorization and its Applications*. Academic Press, New York, NY; 1979.MATHGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.