Performance study of multiuser interference mitigation schemes for hybrid broadband multibeam satellite architectures

Arnau, Jesús; Devillers, Bertrand; Mosquera, Carlos; Pérez-Neira, Ana

doi:10.1186/1687-1499-2012-132

Research
Open access
Published: 05 April 2012

Jesús Arnau¹,
Bertrand Devillers²,
Carlos Mosquera¹ &
…
Ana Pérez-Neira²,³

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 132 (2012)

Abstract

As the demand for higher throughput satellites increases, multibeam architectures with smaller beam spots are becoming commonplace. If the same frequency is aggressively reused, the interference that results from serving many users simultaneously requires some sort of pre- or post-cancelation process. This article focuses on precoding and multiuser detection schemes for multibeam satellites, comparing hybrid on-board on-ground beamforming techniques with fully ground-based beamforming. Both techniques rely on the exchange of radiating element signals between the satellite and the corresponding gateway but, in the latter case, the interference mitigation process acts on all the radiating signals instead of the user beams directly, which provides extra degrees of freedom whenever the number of radiating elements is higher than the number of user beams. The analysis carried out in this study has shown that the potential advantage of ground-based beamforming may exceed 20% of the total throughput.

1 Introduction

The use of multiple spot beams in modern broadband satellites has increased during the last few years in an effort to serve higher throughput demands with a scalable cost, for which frequency reuse among users is required [1]. Thus, the same frequency band is shared by different beams to provide an overall higher throughput as long as the multiuser interference can be kept under control. This interference occurs due to the non-null side lobes of the beams' radiation patterns, and is related to the degree of reuse of the spectrum. A partial frequency reuse would exclude adjacent beams from using the same portion of spectrum (or color). However, more aggressive frequency reuse strategies [2, 3] can push forward the overall spectral efficiency provided that the resulting interference can be efficiently managed. On the other hand, the increase in the number of spot beams, in the desired capacity per beam, and in the frequency reuse might turn feeder links into a bottleneck. Higher frequency bands such as Q/V, optical communications or multigateway architectures need to be considered to accommodate the required capacity.

Conventional beamforming techniques are space-based (or on-board) architectures including analog or digital beamforming networks. On-board volume and calibration requirements are perhaps their main drawbacks. In an attempt to shift the complexity to the ground segment, more recent ground-based beamforming (GBBF) techniques rely on the exchange of radiating element signals between the satellite and the gateway. The forming of beams is realized on-ground with all the flexibility offered by on-ground digital signal processing [4]. At the cost of a higher feeder link bandwidth demand in those cases with more feeds or radiating elements than beams, more sophisticated and power-consuming techniques can be implemented. Flexibility is preserved, and changes in shape, traffic and pointing direction can be accommodated. Multiuser interference mitigation schemes such as precoding or multiuser detection can be jointly designed with the beamforming process at the gateway station: this joint optimization process is expected to provide some gain in terms of capacity, as we will show in this study. The interference mitigation process would act on all the radiating signals instead of the user beams directly, with the corresponding extra degrees of freedom.

More specifically, the forward link from the gateway to the users can be recast as a multiple-input multiple-output (MIMO) broadcast channel [5] from an information theoretic perspective, whereas the return link corresponds to the multiple access channel (MAC) [6]. In the forward link, the sum-rate capacity is known to be achieved by non-linear dirty paper coding (DPC) [7]. However, linear alternatives are more attractive in terms of complexity. For instance, zero-forcing (ZF) linear precoding performs close to DPC when many users are available and optimal user scheduling is performed [8]. By relaxing the zero interference constraint at each user, it has been proved that the so-called regularized channel inversion (an MMSE-like precoder) can significantly improve the performance [9].

As for the return link, the maximum sum-rate is known to be achieved via successive interference cancelation with minimum mean-squared error filtering at each stage (MMSE-SIC) [10]. However, much simpler linear alternatives, such as the zero-forcing (ZF) receiver or the plain MMSE receiver [11, 12], are also popular because of their lower computational complexity.

Recently, the performance of the return link of a full on-ground architecture was investigated in [13], featuring an adaptive coding and modulation (ACM) enhanced DVB-RCS physical layer. Results showed an increase in throughput at the cost of some loss in availability when linear MMSE was applied; with MMSE-SIC, a significant improvement in throughput and availability was reported. An equivalent analysis for the forward link was reported in [3], where again a throughput increase was achieved at the expense of a loss in availability.

Within the SatNEx III (Satellite Network of Experts) framework, funded by the European Space Agency, and building on previous results such as those above, the use of multiuser interference mitigation schemes together with GBBF was analyzed to evaluate its potential improvement with respect to more classic onboard beamforming settings. The main achievements of this research, which have been partly presented in [14, 15], are reported in detail in this article. It will be shown that full on-ground architectures tend to outperform hybrid architectures with fixed on-board weights and on-ground multiuser interference mitigation. This conclusion will be analytically supported under perfect channel state information (CSI), and a sufficient condition for the two architectures to be equivalent will be given. Moreover, detailed simulations will provide insight on the behavior of both architectures when imperfect channel knowledge is assumed, and also when the analog beamforming suffers from miscalibration issues.

The rest of the article is structured as follows: Section 2 describes the system model, Section 3 evaluates the performance of some processing techniques when the gateway has perfect CSI, Section 4 describes the modeling of non-perfect CSI, Section 5 reports simulation results and, finally, conclusions are summarized in Section 6.

Notation: Boldface uppercase letters denote matrices and boldface lowercase letters refer to column vectors. We denote by (·)^H the Hermitian transpose. The N × N identity matrix is denoted by I_N, and diag(a) builds a diagonal matrix from the elements of the vector a. Non-boldface lowercase letters are used to refer to the entries of a matrix: the (k, l)th entry of the matrix W is denoted by w_kl.

2 System description

The object of study consists of a single satellite which gives service to a region covered by K beam spots; a single user link is active at a given time and carrier block at each beam. The satellite in Figure 1 uses a fed reflector antenna array with N feeds to exchange signals with the users. In the absence of on-board beamforming, all these signals will be relayed through a feeder link with the gateway station (GW) on Earth. In the sequel, we will assume a single gateway and neglect the possible impairments caused by the feeder link. As a more conventional option, if on-board beamforming is applied, beamforming weights will be assumed to be fixed and K signals, one per beam, will be synthesized from the combination of the N > K feed signals, with the corresponding reduction in the feeder link required capacity. For the radiation pattern which will be considered, only a small subset of the feeds will be involved in the conformation of each individual beam. As a limit case with practical application in some cases, for N = K each radiating element feeds a different beam, and the two options considered in this article collapse to the same case.

We will refer to the full on-ground processing as feed processing, whereas the hybrid architecture with on-board fixed beamforming will be often quoted as beam processing.

2.1 Return link

Let K be the number of users on Earth and N the number of on-board feeds. At the feed level, the mathematical model of the return link reads

y = H s + n

(1)

where y is an N × 1 vector that contains the symbols received at each feed, s is a K × 1 stack of the symbols transmitted by each user (see Figure 2), n is the N × 1 vector of zero-mean complex white Gaussian noise, such that $E \{n n^{H}\} = N_{0} \cdot I_{N}$, and H represents the N × K channel matrix. The frequency-flat channel response is parameterized as

H = G L

(2)

where each matrix is described in the following paragraphs.

Feed radiation pattern and path losses. G is assumed to be an N × K matrix that accounts for the gains of the feed radiation pattern, the on-board attenuation and the free space losses; recall that the feeder link is considered transparent. Matrix G is not deterministic given the random positions of the users within their corresponding beam spots.

Atmospheric fading. The attenuation due to atmospheric phenomena, especially rain, can be significant in bands such as the Ka-band. In this study, the empirical probability density function (pdf) of this attenuation, obtained for the city of Rome, was used [16]. This pdf defines the marginal distribution of each link's attenuation, but it carries no information about the possible spatial correlation.

This spatial correlation is of great importance for the design of multi-satellite systems such as the one addressed in [17], and its influence in a general random MIMO channel has been covered in [18] but, as far as the authors know, little has been said about rain correlation in multibeam satellite systems when there is only one user per beam. In [19], the concept of correlated area (CA) was introduced, defined as a spatial region in which Earth stations experience highly correlated rain attenuation; the correlation with stations outside the CA would be considered negligible. The shape and size of a CA depend on many environmental factors, but diameters between 30 and 50 km are reported to be frequent.

Based on these figures, it seems reasonable to assume that each beamspot belongs to a different CA when the radius of the spots is large enough. This was done, for instance, in [20], where beamspots have a diameter of 250 km and therefore users from different cells are assumed to be uncorrelated in terms of rain fading. In our case, the beam radius is also large (100 beams to cover Europe), and correlation values have been proven to be negligible, as shown in Appendix 1. As a consequence, matrix L is assumed to be diagonal and its entries are considered independent.

Further refinements of the channel model were not considered since this simple characterization has been deemed to be useful for the intended comparisons.

In those cases for which a fixed beamforming is applied on-board, the above model must be modified accordingly. Let B be the K × N beamforming matrix and define $H_{b} ≐ B H$ and $n_{b} ≐ B n$; then the received signal becomes

y_{b} = H_{b} s + n_{b} .

(3)

Note that $Σ ≐ E \{n_{b} n_{b}^{H}\} = N_{0} \cdot B B^{H}$.
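As an illustration of the beam-space noise model, the following minimal sketch draws feed-level noise, applies a beamformer, and compares the sample covariance of the beam-level noise with N_0 B B^H. The dimensions, the noise power and the random matrix `B` are toy assumptions chosen for this example; they are not the ESA beamforming matrix or the N = 155, K = 100 scenario of the article.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K, N0 = 6, 3, 0.5  # toy sizes and noise power (assumptions)


def crandn(*shape):
    """Circularly symmetric complex Gaussian samples, unit variance per entry."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


B = crandn(K, N)                 # hypothetical stand-in for the fixed beamformer
T = 200_000                      # number of noise snapshots
n = np.sqrt(N0) * crandn(N, T)   # feed-level noise with E{n n^H} = N0 * I_N
n_b = B @ n                      # beam-level noise n_b = B n

Sigma_emp = (n_b @ n_b.conj().T) / T   # sample covariance of n_b
Sigma_th = N0 * B @ B.conj().T         # Sigma = N0 * B B^H
rel_err = np.linalg.norm(Sigma_emp - Sigma_th) / np.linalg.norm(Sigma_th)
print(f"relative covariance error: {rel_err:.4f}")
```

The beam-level noise is colored whenever B B^H is not a scaled identity, which is why Σ appears explicitly in the beam-space processing.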

2.2 Forward link

Analogously to the return link, we describe the signal model for the forward link, which reads as

r = H^{T} x + w

(4)

where r (resp. w) is a K × 1 vector containing the stack of the received signals (resp. noise components) at each user. As before, we assume that $E \{w w^{H}\} = N_{0} \cdot I_{K}$. The N × 1 vector x is the stack of the transmitted signals at all feeds. The forward link channel matrix is simply the transpose of that of the return link: H^T is of size K × N. Let us stress that this reciprocity is strictly limited to the mathematical formalism of both links. In fact, the reciprocity between the forward and return links does not hold in practice, since they typically involve different frequency bands.

For a fair comparison of all forward link scenarios that will be considered in the sequel, it is critical to define a common transmit power constraint. For this, we assume the following constraint on the average power transmitted at the feed level:

E \{x^{H} x\} \leq P_{T}

(5)

where P_T denotes the total transmit power.

In the case of a fixed on-board beamforming, we have that x = B^T x_b, where x_b is the stack of the on-ground transmitted signal in the beam space, keeping in mind that a perfectly calibrated and noiseless feeder link is assumed. The signal model (4) becomes

r = H_{b}^{T} x_{b} + w

(6)

where $H_{b} ≐ B H$ was defined above and expresses the principle of beamforming: the effect of the matrix B is essentially the linear combination of the radiation pattern of all N feeds to generate K beam radiation patterns.

3 Perfect CSI at the gateway

As a first step, let us assume that the gateway has perfect knowledge of the channel state, when acting either as transmitter or receiver. Throughout this section, we will establish measures of performance and show that, under linear combining, feed processing outperforms beam processing.

3.1 Return link

For the return link, and assuming on-ground feed processing, the MMSE combiner [21] that yields $\hat{s} = W^{H} y$ is

W^{H} = H^{H} {(N_{0} I_{N} + H H^{H})}^{- 1} = {(N_{0} I_{K} + H^{H} H)}^{- 1} H^{H}

(7)

whereas the processing of the beams would entail $\hat{s} = W_{b}^{H} y_{b}$ with

W_{b}^{H} = {(I_{K} + H_{b}^{H} Σ^{- 1} H_{b})}^{- 1} H_{b}^{H} Σ^{- 1} .

(8)
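The two forms of the feed-level combiner in (7), and the beam-level combiner in (8), can be checked numerically. The sketch below is only an illustration under assumed toy dimensions and random matrices `H` and `B` (hypothetical stand-ins, not the simulated channel of the article); it also verifies that (8) coincides with the familiar form H_b^H (H_b H_b^H + Σ)^{-1}.

```python
import numpy as np

rng = np.random.default_rng(1)
N, K, N0 = 6, 3, 0.1  # toy sizes and noise power (assumptions)


def crandn(*shape):
    """Circularly symmetric complex Gaussian samples, unit variance per entry."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


H = crandn(N, K)   # hypothetical N x K feed-level channel
B = crandn(K, N)   # hypothetical K x N fixed beamformer
Hh = H.conj().T

# Eq. (7): both expressions of the feed-level MMSE combiner W^H
W1 = Hh @ np.linalg.inv(N0 * np.eye(N) + H @ Hh)
W2 = np.linalg.solve(N0 * np.eye(K) + Hh @ H, Hh)

# Eq. (8): beam-level MMSE combiner with colored noise Sigma = N0 * B B^H
Hb = B @ H
Sigma = N0 * B @ B.conj().T
A = Hb.conj().T @ np.linalg.inv(Sigma)        # H_b^H Sigma^{-1}
Wb = np.linalg.solve(np.eye(K) + A @ Hb, A)

# Equivalent textbook form, used here only as a cross-check
Wb_alt = Hb.conj().T @ np.linalg.inv(Hb @ Hb.conj().T + Sigma)
print(np.allclose(W1, W2), np.allclose(Wb, Wb_alt))
```

The second form in (7) only requires solving a K × K system, which is the cheaper option when N > K.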

A key objective of this study is to compare the performance of these two approaches. To accomplish this task, we will make use of the mean-squared error (MSE) after combining, which is defined as $E \{| s - \hat{s} |^{2}\}$ .

Let Q_f denote the covariance matrix of the estimation error $s - \hat{s}$; it reads

Q_{f} = R_{x} - R_{x y} R_{y}^{- 1} R_{y x} = {(I_{K} + \frac{1}{N_{0}} H^{H} H)}^{- 1}

(9)

for the case of feed processing. For the case of beam processing, it would be

Q_{b} = {(I_{K} + H_{b}^{H} \frac{{(B B^{H})}^{- 1}}{N_{0}} H_{b})}^{- 1} = {(I_{K} + \frac{1}{N_{0}} H^{H} P H)}^{- 1}

(10)

with P = B^H (BB^H)^{-1} B.

Since the SINR for the ith user is given by 1/Q_ii − 1 [11], it makes sense to use the total MSE, given by $\sum_{i = 1}^{K} Q_{i i} = trace \{Q\}$, as a performance metric. Then, it can be shown that

trace \{Q_{b}\} \geq trace \{Q_{f}\}

(11)

as follows. Let us express both traces as

trace \{Q_{f}\} = \sum_{i = 1}^{K} λ_{i} (Q_{f}) = \sum_{i = 1}^{K} \frac{N_{0}}{N_{0} + λ_{i} (H^{H} H)}

(12)

and

trace \{Q_{b}\} = \sum_{i = 1}^{K} λ_{i} (Q_{b}) = \sum_{i = 1}^{K} \frac{N_{0}}{N_{0} + λ_{i} (H^{H} P H)}

(13)

where λ_i (H^HH) denotes the i th largest eigenvalue of H^HH. We have that (11) is an immediate consequence of the following, stronger result.

Theorem 1 Let H and B^H be two tall matrices of the same size with full column rank. Let P be the orthogonal projection matrix onto the range of B^H, that is, P = B^H (BB^H)^{-1} B. Then, it holds that σ_i(H) ≥ σ_i(PH), with σ_i(A) denoting the ith largest singular value of matrix A.

For the proof, see Appendix 2.
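The statement of Theorem 1 and the trace inequality (11) can be checked on a random example. The following sketch uses assumed toy dimensions and random complex matrices in place of the actual channel and beamformer; it is a numerical sanity check, not a substitute for the proof.

```python
import numpy as np

rng = np.random.default_rng(2)
N, K, N0 = 8, 4, 0.1  # toy sizes and noise power (assumptions)


def crandn(*shape):
    """Circularly symmetric complex Gaussian samples, unit variance per entry."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


H = crandn(N, K)   # hypothetical channel (tall, full column rank a.s.)
B = crandn(K, N)   # hypothetical beamformer (B^H tall, full column rank a.s.)
Bh, Hh = B.conj().T, H.conj().T

# P = B^H (B B^H)^{-1} B, the projector onto range(B^H)
P = Bh @ np.linalg.solve(B @ Bh, B)

# Singular values are returned in descending order
sv_H = np.linalg.svd(H, compute_uv=False)
sv_PH = np.linalg.svd(P @ H, compute_uv=False)

# Error covariance matrices of Eqs. (9) and (10)
Qf = np.linalg.inv(np.eye(K) + Hh @ H / N0)
Qb = np.linalg.inv(np.eye(K) + Hh @ P @ H / N0)
trace_f, trace_b = np.trace(Qf).real, np.trace(Qb).real

print("sigma_i(H)  :", np.round(sv_H, 3))
print("sigma_i(PH) :", np.round(sv_PH, 3))
print("trace Q_f, trace Q_b:", trace_f, trace_b)
```

Projecting the channel onto the K-dimensional range of B^H can only shrink its singular values, which is what increases the total MSE of beam processing.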

A sufficient condition for the traces to be equal is that PH = H. Since P is a projection matrix, this will happen whenever range(B^H) = range(H). However, since B is fixed and H is time-varying, it does not seem possible to meet this condition. Recall that, even if the fading is negligible, the users are assumed to be located randomly within their beam spots. Thus, assuming a constant H would require the feed pattern to be constant over each cell; it would also require users near the border of their cells to experience almost the same interference as if they were located close to the center. On account of all these facts, ensuring range(B^H) = range(H) is not realistic.

3.2 Forward link

As mentioned above, interference mitigation techniques in the forward link take the form of precoding at the gateway. In this article, we focus exclusively on linear precoding. In the case of on-ground feed processing, linear precoding is expressed as

x = F s

(14)

with F the N × K precoding matrix, and s the K × 1 symbol vector. The k th entry of s is the unit energy constellation symbol destined to the k th user. To comply with the transmit power constraint (5), the precoding matrix F has to satisfy

trace \{F F^{H}\} \leq P_{T} .

(15)

In the case of adaptive linear precoding in the beam space, we write

x_{b} = F_{b} s

(16)

where F_b is the K × K precoding matrix, in terms of which the transmit power constraint (5) becomes

trace \{B^{T} F_{b} F_{b}^{H} B^{*}\} \leq P_{T} .

(17)

In this section, we assume a zero-forcing (ZF) precoder. The ZF criterion targets the complete cancelation of the inter-user interference, by precoding with the pseudoinverse of the channel matrix. The corresponding expressions are

F = \sqrt{β} H^{*} {(H^{T} H^{*})}^{- 1}

(18)

F_{b} = \sqrt{β_{b}} H_{b}^{*} {(H_{b}^{T} H_{b}^{*})}^{- 1}

(19)

for the precoding in the feed space and beam space, respectively. The values of the constants β and β_b have to be chosen so as to comply with (15) and (17), respectively. Note that these particular versions of the ZF linear precoders equalize the signal-to-noise ratio (SNR) among users. The resulting SNR is given by

SNR_{f} = \frac{P_{T} / N_{0}}{trace \{{(H^{T} H^{*})}^{- 1}\}} = \frac{P_{T} / N_{0}}{trace \{{(H^{H} H)}^{- 1}\}}

(20)

SNR_{b} = \frac{P_{T} / N_{0}}{trace \{B^{T} {(B^{*} H^{*} H^{T} B^{T})}^{- 1} B^{*}\}} = \frac{P_{T} / N_{0}}{trace \{B^{H} {(B H H^{H} B^{H})}^{- 1} B\}}

(21)

for the feed and beam processing, respectively. We prove here that the SNR achieved by the feed processing is always greater than or equal to that associated with the beam processing: SNR_f ≥ SNR_b. This is a direct consequence of the following property.

Theorem 2 Let H and B^H be two tall matrices of the same size with full column rank. Then, the following inequality holds:

trace \{{(H^{H} H)}^{- 1}\} \leq trace \{B^{H} {(B H H^{H} B^{H})}^{- 1} B\} .

(22)

The equality is reached if H and B share the same left and right singular vectors, respectively.

For the proof, see Appendix 3.

As stated in the theorem, a sufficient condition for the beam processing not to suffer any performance loss with respect to the feed processing is that the matrices H and B share the same left and right singular vectors, respectively. However, as for the return link, this condition is not likely to be met in practice, since B is a fixed (non channel-adaptive) beamforming matrix while H is time-varying (due to the random nature of the users' positions).
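The ZF precoders (18) and (19), their power normalization, and the resulting SNRs (20) and (21) can be sketched as follows. The dimensions, the values of P_T and N_0, and the random matrices `H` and `B` are assumptions made only for this example; the function names `zf_feed` and `zf_beam` are hypothetical helpers, not part of any library.

```python
import numpy as np

rng = np.random.default_rng(3)
N, K, N0, PT = 8, 4, 1.0, 10.0  # toy sizes, noise power and power budget


def crandn(*shape):
    """Circularly symmetric complex Gaussian samples, unit variance per entry."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


def zf_feed(H, PT):
    """Feed-space ZF precoder, Eq. (18), scaled so that trace{F F^H} = PT."""
    Hc = H.conj()
    M_inv = np.linalg.inv(H.T @ Hc)
    beta = PT / np.trace(M_inv).real
    return np.sqrt(beta) * Hc @ M_inv, beta


def zf_beam(H, B, PT):
    """Beam-space ZF precoder, Eq. (19), scaled to meet the constraint (17)."""
    Hb = B @ H
    Fb0 = Hb.conj() @ np.linalg.inv(Hb.T @ Hb.conj())
    X = B.T @ Fb0                      # signal radiated by the feeds, unscaled
    beta_b = PT / np.trace(X @ X.conj().T).real
    return np.sqrt(beta_b) * Fb0, beta_b


H = crandn(N, K)   # hypothetical channel
B = crandn(K, N)   # hypothetical fixed beamformer
F, beta = zf_feed(H, PT)
Fb, beta_b = zf_beam(H, B, PT)

# With unit-energy symbols, each user sees r_k = sqrt(beta) s_k + w_k
snr_f, snr_b = beta / N0, beta_b / N0
print("SNR_f [dB]:", 10 * np.log10(snr_f), " SNR_b [dB]:", 10 * np.log10(snr_b))
```

In this sketch both precoders radiate exactly P_T from the feeds, so the SNR gap reflects only the restriction of the transmit signal to the range of B^T.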

4 Non-perfect CSI at the gateway

In a realistic scenario, the gateway does not know the actual values in the channel matrix H, but has only an estimate of them. The type and quality of these estimates, commonly based on the use of training sequences, will have an effect on the ultimate performance of the system. In this section, the estimation of the matrix H will be introduced. Some degree of uncertainty on the beamforming matrix B will also be discussed, while the noise power N₀ will be assumed perfectly known.

4.1 Channel estimation by training sequences

For the estimation of the channel in the return link, each user employs a distinct training sequence, known as its unique word (UW). The gateway, upon reception of all the sequences, estimates the values in the channel matrix.

If perfect symbol synchronism can be assumed, then Walsh-Hadamard sequences can be used as UWs, and it is possible to apply the pseudoinverse procedure [13] to estimate the channel matrix. However, due to the nature of the communication in the return link, symbol synchronism cannot be assumed.

In spite of this fact, the pseudoinverse procedure can still be used as long as good timing and frequency recovery is applied. Channel estimation under these circumstances would require the use of pseudorandom sequences with good cross-correlation properties, rather than Walsh-Hadamard sequences. Moreover, under accurate synchronization, pseudorandom sequences have been reported [13] to produce negligible correlation between the estimation errors of the different elements of the matrix.
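For the idealized symbol-synchronous case mentioned above, the pseudoinverse procedure can be sketched as a least-squares estimate from orthogonal training sequences. The code below is an illustration under assumptions that are not taken from the article: toy dimensions, a Sylvester-type Hadamard construction for the unique words, unit-energy training symbols, and a hypothetical helper `estimate_channel`. Under these assumptions the per-entry estimation error variance comes out as N_0/L.

```python
import numpy as np

rng = np.random.default_rng(4)
N, K, L, N0 = 6, 4, 64, 0.2  # toy sizes, training length and noise power


def crandn(*shape):
    """Circularly symmetric complex Gaussian samples, unit variance per entry."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of two)."""
    M = np.array([[1.0]])
    while M.shape[0] < n:
        M = np.block([[M, M], [M, -M]])
    return M


def estimate_channel(Y, S):
    """Least-squares channel estimate: Y times the pseudoinverse of S."""
    return Y @ np.linalg.pinv(S)


S = hadamard(L)[:K, :]   # K x L training matrix, one orthogonal row per user
H = crandn(N, K)         # hypothetical channel to be estimated

# Average the squared estimation error over independent noise realizations
trials, err_power = 200, 0.0
for _ in range(trials):
    Y = H @ S + np.sqrt(N0) * crandn(N, L)   # received training block
    err_power += np.mean(np.abs(estimate_channel(Y, S) - H) ** 2)
mse = err_power / trials
print("empirical error variance:", mse, " N0 / L:", N0 / L)
```

With asynchronous users, as discussed in the text, the rows of the training matrix would no longer be exactly orthogonal at the receiver, and pseudorandom sequences with good cross-correlation properties would take the place of the Hadamard rows.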

On account of the previous statements, and in the absence of a fixed beamforming, channel estimation would be modeled as

\hat{H} = H + E = H + \frac{N_{0}}{L} W

(23)

where L is the training sequence length and W is a matrix with independent zero-mean unit-variance Gaussian entries. For the case with beamforming on-board, it would read

{\hat{H}}_{b} = H_{b} + E_{b} = H_{b} + \frac{N_{0}}{L} B W .

(24)

In the forward link, symbol synchronism is no longer an issue. We consider that the precoder design is now based on a feed channel estimate ${\hat{H}}^{T} = H^{T} + E$ (or ${\hat{H}}_{b}^{T} = H_{b}^{T} + E_{b}$ in the beam space). Each row of ${\hat{H}}^{T}$ (or ${\hat{H}}_{b}^{T}$) is based on a channel estimation which is carried out separately at each user terminal and then reported to the gateway via a return channel (assumed ideal). Note that the reporting of ${\hat{H}}^{T}$ to the gateway implies feeding back (N − K)K more channel samples than for reporting ${\hat{H}}_{b}^{T}$. We assume L-length orthogonal training sequences, such that the entries of E (or E_b) are i.i.d. zero-mean complex circularly symmetric Gaussian random variables with variance inversely proportional to L.

4.2 Analog miscalibration

For the case of beam processing in the return link, the gateway must be aware of the exact beamforming weights that are set on board, since they are necessary to compute the noise covariance matrix in the MMSE combiner (8). Even though these weights are subject to calibration, it is very likely that their actual values will undergo some minor changes over time, mainly because of the non-ideal nature of the analog circuitry. As a consequence, the information at the gateway can be seen as an estimate of the actual beamformer. The following mathematical model is proposed:

\hat{B} = B + Δ B

(25)

where the entries in Δ B are of the form Δ B_ij = b_ij r_ij and r_ij are independent, real, zero-mean Gaussian random variables. This models a variation, both in the real and imaginary parts of the weights, that is random with given variance, but proportional to the original value.
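A minimal sketch of the miscalibration model (25) is given below. The matrix `B`, its dimensions and the variance value are toy assumptions, and `miscalibrated` is a hypothetical helper name. Because each perturbation is the original weight multiplied by a real factor, the real and imaginary parts of a weight are scaled together and its phase is unchanged.

```python
import numpy as np

rng = np.random.default_rng(5)
K, N = 3, 6  # toy sizes (assumptions)

# Hypothetical complex beamforming weights standing in for the on-board matrix B
B = (rng.standard_normal((K, N)) + 1j * rng.standard_normal((K, N))) / np.sqrt(2)


def miscalibrated(B, var, rng):
    """Return B + Delta_B with Delta_B[i, j] = b_ij * r_ij, r_ij ~ N(0, var), real."""
    r = np.sqrt(var) * rng.standard_normal(B.shape)
    return B + B * r


B_hat = miscalibrated(B, var=0.01, rng=rng)
rel_dev = (B_hat - B) / B   # recovers the real factors r_ij
print("largest relative deviation:", np.max(np.abs(rel_dev)))
```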

5 Simulation results

In order to further compare the performance of the proposed precoding and multiuser detection architectures, Monte Carlo simulations have been carried out according to the scenario described in Table 1. This scenario features K = 100 beams covering the whole of Europe. The satellite antenna pattern was provided by ESA, and corresponds to an array-fed reflector antenna with N = 155 feeds. Matrix B was also provided by ESA, as a typical beamforming matrix of current systems. It was designed so as to limit the level of interference among users in a conventional system (without any interference mitigation technique but with an adequate frequency reuse pattern).

Table 1 Simulation parameters

The user link has a total available bandwidth of 500 MHz; color schemes with frequency reuse factor equal to 3 and 1 were studied, corresponding to 166 and 500 MHz available bandwidth per beam, respectively. The reference scenario consists of a frequency reuse factor equal to 3, fixed beamforming and no processing at the gateway. Simulation results have been obtained for a number of interference mitigation techniques, both for the forward link and the return link. The purpose of this is to compare the performance of both architectures in as many different situations as possible. For illustration purposes, simulation results cover a large range of transmit powers, although it is important to stress that the most extreme values do not correspond to practical cases.

Results have been averaged over a total of 1,000 channel realizations, with the exception of those showing the average probability of non-availability, which required 10,000 iterations to yield a reasonable confidence interval. Apart from the fading, the randomness of the channel is due to the position of the users, which are assumed to be uniformly distributed within each spot. For each realization, the SINR for each user after interference mitigation is computed, and its throughput is then inferred according to the preliminary specifications of DVB-RCS2 in the RL and DVB-S2 in the FL.

5.1 Return link

The user link operates at 30 GHz (Ka-band), and is based on the DVB-RCS2 standard [22]. The baud rate is 4 Msymb/s and the guard bands amount to 11% of the carrier bandwidth [23]. Apart from the MMSE receiver presented in Section 3.1, the MMSE-SIC receiver has also been simulated, since it is known to be capacity achieving under ideal conditions, and therefore provides an upper bound on the achievable performance.

Figures 3 and 4 depict the evolution of the total average throughput as a function of the terminals' EIRP. Results have been averaged only over those realizations in which the link was active. To complement them, Figure 5 shows the average probability of non-availability for the different MUD techniques. It can be observed that multiuser detection yields a considerable increase in throughput, although at the cost of some loss in availability. In fact, only SIC detection manages to reduce the outage probability with respect to the benchmark scenario.

Moreover, full on-ground processing reports higher throughput figures both with perfect and non-perfect CSI. To further investigate the potential advantage of this strategy, Figure 6 represents the performance gain obtained in this case with respect to the hybrid architecture, using training sequences of length 128 symbols. Let t_f and ρ_f be the average throughput and average availability, respectively, of the overall system when full on-ground feed processing is employed, and define t_b and ρ_b as the corresponding counterparts in the beam processing case. The feed combining gain is defined as

γ ≐ \frac{t_{f} ρ_{f}}{t_{b} ρ_{b}} .

(26)
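As a small arithmetic illustration of definition (26), the sketch below evaluates the feed combining gain for placeholder inputs. The numbers are purely hypothetical and are not results from the article; `feed_combining_gain` is an illustrative helper name.

```python
def feed_combining_gain(t_f, rho_f, t_b, rho_b):
    """Feed combining gain of Eq. (26): ratio of availability-weighted throughputs."""
    return (t_f * rho_f) / (t_b * rho_b)


# Hypothetical placeholder values (not simulation results from the article)
gain = feed_combining_gain(t_f=12.0, rho_f=0.98, t_b=10.0, rho_b=0.98)
print(f"gain = {gain:.2f}, i.e. {100 * (gain - 1):.0f}% improvement")
```

Weighting the throughput by the availability prevents an architecture from appearing better merely because its average is taken over fewer active links.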

It can be seen that, despite the existence of channel estimation errors, there are always non-negligible improvements when choosing a full on-ground architecture. Moreover, the results on the hybrid architecture have so far assumed perfect knowledge of the fixed beamforming matrix. Recall now the error model (25) for the analog calibration, given by the error matrix ΔB = {b_ij r_ij}, and let β be the variance of the random variables r_ij. Following this model, Figure 7 depicts the evolution of the feed combining gain for different values of β, that is, for different degrees of uncertainty on the analog beamforming weights. Results show more significant feed combining gains when some degree of uncertainty is present.

5.2 Forward link

The user link in the downstream is assumed to operate at 20 GHz (K-band), and is based on the DVB-S2 standard. Besides the ZF precoder presented in Section 3.2, the following advanced precoders are considered and will be compared in the simulations:

The regularized channel inversion precoding [9].
The so-called UpConst MMSE precoder, which is based on the uplink-downlink duality [24]. This precoder solution was proposed in [3], where it is said to achieve a good compromise between throughput and availability.

Again, simulation results will use the average total throughput and availability as performance measures, but this time the total transmit power P_T will be used as a parameter rather than the EIRP since the directivity of the feeds is part of the feed radiation pattern data provided by ESA.

Let us assume first that the channel is perfectly known at the gateway. Figures 8 and 9 first compare the regularized channel inversion and ZF linear precoders. The regularized channel inversion significantly outperforms the more naive ZF precoder. Most importantly, the benefit of the full on-ground architecture (i.e., feed processing) is apparent both in terms of throughput and availability. For instance, at P_T = 30 dBW the regularized channel inversion in the feed space generates a 111% relative throughput increase with respect to the reference scenario, and 17% with respect to the same processing in the beam space. However, a slight decrease in system availability can still be observed with respect to the reference scenario. Figures 10 and 11 additionally consider the UpConst MMSE precoder, and illustrate again the benefit of the full on-ground architecture. It can be seen that the comparison between the regularized channel inversion and UpConst MMSE precoders depends on the value of the transmit power: in terms of availability, the regularized channel inversion outperforms the UpConst MMSE precoder for high values of the transmit power, and vice versa at low values of the transmit power.

We now drop the assumption of perfect CSI at the gateway, and analyze the robustness of the different schemes to channel estimation errors. Figures 12 and 13 depict the achievable throughput and system availability for a training sequence length L = 256. The full on-ground architecture still appears beneficial, especially throughput-wise. Moreover, from comparing Figures 11 and 13, it can be noticed that the regularized channel inversion appears to be more robust to imperfect CSI than the precoder based on the uplink-downlink duality.

In Figure 14, we compare the feed combining gain (26) associated with the regularized channel inversion for different degrees of CSI: perfect CSI, imperfect CSI with L = 1024, and L = 256. We can observe the robustness of the feed processing with regularized channel inversion. In fact, quite surprisingly, for moderate to high values of the transmit power the relative gain generated by the full on-ground architecture for this precoder increases as the degree of CSI decreases, reaching 24% with P_T = 30 dBW and L = 256 symbols.

6 Conclusions

The results obtained in the previous sections show that feed-level techniques tend to outperform beam processing ones, a fact that has been analytically proved for the case of perfect CSI. In the particular case of the return link, this gain may be rather small when the channel estimation errors are noticeable but the estimation of the on-board analog beamforming is accurate. On the contrary, if the uncertainty on the beamforming is high, then the advantage of using all the information from the feeds seems to be much higher: with the greatest level of uncertainty simulated, the feed combining gain reaches 20% at an EIRP of 40 dBW. As regards the forward link, the uncertainty about the analog beamforming is not relevant, and the feed processing gain amounts to 24% with P_T = 30 dBW and a realistic level of CSI. It should be noted, however, that working with the 155 feed signals would require considerably more bandwidth in the feeder link. Therefore, there exists a tradeoff between performance and feeder link requirements, and the choice of the most suitable processing architecture would need to take into account all these considerations.

Appendix 1: Spatial correlation of rain fading

Next, we present a simple model for the rain fading correlation between two links to assess its potential impact on the interference mitigation schemes. To start with, we will follow the well-known model which specifies that the joint distribution of rain attenuation on two slant paths, which we will call A₁ and A₂, is lognormal and presents the following correlation factor [25]

r = \frac{e^{c (d) σ_{1} σ_{2}} - 1}{\sqrt{(e^{σ_{1}^{2}} - 1) (e^{σ_{2}^{2}} - 1)}}

(27)

where d is the horizontal distance between both points on Earth, $σ_{i}^{2}$ is the variance of the marginal distribution of A_i and c(d) is the correlation factor between the rain rates. According to [26], this factor may be accurately modeled as

c (d) = e^{- {(\frac{d}{d_{0}})}^{s_{0}}}

(28)

where d₀ is the distance at which c(d) = 1/e (usually called decorrelation distance) and s₀ is a shape parameter. Although values for both variables must be set according to detailed environmental data, in [26] it has been shown that, for the instantaneous correlation between two points, s₀ ≈ 1, yielding

c (d) = e^{- \frac{d}{d_{0}}} .

(29)

As for d₀, different values have been proposed in the literature and, in any case, we must note that these will be highly dependent on the geographical area. For instance, in articles devoted to terrestrial communications like [27, 28], values around d₀ = 0.46 km are reported, while [25] uses d₀ = 1.844 km; [26] even reports values up to d₀ = 7 km for the State of Oklahoma, although this value was obtained by averaging over a very long period of time.

We will now perform some simulations in order to check the evolution of r with respect to d for different values of d₀. Since, as stated above, we have assumed the same statistics for the marginal attenuations over each path, Equation (27) simplifies to

r = \frac{e^{c (d) σ^{2}} - 1}{e^{σ^{2}} - 1} .

(30)

The results obtained are shown in Figure 15, where σ² = 1.58. As we can see, the correlation factor quickly approaches zero as the distance increases. In Figure 16, the same data are represented on a logarithmic scale in order to resolve such small values; the range of distances has also been multiplied by four. We can see that, even for by far the most pessimistic case, the correlation values are very small, if not close to zero, beyond d = 60 km.
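As a rough numerical illustration of Equations (29) and (30), the following Python sketch evaluates r for the three decorrelation distances quoted above. The function names and the grid of distances are assumptions made for this example only; they do not reproduce the exact settings behind Figures 15 and 16.

```python
import numpy as np

def rain_rate_correlation(d_km, d0_km):
    """Eq. (29): c(d) = exp(-d / d0), i.e. Eq. (28) with s0 = 1."""
    return np.exp(-np.asarray(d_km, dtype=float) / d0_km)

def attenuation_correlation(d_km, d0_km, sigma2=1.58):
    """Eq. (30): r = (exp(c(d) * sigma^2) - 1) / (exp(sigma^2) - 1)."""
    c = rain_rate_correlation(d_km, d0_km)
    # expm1(x) = exp(x) - 1, accurate even when x is tiny
    return np.expm1(c * sigma2) / np.expm1(sigma2)

# Hypothetical sample distances (km), chosen only for illustration
distances_km = np.array([0.0, 1.0, 5.0, 20.0, 60.0])

for d0 in (0.46, 1.844, 7.0):  # decorrelation distances cited in the text
    r = attenuation_correlation(distances_km, d0)
    print(f"d0 = {d0:5.3f} km:", np.array2string(r, precision=3))
```

Using `np.expm1` rather than `np.exp(x) - 1` avoids loss of precision at large distances, where c(d)σ² becomes very small.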

Appendix 2: Proof of Theorem 1

The goal is to prove

σ_{i} (H) \geq σ_{i} (P H)

(31)

with σ_i (A) representing the ith largest singular value of matrix A.

Consider now B = U ΣV^H , the singular value decomposition (SVD) of B, then

P = B^{H} {(B B^{H})}^{- 1} B = V Σ^{H} {(Σ Σ^{H})}^{- 1} Σ V^{H} = V (\begin{matrix} I_{K} & 0 \\ 0 & 0 \end{matrix}) V^{H} = V Φ V^{H} .

(32)

Since V is a unitary matrix, it holds that

σ_{i} (V^{H} H) = σ_{i} (H)

(33)

and

σ_{i} (V Φ V^{H} H) = σ_{i} (Φ V^{H} H) .

(34)

Let us now define A ≐ V^H H. On account of the previous statements, proving (31) is equivalent to proving

σ_{i} (A) \geq σ_{i} (Φ A)

(35)

that is, each singular value of a matrix A is greater than or equal to the corresponding singular value of the same matrix after setting some rows to zero. To prove this fact, we will make use of the following property: let A be in general any tall matrix such that A = [a₁ a₂ ... a_k] and define A_r = [a₁ a₂ ... a_r]; then for all r from 1 to k - 1 it holds that [29]

σ_{1} (A_{r + 1}) \geq σ_{1} (A_{r}) \geq σ_{2} (A_{r + 1}) \geq \dots \geq σ_{r} (A_{r + 1}) \geq σ_{r} (A_{r}) \geq σ_{r + 1} (A_{r + 1}) .

(36)

This interlacing property will prove useful for our purpose even though matrix A loses rows and not columns. Recall now that matrix A is of size N × N. If we write

A = (\begin{matrix} A_{1} \\ A_{2} \end{matrix})

(37)

with both block matrices of size K × N, and

Φ A = (\begin{matrix} A_{1} \\ 0 \end{matrix})

(38)

then it is possible to define

\tilde{A} = (\begin{matrix} A_{1}^{H} & A_{2}^{H} \\ 0 & 0 \end{matrix})

(39)

whose singular values take the form $σ (\tilde{A}) = [σ_{1} (A), σ_{2} (A), \dots, σ_{K} (A), 0, \dots, 0]$. If we now remove columns from the right, the interlacing property tells us that

σ_{i} ((\begin{matrix} A_{1} \\ 0 \end{matrix})) \leq σ_{i} (\tilde{A})

(40)

which implies that

σ_{i} (Φ A) \leq σ_{i} (A)

(41)

and concludes the proof. □
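The statement (31) can be sanity-checked numerically. The sketch below is only an illustration under assumed dimensions (N = 8 feeds, K = 3 beams) with randomly drawn complex matrices; the helper `crandn` and the variable names are hypothetical and do not come from the simulations reported in this article.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 8, 3  # assumed sizes for illustration: N feeds, K beams (N >= K)

def crandn(rows, cols):
    """Matrix with i.i.d. circular complex Gaussian entries of unit variance."""
    real = rng.standard_normal((rows, cols))
    imag = rng.standard_normal((rows, cols))
    return (real + 1j * imag) / np.sqrt(2)

H = crandn(N, K)  # random stand-in for the channel matrix
B = crandn(K, N)  # random stand-in for the on-board beamforming matrix

# Orthogonal projector onto the row space of B, as in Eq. (32)
P = B.conj().T @ np.linalg.inv(B @ B.conj().T) @ B

# np.linalg.svd returns singular values sorted in descending order
sv_H = np.linalg.svd(H, compute_uv=False)
sv_PH = np.linalg.svd(P @ H, compute_uv=False)

# Eq. (31): sigma_i(H) >= sigma_i(P H) for every i (small numerical tolerance)
theorem1_holds = bool(np.all(sv_H >= sv_PH - 1e-12))
print("sigma_i(H) :", np.round(sv_H, 3))
print("sigma_i(PH):", np.round(sv_PH, 3))
print("Eq. (31) holds for this draw:", theorem1_holds)
```

A single random draw is of course not a proof; it only shows the ordering of singular values that the interlacing argument guarantees.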

Appendix 3: Proof of Theorem 2

In this appendix, we prove that

trace \{{(H^{H} H)}^{- 1}\} \leq trace \{B^{H} {(B H H^{H} B^{H})}^{- 1} B\} .

(42)

With the following singular value decomposition H = V Σ_H U^H, the left-hand side in (42) can be simplified as

trace \{{(H^{H} H)}^{- 1}\} = \sum_{k = 1}^{K} \frac{1}{σ_{k}^{2} (H)}

(43)

where $σ_{k} (H)$ denotes the kth largest singular value of H.

Similarly, denoting B = W Σ_B Q^H , the right-hand side in (42) can easily be worked out as

trace \{B^{H} {(B H H^{H} B^{H})}^{- 1} B\} = trace \{{(Z_{1}^{H} Z_{1})}^{- 1} {(Σ_{H}^{H} Σ_{H})}^{- 1}\}

(44)

with the following definitions

Z ≐ Q^{H} V

(45)

≐ (\begin{matrix} Z_{1} & Z_{2} \\ Z_{3} & Z_{4} \end{matrix})

(46)

where the submatrix Z₁ is of size K × K, submatrices Z₂ and $Z_{3}^{H}$ are K × (N - K), and submatrix Z₄ is (N - K) × (N - K). We denote the kth largest eigenvalue of $Z_{1}^{H} Z_{1}$ by $λ_{k} (Z_{1}^{H} Z_{1})$. Let us first observe that

\begin{matrix} λ_{k} (Z_{1}^{H} Z_{1}) \leq 1, & k = 1, \dots, K . \end{matrix}

(47)

Indeed, since Z is a unitary matrix, we have that Z^HZ = I_N, which implies that

Z_{1}^{H} Z_{1} + Z_{3}^{H} Z_{3} = I_{K} .

(48)

With the following eigenvalue decomposition $Z_{1}^{H} Z_{1} = M diag (λ_{1} (Z_{1}^{H} Z_{1}), \dots, λ_{K} (Z_{1}^{H} Z_{1})) M^{H}$ , we can rewrite (48) as

diag (λ_{1} (Z_{1}^{H} Z_{1}), \dots, λ_{K} (Z_{1}^{H} Z_{1})) = I_{K} - M^{H} Z_{3}^{H} Z_{3} M .

(49)

Hence, the matrix $M^{H} Z_{3}^{H} Z_{3} M$ has to be diagonal. Moreover, its diagonal elements have to be nonnegative since $Z_{3}^{H} Z_{3}$ is positive semidefinite, so that each $λ_{k} (Z_{1}^{H} Z_{1})$ equals one minus a nonnegative number, which proves (47).

Finally, by Theorem H.1.h in [30], we have that

trace \{{(Z_{1}^{H} Z_{1})}^{- 1} {(Σ_{H}^{H} Σ_{H})}^{- 1}\} \geq \sum_{k = 1}^{K} \frac{1}{λ_{K - k + 1} (Z_{1}^{H} Z_{1})} \frac{1}{σ_{k}^{2} (H)}

(50)

\geq \sum_{k = 1}^{K} \frac{1}{σ_{k}^{2} (H)}

(51)

where (51) follows from (47), and concludes the proof.

Note that the inequality becomes an equality if Z = I_N, that is, Q^H V = I_N. In other words, equality is reached if the left singular vectors of H coincide with the right singular vectors of B. □
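The inequality (42), the eigenvalue bound (47) and the equality condition can be checked numerically with a short Python sketch. The dimensions (H of size N × K, B of size K × N, with N = 8 and K = 3), the random matrices and the helper `crandn` are assumptions made for illustration. The equality case uses B = H^H, which is one particular choice for which the right singular vectors of B coincide with the left singular vectors of H.

```python
import numpy as np

rng = np.random.default_rng(1)
N, K = 8, 3  # assumed sizes for illustration (N >= K)

def crandn(rows, cols):
    """Matrix with i.i.d. circular complex Gaussian entries of unit variance."""
    real = rng.standard_normal((rows, cols))
    imag = rng.standard_normal((rows, cols))
    return (real + 1j * imag) / np.sqrt(2)

def lhs_trace(H):
    """Left-hand side of Eq. (42): trace{(H^H H)^-1}."""
    return np.trace(np.linalg.inv(H.conj().T @ H)).real

def rhs_trace(H, B):
    """Right-hand side of Eq. (42): trace{B^H (B H H^H B^H)^-1 B}."""
    BH = B @ H
    return np.trace(B.conj().T @ np.linalg.inv(BH @ BH.conj().T) @ B).real

H = crandn(N, K)
B = crandn(K, N)

lhs = lhs_trace(H)
rhs = rhs_trace(H, B)
print(f"lhs = {lhs:.4f}, rhs = {rhs:.4f}, Eq. (42) holds: {lhs <= rhs}")

# Eq. (47): eigenvalues of Z1^H Z1 are at most 1, with Z = Q^H V
V_full, _, _ = np.linalg.svd(H)   # H = V Sigma_H U^H, V is N x N
_, _, Qh = np.linalg.svd(B)       # B = W Sigma_B Q^H, Qh = Q^H is N x N
Z1 = (Qh @ V_full)[:K, :K]
lam = np.linalg.eigvalsh(Z1.conj().T @ Z1)
print("max eigenvalue of Z1^H Z1:", lam.max())

# Equality case (assumed instance): B = H^H
rhs_eq = rhs_trace(H, H.conj().T)
print(f"equality case: lhs = {lhs:.6f}, rhs = {rhs_eq:.6f}")
```

With B = H^H the matrix (BH)^{-1} B reduces to the pseudoinverse of H, which is why both traces coincide in that case.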

References

[1] Brandel D, Watson W, Weinberg A: NASA's advanced tracking and data relay satellite system for the years 2000 and beyond. Proc IEEE 1990, 78: 1141-1151. 10.1109/5.56928
[2] Caire G, Debbah M, Cottatellucci L, De Gaudenzi R, Rinaldo R, Mueller R, Gallinaro G: Perspectives of adopting interference mitigation techniques in the context of broadband multimedia satellite systems. ICSSC 2005, 23rd AIAA International Communications Satellite Systems Conference, Rome, Italy 2005, 1: 1-5.
[3] Cottatellucci L, Debbah M, Gallinaro G, Mueller R, Neri M, Rinaldo R: Interference mitigation techniques for broadband satellite systems. Proc 24th AIAA Int Commun Satell Systems Conf, ICSSC, San Diego CA 2006, 1: 1-13.
[4] Angeletti P, Alagha N: Space/ground beamforming techniques for emerging hybrid satellite terrestrial networks. 27th International Communications Satellite Systems Conference (ICSSC 2009), Edinburgh, UK 2009, 1: 1-6.
[5] Vishwanath S, Jindal N, Goldsmith A: Duality, achievable rates, and sum-rate capacity of Gaussian MIMO broadcast channels. IEEE Trans Inf Theory 2003, 49: 2658-2668. 10.1109/TIT.2003.817421
[6] Somekh O, Shamai S: Shannon-theoretic approach to a Gaussian cellular multiple-access channel with fading. IEEE Trans Inf Theory 2000, 46: 1401-1425. 10.1109/18.850679
[7] Caire G, Shamai (Shitz) S: On the achievable throughput of a multi-antenna Gaussian broadcast channel. IEEE Trans Inf Theory 2003, 49: 1691-1707. 10.1109/TIT.2003.813523
[8] Yoo T, Goldsmith A: On the optimality of multiantenna broadcast scheduling using zero-forcing beamforming. IEEE J Sel Areas Commun 2006, 24: 528-541.
[9] Peel C, Hochwald B, Swindlehurst A: A vector-perturbation technique for near-capacity multiantenna multiuser communication. Part I: channel inversion and regularization. IEEE Trans Commun 2005, 53: 195-202. 10.1109/TCOMM.2004.840638
[10] Tse D, Viswanath P: Fundamentals of Wireless Communication. Cambridge University Press, New York, NY, USA; 2005.
[11] Choi J: Optimal Combining and Detection: Statistical Signal Processing for Communications. 1st edition. Cambridge University Press, New York, NY, USA; 2010.
[12] Paulraj A, Nabar R, Gore D: Introduction to Space-Time Wireless Communications. 1st edition. Cambridge University Press, New York, NY, USA; 2008.
[13] Gallinaro G, Debbah M, Müller R, Rinaldo R, Vernucci A: Interference mitigation for the reverse-link of interactive satellite networks. 9th International Workshop on Signal Processing for Space Communications, Noordwijk, The Netherlands 2006, 1: 1-7.
[14] Arnau-Yanez J, Bergmann M, Candreva E, Corazza G, de Gaudenzi R, Devillers B, Gappmair W, Lombardo F, Mosquera C, Perez-Neira A, Thibault I, Vanelli-Coralli A: Hybrid space-ground processing for high-capacity multi-beam satellite systems. IEEE Global Telecommunications Conference (GLOBECOM 2011), Houston TX 2011, 1: 1-6.
[15] Devillers B, Pérez-Neira A, Mosquera C: Joint linear precoding and beamforming for the forward link of multi-beam broadband satellite systems. Proc IEEE Global Communications Conference, GLOBECOM, Houston, Texas 2011, 1: 1-6.
[16] Zorba N, Realp M, Perez-Neira A: An improved partial CSIT random beamforming for multibeam satellite systems. 10th International Workshop on Signal Processing for Space Communications, 2008 SPSC, Rhodes Island, Greece 2008, 1: 1-8.
[17] Liolis KP, Panagopoulos AD, Cottis PG: Multi-satellite MIMO communications at Ku-band and above: investigations on spatial multiplexing for capacity improvement and selection diversity for interference mitigation. EURASIP J Wirel Commun Netw 2007, 2007: 16-16.
[18] Ishimaru A, Ritcey J, Jaruwatanadilok S, Kuga Y: A MIMO propagation channel model in a random medium. IEEE Trans Antennas and Propagation 2010, 58: 178-186.
[19] Castro M, Granados G: Cross-layer packet scheduler design of a multibeam broadband satellite system with adaptive coding and modulation. IEEE Trans Wirel Commun 2007, 6: 248-258.
[20] Chatzinotas S, Zheng G, Ottersten B: Joint precoding with flexible power constraints in multibeam satellite systems. IEEE Global Telecommunications Conference (GLOBECOM), Houston TX 2011, 1: 1-5.
[21] Kay SM: Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall, Inc., Upper Saddle River, NJ, USA; 1993.
[22] DVB.org, Digital Video Broadcasting (DVB); Second Generation DVB Interactive Satellite System; Part 2: Lower Layers for Satellite standard 2011.
[23] Brandt H, Lücke O, Boussemart V, Párraga-Niebla C, Flo T, Kissling C, Schweikert R: Resources management using adaptive fade mitigation techniques in DVB-S2/RCS multi-beam systems. Proceedings of the 25th International Communications Satellite Systems Conference (ICSSC 2007), Seoul, South Korea 2007, 1: 1-13.
[24] Viswanath P, Tse DNC: Sum capacity of the vector Gaussian broadcast channel and uplink-downlink duality. IEEE Trans Inf Theory 2003, 49: 1912-1921. 10.1109/TIT.2003.814483
[25] Gremont B, Filip M: Simulation of a high frequency satellite link with a fade countermeasure. IEE National Conference on Antennas and Propagation, York, UK 1999, 1: 164-168.
[26] Ciach GJ, Krajewski WF: Analysis and modeling of spatial correlation structure in small-scale rainfall in central Oklahoma. Adv Water Resour 2006, 29(10): 1450-1463. 10.1016/j.advwatres.2005.11.003
[27] Arapoglou P-DM, Kartsakli E, Chatzarakis GE, Cottis PG: Cell-site diversity performance of LMDS systems operating in heavy rain climatic regions. Int J Infrared Millimeter Waves 2004, 25: 1345-1359. 10.1023/B:IJIM.0000045144.01224.bf
[28] Cheffena M, Braten L, Ekman T: On the space-time variations of rain attenuation. IEEE Trans Antennas Propag 2009, 57: 1771-1782.
[29] Golub GH, Van Loan CF: Matrix Computations (Johns Hopkins Studies in Mathematical Sciences). 3rd edition. The Johns Hopkins University Press, Baltimore, MD; 1996.
[30] Marshall AW, Olkin I: Inequalities: Theory of Majorization and its Applications. Academic Press, New York, NY; 1979.

Acknowledgements

The authors would like to acknowledge the work of the University of Bologna and Technical University of Graz teams, in particular, M. Bergmann, W. Gappmair, E. A. Candreva, G. E. Corazza, F. Lombardo, I. Thibault, and A. Vanelli-Coralli. We are also indebted to Riccardo de Gaudenzi for his valuable comments. This research was supported by ESA contract 23089/10/NL/CLP "SatNEx Network of Experts", the European Regional Development Fund (ERDF) and the Spanish Government under projects DYNACS (TEC2010-21245-C02-02/TCM) and COMONSENS (CONSOLIDER-INGENIO 2010 CSD2008-00010), and the Galician Regional Government under projects "Consolidation of Research Units" 2009/62 and 2010/85. The work of B. Devillers was also partially supported by the Spanish Government under project TEC2010-17816 (JUNTOS). The work of A. Pérez-Neira was supported by the Spanish Government under project TEC2008-06327-C03-01 and the Catalan Government under the grant 22009SGR0891. Some preliminary results of this study were presented at the Asilomar and Globecom 2011 conferences.

Author information

Authors and Affiliations

Signal Theory and Communications Department, University of Vigo, 36310, Vigo, Spain
Jesús Arnau & Carlos Mosquera
Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), 08860, Castelldefels, Barcelona, Spain
Bertrand Devillers & Ana Pérez-Neira
Department of Signal Processing and Communications, Universitat Politècnica de Catalunya, 08034, Barcelona, Spain
Ana Pérez-Neira

Corresponding author

Correspondence to Jesús Arnau.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Arnau, J., Devillers, B., Mosquera, C. et al. Performance study of multiuser interference mitigation schemes for hybrid broadband multibeam satellite architectures. J Wireless Com Network 2012, 132 (2012). https://doi.org/10.1186/1687-1499-2012-132

Received: 15 November 2011
Accepted: 05 April 2012
Published: 05 April 2012
DOI: https://doi.org/10.1186/1687-1499-2012-132
