Achievable rates optimization for broadcast channels using finite size constellations under transmission constraints

Mheich, Zeina; Alberge, Florence; Duhamel, Pierre

doi:10.1186/1687-1499-2013-254

Research
Open access
Published: 31 October 2013

Achievable rates optimization for broadcast channels using finite size constellations under transmission constraints

Zeina Mheich^1,2,3,
Florence Alberge^1,2,3 &
Pierre Duhamel^1,2,3

EURASIP Journal on Wireless Communications and Networking volume 2013, Article number: 254 (2013) Cite this article

1369 Accesses
7 Citations
Metrics details

Abstract

In this paper, maximal achievable rate regions are derived for power-constrained AWGN broadcast channel involving finite constellations and two users. The achievable rate region is studied for various transmission strategies including superposition coding and compared to standard schemes such as time sharing. The maximal achievable rates are obtained by optimizing over both the joint distribution of probability and over the constellation symbol positions. A numerical solution is proposed for solving this non-convex optimization problem. Then, we consider several variations of the same problem by introducing various constraints on the optimization variables. The aim is to evaluate efficiency vs. complexity tradeoffs of several transmission strategies, some of which (the simplest ones) can be found in actual standards. The improvement for each scheme is evaluated in terms of SNR savings for target achievable rates or/and percentage of gain in achievable rates for one user compared to a reference scheme. As an application, two scenarios of coverage areas and user alphabets are considered. This study allows to evaluate with practical criteria the performance improvement brought by more advanced schemes.

1 Introduction

During the past few decades, information networks have witnessed tremendous and rapid advances, based on the important growth in the adoption of new wireless technologies, applications and services, first from cellular networks and more recently for computer networks (WLANs). Consequently, wireless networks are exposed to capacity and coverage problems, and the focus is now shifting towards capturing some of the aspects of realistic networks by studying natural network models such as models with broadcasting.

In 1972, achievable rate region is obtained by Cover in [1] for Gaussian broadcast channels with two outputs and generalized by Bergmans to broadcast channels with any number of outputs [2]. Roughly a year later, the optimality of the sets of achievable rates was established by Bergmans [3] and Gallager [4]. Superposition coding is a possible solution to achieve good rate regions in which information intended for high-noise receivers and information intended for low-noise receivers are superimposed and transmitted simultaneously on the same radio resource. The low-noise receivers can always decode messages intended for the high-noise receivers. Thus, they effectively cancel out the interference due to the signal intended for the high-noise receivers, and then decode their own message. The high-noise receivers decode their messages by treating the low-noise receivers message as noise. Superposition coding appears in several contexts in information theory and is closely related to multilevel coding and unequal error protection [5, 6]. Cover showed [1] that the superposition coding reaches the theoretical limit of the capacity region for two user Gaussian broadcast channel using an infinite Gaussian input alphabet for each user. A treatment of the case of multiple transmitter/receivers for the band-limited additive white Gaussian noise channel is given by Bergmans and Cover in [7], where it is proved that superposition coding can achieve higher-rate region than orthogonal schemes such as frequency-division multiple access (FDMA) or time division multiple access (TDMA). However, in actual transmission systems, the channel input is constrained to a finite size alphabet with equal probability symbols. A well-known practical implementation of superposition coding is hierarchical modulation, also called layered modulation, which uses constellations with non-uniformly spaced signal points creating different levels of error protection. Hierarchical modulation is used to mitigate the cliff effect in digital television broadcast and is included in various standards, such as Digital Video Broadcast for Terrestrial Television (DVB-T) [8], DVB to Handhelds (DVB-H), and DVB Satellite services to Handhelds (DVB-SH) [9] standard proposal for mobile digital TV transmission. A study about the performance of hierarchical modulation and a comparison with time sharing strategy in terms of achievable rates can be found in [10].

The restriction imposed by practical systems in using finite signaling constellation and equiprobable symbols reduces the achievable rates and leads to a gap with the capacity region achieved with Gaussian input alphabets for AWGN broadcast channel. This gap can be reduced using a technique called constellation shaping. In fact, most results for constellation shaping with finite signal constellations consider only point-to-point communication systems [11]. Then, the concept of constellation shaping has been adapted to most modern coding and modulation techniques as for example turbo coding and BICM schemes [12–19]. For broadcast channels, the achievable rate region for two-user AWGN broadcast channels with finite input alphabets is derived in [20] when superposition of modulated signal is used as transmission strategy. In their work, the authors assume a uniform distribution over the finite input set. To our knowledge, no study is available about the maximization of the achievable rate region for two-user AWGN broadcast channels with finite size constellations by optimizing over both the joint probability distribution and constellation symbol positions for a broadcast transmission strategy. This general framework encompasses hierarchical modulations as a special case. In this paper, maximal achievable rate regions are derived for power-constrained AWGN broadcast channel of two users with M-pulse amplitude modulation (M-PAM) constellations of M points using various transmission strategies. A numerical solution is proposed for solving this non-concave optimization problem. In a typical broadcast system, there is a trade off between achievable rates and coverage areas. Therefore, we are interested in determining the transmission strategy which provides the best achievable rates or the maximal SNR gain for a given coverage scenario. The compromise between the simplicity of implementation and expected gains is also evaluated.

The organization of the paper is as follows. Section 2 recalls some information theory results on broadcast channels and degraded broadcast channels. In section 3, various transmission strategies for broadcast systems are described. Section 4 gives a formulation of the problem in terms of optimization for the various transmission strategies under consideration. Then, computational aspects are discussed. An iterative algorithm is proposed for the computation of maximal achievable rate regions using superposition coding (general case) and M-PAM constellation or in the particular case of superposition modulation. The proposed algorithm can handle an optimization with respect to the joint distribution of probability or with respect to the positions of constellation symbols. Both variables can also be considered jointly. Obviously, the best results are obtained for the most general case. Our target is to (1) evaluate the loss experienced using simple schemes, (2) identify situations in which complex schemes (non-standard) lead to significant improvements. As an application, we consider, in section 5, several scenarios of coverage areas and user alphabets, and we give conclusions about the transmission strategies which can provide the best trade off between efficiency and complexity of implementation.

2 AWGN broadcast channels

A two-receiver (users) broadcast channel (BC) consists of an input alphabet , two output alphabets $Y_{1}$ (user 1), $Y_{2}$ (user 2), and a conditional pdf $P_{Y_{1} Y_{2} | X}$ on $Y_{1} \times Y_{2}$ . Let X, Y₁, and Y₂ be random variables representing the input and outputs of the BC. Figure 1 depicts the two users BC with two independent messages W₁ and W₂. The encoder generates a codeword xⁿ(w₁,w₂) of length n based on these two messages. Each user receives, respectively, $y_{1}^{n}$ and $y_{2}^{n}$ . A BC is said to be physically degraded if $P_{Y_{1} Y_{2} | X} (y_{1}, y_{2} | x) = P_{Y_{1} | X} (y_{1} | x) \cdot P_{Y_{2} | Y_{1}} (y_{2} | y_{1})$ (i.e., X→Y₁→Y₂ form a Markov chain). A BC is said to be stochastically degraded or degraded if there exists a random variable $\tilde{Y_{1}}$ which has the same conditional pdf as Y₁ given X, such that $X \to \tilde{Y_{1}} \to Y_{2}$ forms a Markov chain. We are interested in degraded BC because its capacity region is known, while it is not available for the general case.

In our system model, W₁ denotes the private message intended for receiver 1 only, and W₂ is a common message for both receivers. A typical example of this situation is digital TV broadcasting to two different groups of receivers, classified according to their channel conditions, where the basic signal (common signal) should be available to all receivers. The higher quality is realized by adding the basic signal with an incremental signal (private signal for receivers of good channel conditions) which carries TV signal with a high data rate, such as HDTV.

Let R₁ and R₂ be the rates at which the transmitter is sending W₁ and W₂, respectively. Thus, user 1 achieves R₁+R₂, while user 2 achieves R₂. The capacity region of the degraded broadcast channel X→Y₁→Y₂ in Figure 1 is the convex hull of the closure of rate pairs (R₁+R₂,R₂) satisfying

\begin{align} R_{1} & \leq I (X; Y_{1} | U) \end{align}

(1)

\begin{align} R_{2} & \leq I (U; Y_{2}) \end{align}

(2)

for some joint distribution $P_{{UXY}_{1} Y_{2}} = P_{UX} \cdot P_{Y_{1} | X} \cdot P_{Y_{2} | X}$ on ${U \times X \times Y_{1} \times Y_{2}}$ [21]. $P_{Y_{1} | X}$ and $P_{Y_{2} | X}$ are conditional pdfs that depend on the channel model. P_UX is the joint probability distribution of U and X, where the auxiliary random variable U has cardinality bounded by $| U | \leq min {| X |, | Y_{1} |, | Y_{2} |}$ . The capacity region is achieved using superposition coding, where U serves as the center of a cloud of codewords that can be distinguished by both receivers. Since the capacity region of a BC depends only on the conditional marginals, the capacity region of the stochastically degraded BC is equal to that of the corresponding physically degraded channel. Cover [1] showed that in the case of binary symmetric BC and AWGN BC, superposition coding expands the rate region beyond that achievable with time sharing.

Now, consider the Gaussian broadcast channel with two users. Without loss of generality, assume that Y₁ is less noisy than Y₂. It can easily be shown that scalar Gaussian broadcast channels are equivalent to a degraded channel,

\begin{align} Y_{1} & = X + Z_{1} \end{align}

(3)

\begin{align} Y_{2} & = X + Z_{2} = Y_{1} + Z_{2}^{'}, \end{align}

(4)

where $Z_{1} \sim N (0, σ_{1}^{2}), Z_{2} \sim N (0, σ_{2}^{2}), Z_{2}^{'} \sim N (0, σ_{2}^{2} - σ_{1}^{2})$ , and $Z_{1}, Z_{2}^{'}$ are independent. Thus, Gaussian BC is stochastically degraded. We assume an average power constraint on the transmitted power P defined as $E [X^{2}] \leq P$ . The received signal to noise ratio for each user is ${SNR}_{i} = \frac{P}{σ_{i}^{2}}$ , where SNR₁>SNR₂, and $σ_{i}^{2}$ is the variance of the noise Z_i. The capacity region of the AWGN-BC is the set of rate pairs (R₁+R₂,R₂), such that

\begin{align} R_{1} & \leq C (α \cdot {SNR}_{1}) \end{align}

(5)

\begin{align} R_{2} & \leq C (\frac{(1 - α) \cdot {SNR}_{2}}{α \cdot {SNR}_{2} + 1}) \end{align}

(6)

for all α∈ [ 0,1], where $C (x) = \frac{1}{2} \cdot \underset{2}{log} (1 + x)$ . The theoretical limit of two-user AWGN BC is achieved using signal superposition [1].

3 Broadcast transmission strategies

In this section, various transmission strategies for broadcast systems are described. The strategies are presented in ascending order of implementation complexity. Specifically, by moving from one strategy to another, we release some constraints on the system implementation to reach finally the most complex strategy that can be used to broadcast information for users. Obviously, since the simple schemes can be understood as adding constraints to the most general case, they are less efficient in terms of attainable rates.

3.1 Time sharing

Time sharing (TS) has been widely used in broadcast systems as broadcast transmission strategy. In time sharing scheme, a percentage of time is used to send one message, and the rest of the time is used to send another message. Thus, it is practical to implement because the rate pairs can be achieved by strategies used for point-to-point channel and sharing the time between messages. As in previous works on broadcasting, this situation serves as a reference for the more advanced schemes.

In this work, a time sharing scheme with standard constellation M-PAM (Figure 2) is considered when symbols are used with equal probability. A standard M-PAM constellation is defined as a constellation with M real symbols belonging to $X = {M - 1 - 2 \cdot (i - 1), for i = 1, \dots, M}$ . During the time slot dedicated to send a message, only one data stream is sent using the entire set of constellation points. In classical implementations of time sharing, the conventional M-PAM symbols are equally spaced and used with equal probability.

3.2 Hierarchical Modulation (HM)

In two-layer hierarchical modulation, constellation symbols are used to transmit two data streams simultaneously for two users [22, 23]. Constellation symbols are usually chosen with the same probability but may be non-equally spaced. These symbols can be considered as the sum of two lower-order modulations, one for each user. The modulation with higher power is used for the 'bad’ channel, the one with smallest power for the 'good’ channel. Hence, the encoding using hierarchical modulation can be separable for the two streams which is more practical.

This is explained here using 4-PAM as an example. Figure 3 shows the constellation diagram of a hierarchical 4-PAM with parameter ℓ=ℓ₁/ℓ₂ used to determine the spacing between the groups of constellation points (clouds). ℓ is the ratio of the spacing between the groups to the spacing between individual points within a group. Standard values of ℓ are 1, 2, and 4. When ℓ increases, with a fixed total transmission power P, the two points from both sides of origin form a cloud. The location of a point within its cloud is regarded as the information for the 'good’ user. The other information, i.e., the number of the cloud in which the point is located is the information for the 'bad’ user. In this way, two separate data streams can be made available for transmission. Formally, we are still dealing with 4-PAM, but in the hierarchical interpretation, it is viewed as the combination of two BPSK modulations which have different robustness to noise. In other words, the service coverage areas differ in size for both users. The better-protected data stream is referred to as the high-priority (HP) stream which is mapped in Figure 3 to the most significant bit. The other one is referred to as the low-priority (LP) stream (Figure 3) and mapped in Figure 3 to the least significant bit. Receivers with good reception conditions can receive both streams, while those with poorer reception conditions may only receive the high priority stream considering the LP stream as noise. This corresponds to a specific labeling of the modulation.

3.3 Superposition modulation

In superposition modulation (SM) [24], the M constellation points are used such that the labeling is separable, i.e., M=M₁M₂, and that the M points are obtained by adding (in ) two rv’s X₁ and X₂ of cardinality M₁ and M₂, respectively ( $M_{1}, M_{2} \in ℕ ∖ {0, 1}$ ). Thus, this scheme is with an enlarged set of feasible labelings than in the previous case [25, 26]. This leads also to U≡X₂ for superposition modulation because user 2 can distinguish only U.

This work studies several cases of superposition modulation. First, when the constellation symbols for each user are used with equal probability. This case will be denoted as ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ . This is a practical case since the encoding of the messages is separable, and the symbols are used with equal probability as in real transmission systems. Then, the constraint of using equiprobable symbols is released and the symbols of user constellations can be dependent and used with non-equal probability (P_UX non-uniform). Thus, the encoding here is done jointly for the two messages. This strategy will be denoted ${SM}_{\bar{X}, P_{UX}, P_{X}}$ when the symbols take the values of a standard M-PAM and ${SM}_{X, P_{UX}, P_{X}}$ , otherwise. In the latter case, the symbol positions can take arbitrary values and will be considered as variables to be optimized. The definition of superposition modulation can be generalized using more general form for P_UX than the uniform case. In superposition modulation, $2^{{nR}_{2}}$ independent codewords uⁿ=x⁽²⁾ⁿ(w₂) of length n are generated according to P_U; for each of these codewords, $2^{{nR}_{1}}$ satellite codewords vⁿ=x⁽¹⁾ⁿ(w₁) are generated and added to form codewords xⁿ(w₁,w₂)=uⁿ+vⁿ according to P_X|U. Thus, the fine information vⁿ is superimposed on the coarse information uⁿ.

Note that the capacity region of Gaussian broadcast channel is achieved using this coding scheme and successive cancellation decoding, where U (≡X₂) and V (≡X₁) are independent random variables following normal distributions. However, we do not assume here that U and V are independent. Consequently, for superposition modulation, P_UX takes a specific expression. As an example, consider an 8-PAM modulation. In that case, the transmitted signal at time k is the sum of the two users signals and is given by $x_{k} = x_{k}^{(1)} + x_{k}^{(2)}$ , where $x_{k}^{(1)} \in X_{1}$ and $x_{k}^{(2)} \in X_{2}$ with M₁·M₂=8. Two configurations are possible either M₂=4 ( $X_{1}$ is a BPSK, and $X_{2}$ is a 4-PAM) or M₂=2 ( $X_{1}$ is a 4-PAM, and $X_{2}$ is a BPSK). In both cases, P_UX is a sparse matrix of size M₂×M with expression

\begin{matrix} P_{UX} = [\begin{matrix} p_{00} & p_{01} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & p_{12} & p_{13} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & p_{24} & p_{25} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & p_{36} & p_{37} \end{matrix}] if M_{1} = 2, M_{2} = 4 \end{matrix}

(7)

\begin{matrix} P_{UX} = [\begin{matrix} p_{00} & p_{01} & p_{02} & p_{03} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & p_{14} & p_{15} & p_{16} & p_{17} \end{matrix}] if M_{1} = 4, M_{2} = 2, \end{matrix}

(8)

where P_UX[ i,j]=p_i-1,j-1= Pr{U=u_i-1,X=x_j-1}. In both cases, the number of elements to be computed is 8.

Note also that P_UX and (of cardinality M) determine the labeling of the input signal constellation for a fixed labeling for $X_{1}$ and $X_{2}$ [25, 26]. Thus, the information can be distinguished using the labeling. Consider for example a label $l_{k}^{u}$ of ${log}_{2} (| X_{2} |)$ binary labels for u_k and $l_{j}^{v}$ of ${log}_{2} (| X_{1} |)$ binary labels for v_j with $k \in {0, .., | X_{2} | - 1}$ and $j \in {0, .., | X_{1} | - 1}$ . Obviously, the M symbols $x_{i}, i \in {0, .., | X | - 1}$ carry log₂(M) binary labels which are the concatenations of the labels of u_k and v_j such as x_i=u_k+v_j.

Part of this work on superposition modulation was presented in [25–27], where the achievable rate regions for ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ and ${SM}_{X, P_{UX}, P_{X}}$ strategies are analyzed using a 4-PAM constellation in [25, 26] and for {4,8,16}-PAM constellations in [27]. In this work, the achievable rates are also derived for ${SM}_{\bar{X}, P_{UX}, P_{X}}$ using {4,8,16}-PAM constellations.

3.4 Superposition coding

Superposition coding (SC) is one of the basics of coding schemes in network information theory. This idea was first introduced by Cover in an information theoretic study of broadcast channels [1]. In superposition coding, the joint distribution of probability P_UX can take a more general form than in the case of superposition modulation. In this case, the labeling cannot distinguish between the common information and the private information for user 1, a fact which increases the decoder complexity. Indeed, since the auxiliary random variable U has cardinality bounded by $| U | \leq min {| X |, | Y_{1} |, | Y_{2} |}$ , we use the name general superposition coding or superposition coding simply to describe the case, where $| U | = min {| X |, | Y_{1} |, | Y_{2} |}$ . For superposition coding and with M-PAM modulation, P_UX is an M×M matrix with elements p_i,j.

The basics of superposition coding are briefly recalled below; a detailed description is given in [28]. In this scheme, $2^{{nR}_{2}}$ sequences $u^{n} (w_{2}), w_{2} \in [1, 2^{{nR}_{2}}]$ each i.i.d., are generated randomly and independently to represent the coarse message, each according to $\prod_{i = 1}^{n} p_{U} (u_{i})$ . For each auxiliary sequence, uⁿ(w₂) randomly, conditionally, and independently generates $2^{{nR}_{1}}$ sequences xⁿ(w₁,w₂) and $w_{1} \in [1, 2^{{nR}_{1}}]$ , each according to $\prod_{i = 1}^{n} p_{X | U} (x_{i} | u_{i} (w_{2}))$ to represent the fine message w₁. Thus, in superposition coding, the auxiliary random variable U serves as a cloud center for the information, distinguishable by both receivers. In this case, the decoding of information by users is based on large block joint typicality. This comes in contrast with the simpler cases where the message for user 2 was carried by the center of modulation clouds which imply a possible scalar detection.

The achievable rates for superposition coding will be studied for various strategies corresponding to different constraints on P_UX and/or . An exhaustive list of all the strategies under consideration is given in Table 1, where redundant configurations are omitted.

Table 1 Strategies under consideration

Full size table

4 Achievable rate regions

For a two-user Gaussian BC, the theoretical limit of the capacity region is achieved using Gaussian input alphabet for each user. However, practical implementation constraints impose the use of finite input alphabets, and the symbols are usually chosen with equal probability. These restrictions contribute to increase the gap between the capacity region achieved with infinite Gaussian inputs and the throughput obtained in practical situations. In this section, we are interested in computing the achievable rate region of power-constrained AWGN BC when the transmitted signal is modulated using an M-PAM constellation, under the various situations described above. Since the last case (superposition coding) encompasses all previous ones as special cases, the corresponding optimization problems can be solved with the same strategy, which is detailed in this section.

4.1 Problem formulation

Consider a two-user memoryless AWGN broadcast channel (SNR₁>SNR₂) with signal power constraint P. The channel input belongs to a finite set $X = {x_{0}, \dots, x_{M - 1}} \subset ℝ$ represented by an M-PAM constellation. Assume a symmetric input signal constellation with respect to the origin. Since has cardinality bounded by $| U | \leq min {| X |, | Y_{1} |, | Y_{2} |}$ , and the output alphabet cardinality for an AWGN channel is infinite, we have $| U | \leq | X |$ . Thus, $| U | \leq M$ .

To determine the maximal achievable rate region using superposition coding, consider the case $| U | = M$ . For superposition modulation, we take into account the specificity on P_UX given in section 3.3. We also consider within the same framework the problem of maximizing the achievable rates under additional constraints on optimization variables (P_UX and ): standard M-PAM symbols values, uniform distribution for P_UX, uniform distribution for P_X. The problem of maximizing the achievable rates under a specific situation is solved subject to a combination of constraints according to Table 1. We recall that in this work, message w₂ is a common message to both receivers, and w₁ is a private message to user 1. Thus, the achievable rate region (R₂ vs. R₁+R₂) can be obtained by solving the weighted sum rate (θ·R₁+(1-θ)·R₂) maximization for θ∈ [ 0,0.5]. Indeed, for θ=0, we maximize the common information rate R₂, and when θ=0.5, we maximize the rate achieved by user 1 (R₁+R₂). Using (1) and (2), the optimization problem under consideration is:

\begin{array}{l} max_{P_{UX}, X} & θ \cdot I (X; Y_{1} | U) + (1 - θ) \cdot I (U; Y_{2}) \\ s.t. & p_{ij} \geq 0 \forall i, j \\ \sum_{i, j} p_{ij} \cdot x_{j}^{2} \leq P \end{array}

(9)

and subject to the constraint on the joint pdf P_UX or on given in Table 1 for each strategy, where p_ij= Pr{U=u_i,X=x_i},j∈{0,..,M-1}, and $i \in {0, .., | U | - 1}$ . The two mutual information I(X;Y₁|U) and I(U;Y₂) can be written as follows:

\begin{align} I (X; Y_{1} | U) = & \sum_{i, j} \int_{- \infty}^{+ \infty} p_{ij} P_{Y_{1} | X} (y_{1} | x_{j}) \\ \times log \frac{(\sum_{j^{'}} p_{i j^{'}}) P_{Y_{1} | X} (y_{1} | x_{j})}{\sum_{j^{'}} p_{i j^{'}} P_{Y_{1} | X} (y_{1} | x_{j^{'}})} {dy}_{1} \end{align}

(10)

\begin{align} I (U; Y_{2}) = & \sum_{i} \int_{- \infty}^{+ \infty} (\sum_{j} p_{ij} P_{Y_{2} | X} (y_{2} | x_{j})) \\ \times log \frac{\sum_{j^{'}} p_{i j^{'}} P_{Y_{2} | X} (y_{2} | x_{j^{'}})}{(\sum_{j^{'}} p_{i j^{'}}) (\sum_{i^{'}, j^{'}} p_{i^{'} j^{'}} P_{Y_{2} | X} (y_{2} | x_{j^{'}}))} {dy}_{2}, \end{align}

(11)

where all logarithms are taken base 2. The AWGN channel for each user is characterized by the conditional pdf

P_{Y_{i} | X} (y | x) = \frac{1}{\sqrt{2 π σ_{i}^{2}}} . e^{- \frac{{(y - x)}^{2}}{2 σ_{i}^{2}}} i \in {1, 2} .

(12)

When θ=0 or θ=1 and for $| U | = M$ (which are referred in this paper as point-to-point (PtP) channel case), the individual achievable rates R₂ and R₁ are maximized respectively. The problem (9) is equivalent to

\begin{array}{l} max_{P_{X}, X} & I (X; Y_{k}) \\ s.t. & p_{i} \geq 0 \forall i \\ \sum_{i} p_{i} = 1 \\ \sum_{i} p_{i} \cdot x_{i}^{2} \leq P, \end{array}

(13)

where p_i= Pr{X=x_i},i∈{0,..,M-1} is the input probability distribution, and k∈{1,2}. When θ=0 or 1, problem (13) is solved for k=2 and 1, respectively, with I(X;Y_k) given by

\begin{align} I (X; Y_{k}) = & \int_{- \infty}^{+ \infty} \sum_{j} p_{j} P_{Y_{k} | X} (y_{k} | x_{j}) \\ \times log \frac{P_{Y_{k} | X} (y_{k} | x_{j})}{\sum_{j^{'}} p_{j^{'}} P_{Y_{k} | X} (y_{k} | x_{j^{'}})} {dy}_{k} . \end{align}

(14)

For the time sharing scheme using standard constellation, the achievable rate pair (R₁+R₂,R₂) is such that [1]

\{\begin{array}{l} R_{1} = α \bar{R_{1}} \\ R_{2} = (1 - α) \bar{R_{2}} \end{array},

(15)

where $\bar{R_{1}}$ and $\bar{R_{2}}$ are achievable rates for PtP channel using standard M-PAM constellation at SNR₁ and SNR₂, respectively. Varying α from 0 to 1 yields achievable rate region.

Problem (9) is not convex; therefore, direct numerical optimization is inefficient. Clearly, an exhaustive search is not feasible as the complexity would be exponential in the total number of variables. An iterative method for solving (9) is proposed in the next section.

4.2 Numerical solution

Consider a regularized version of (9) as

\begin{array}{l} L (P_{UX}, x_{0}, .., x_{M - 1}, s) = & θ \cdot I (X; Y_{1} | U) + (1 - θ) \cdot I (U; Y_{2}) \\ + s \cdot (P - \sum_{i = 0}^{| U | - 1} \sum_{j = 0}^{M - 1} p_{ij} \cdot x_{j}^{2}), \end{array}

(16)

where s is a regularization parameter. For a given value of s, the optimization problem in (16) is solved (for the most general case) with respect to P_UX and to $X = (x_{0}, x_{1}, \dots, x_{M - 1})$ alternately until convergence:

P_{UX}^{(ℓ)} = arg max_{P_{UX} \in C} L (P_{UX}, x_{0}^{(ℓ - 1)}, .., x_{M - 1}^{(ℓ - 1)}, s)

(17)

X^{(ℓ)} = arg max_{X} L (P_{UX}^{(ℓ)}, x_{0}, .., x_{M - 1}, s),

(18)

where ℓ is the iteration index, and denotes the set of constraints on P_UX and can be defined either as $C = {P_{UX} : p_{ij} \geq 0, \sum_{i, j} p_{i, j} = 1}$ or as $C = {P_{UX} : p_{ij} \geq 0, \sum_{i} p_{i, j} = \frac{1}{M}}$ (equiprobable symbols). The optimization problem in (17) with constraint set $C = {P_{UX} : p_{ij} \geq 0, \sum_{i, j} p_{i, j} = 1}$ can be handled by a modified 'Blahut-Arimoto’-type algorithm [29]. Indeed, in order to take into account the regularization, we can show that the Blahut-Arimoto-type algorithm proposed in [30] for broadcast channels should be modified by replacing Equation (19) of Lemma 3 in [30] by $q^{*} (u, x) = \frac{β [Q, \tilde{Q}, \bar{Q}] (u, x) \cdot e^{- s \frac{x^{2}}{1 - θ}}}{\sum_{u^{'}, x^{'}} β [Q, \tilde{Q}, \bar{Q}] (u^{'}, x^{'}) \cdot e^{- s \frac{x^{' 2}}{1 - θ}}}$ instead of $q^{*} (u, x) = \frac{β [Q, \tilde{Q}, \bar{Q}] (u, x)}{\sum_{u^{'}, x^{'}} β [Q, \tilde{Q}, \bar{Q}] (u^{'}, x^{'})}$ , where $β [Q, \tilde{Q}, \bar{Q}] (u, x)$ is defined in Equation (19) of [30]. When there is an additional constraint on constellation symbols to be equiprobable, i.e., $C = {P_{UX} : p_{ij} \geq 0, \sum_{i, j} p_{i, j} = 1 and \sum_{i} p_{i, j} = \frac{1}{M}}$ , the Blahut-Arimoto-type algorithm in [30] should also be modified to take into account the additional constraint. In this case, Equation (19) of Lemma 3 in reference [30] should be replaced by $q^{*} (u, x) = \frac{1}{| X |} \cdot \frac{β [Q, \tilde{Q}, \bar{Q}] (u, x)}{\sum_{u} β [Q, \tilde{Q}, \bar{Q}] (u, x)}$ , which does not depend on s, where $β [Q, \tilde{Q}, \bar{Q}] (u, x)$ is defined in Equation (19) in this reference.

Now consider (18). The function $L (P_{UX}^{(ℓ)}, x_{0}, .., x_{M - 1}, s)$ is not a concave function for all $X \in ℝ^{M}$ . However, we observed in our experiments that $L (P_{UX}^{(ℓ)}, x_{0}, .., x_{M - 1}, s)$ is a concave function if $X \in D$ , where $D = {X \in ℝ^{M} : | x_{i} - x_{j} | > d \forall i, j \in {0, .., M - 1} and i \neq j}$ , and d depends on the size of the constellation and on the SNR. Since we are interested in finding non-degenerated constellation, we restrict the optimization process to . Then, a simplex method is used to perform the optimization with initial value in .

The alternative maximization method can at least increase the objective function in each iteration. In the experiments, we have observed that this method converges at least to a local maximum (denoted $p_{i, j}^{*} (s), x_{j}^{*} (s), 0 \leq j \leq M - 1, 0 \leq i \leq | U | - 1$ ). We discuss now the choice of s. Since we do not know a priori which value of s may correspond to the satisfaction of the equality power constraint, we propose to use an iterative process as follows:

\begin{matrix} s^{(k + 1)} = {[s^{(k)} - γ \cdot (P - \sum_{i = 0}^{| U | - 1} \sum_{j = 0}^{M - 1} p_{ij}^{*} (s^{(k)}) \cdot {(x_{j}^{*} (s^{(k)}))}^{2})]}^{+}, \end{matrix}

(19)

where [.]⁺ is defined as [.]⁺= max(.,0). The value of s is increased or decreased with the sign of $P - \sum_{i = 0}^{| U | - 1} \sum_{j = 0}^{M - 1} p_{ij}^{*} (s^{(k)}) \cdot {(x_{j}^{*} (s^{(k)}))}^{2}$ . The process stops when the power constraint is fulfilled. The proposed algorithm is summarized in Table 2. Obviously, when constellation symbols are constrained to the values of a standard constellation, (P 2) which is defined in Table 2 will not be used. Similarly, when P_UX is uniform, (P 1) is not used. An alternative interpretation of this algorithm is to recognize that L(P_UX,x₀,..,x_M-1,s) is the Lagrangian dual of problem 9. Equations (17) and (18) are an iterative method for solving

f (s) = max_{P_{UX}, x_{0}, .., x_{M - 1}} L (P_{UX}, x_{0}, .., x_{M - 1}, s) .

(20)

Table 2 Numerical solution for solving (9)

Full size table

The dual optimization problem mins.t. s≥0f(s) is solved in (19) with a gradient-type algorithm. Since f(s) is convex [31], a gradient search method is guaranteed to converge to a global optimum.

5 Result analysis

5.1 Point to point channel

We present in this section the results of maximizing achievable rates for PtP case using M-PAM constellations with M=4,8,16 and for different values of SNR. To evaluate the contribution of constellation shaping, we compare, for a fixed SNR, the maximal achievable rate calculated by the algorithm proposed in the previous section to the 'standard constellation’ rate, whose symbols are used with equal probability, at the same SNR in terms of SNR saving (called SNR shaping gain). The SNR shaping gain depicted in (Figure 4) is the gain obtained with a fully optimized constellation ( $P_{X}$ and ) compared to the standard M-PAM constellation and when symbols are used with the same probability. To avoid the complexity of constructing nearly optimal input distribution codes, another method for doing constellation shaping is to optimize only the position of symbols in the constellation. Each signal point is assumed to be chosen with the same probability; however, the position of each point in the constellation is optimized. The corresponding shaping gain is given in (Figure 5). We observe the following: the shaping gain depends on the SNR and on the size of the constellation. The maximum gain is obtained for mid-range SNR. The distribution of probability $P_{X}$ (not reported) is very similar to the sampling of a Gaussian distribution. With the half-optimized constellation ( only), a significant degradation is observed for mid-range SNR compared to that for the fully optimized constellation. Hence, we can conclude that symbol pdf optimization is useless at low and high SNR, whereas the fully optimized constellation is efficient for mid-range SNR, in which case the gain increases with the size of the constellation.

5.2 Broadcast channel

Current broadcast systems are using two practical transmission schemes for sending information to users: orthogonal schemes in which the time and/or frequency is split between the users, and superposition modulation schemes where the constellation for each user is fixed. In this section, a comparison is provided between these standard schemes and various (more complex) transmission strategies such as superposition coding. The effect of constellation shaping is evaluated by analyzing the achievable rate region curves obtained for an M-PAM constellation (M=4,8,16) and for several pairs (SNR₁,SNR₂). The following schemes are considered:

Time sharing using standard M-PAM (TS).
SM - 3 possible configurations (see Table 1)
SC - 4 possible configurations (see Table 1)

In the following, we denote by the 'case 1’ of superposition modulation when M₁=2,M₂=4 and when M₁=2,M₂=8. 'Case 2’ is when M₁=4,M₂=2 and when M₁=4,M₂=4. 'Case 3’ refers to the case when M₁=8, M₂=2.

Achievable rate region curves are provided in Figures 6, 7, 8, 9, 10, and 11 for M=4,8,16. For each value of M, the display of the results is limited to two different pairs of SNR. In complement with the achievable rate region curves, comparisons are also conducted in terms of SNR savings for target achievable rates (maximum shaping gain) and in terms of maximum percentage of gain for user 1. These two quantities are defined below.

Definition 1

Consider two transmission strategies (A and B). The pair of rates (R₁+R₂,R₂) is achieved for (SNR₁,SNR₂) with A and for (SNR₁+Δ SNR,SNR₂+Δ SNR) with B. The shaping gain (with A compared to B) is Δ SNR. The maximum shaping gain is defined as

{MG}_{{SNR}_{dB}} (A | B) = {max}_{R_{2}} Δ SNR

(21)

Definition 2

Consider two transmission strategies (A and B). For a given pair of SNR (SNR₁,SNR₂) and a fixed value of R₂, the achievable pair of rates is $(R_{1}^{A} + R_{2}, R_{2})$ and $(R_{1}^{B} + R_{2}, R_{2})$ with A and B, respectively. The gain on the achievable rate for user 1 is given by

G_{R_{1}} (A | B) = \frac{(R_{1}^{A} + R_{2}) - (R_{1}^{B} + R_{2})}{R_{1}^{B} + R_{2}} \cdot 100 (%) .

(22)

The maximum gain on the achievable rate for user 1 (with A compared to B) is given by

{MG}_{R_{1}} (A | B) = {max}_{R_{2}} G_{R_{1}} (A, B) .

(23)

5.2.1 Superposition modulation

In this section, the three possible configurations of superposition modulation are compared. We can see from Figures 6, 7, 8, 9, 10, and 11 that ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ (optimization of only) outperforms ${SM}_{\bar{X}, P_{UX}, P_{X}}$ (optimization of P_UX only) in terms of maximal achievable rates per user when M=4. For M=8 and 16, ${SM}_{\bar{X}, P_{UX}, P_{X}}$ can achieve slightly higher rates than ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ . The implementation of a system with constellation symbols with non-standard positions and generated with the same probability is less complex than the implementation of a system which generates symbols with non-uniform joint distribution of probability. Thus, ${SM}_{\bar{X}, P_{UX}, P_{X}}$ does not seem to be of interest since it is not very efficient in terms of achievable rates and is more complex to implement.

Figures of achievable rate region show that an improvement can be obtained with ${SM}_{X, P_{UX}, P_{X}}$ (full optimization) compared to ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ (optimization of only) and depending on δ_SNR=SNR₁-SNR₂. Numerical values of the maximum gain in achievable rate ( ${MG}_{R_{1}}$ ) and of the maximum SNR savings ( ${MG}_{{SNR}_{dB}}$ ) are given in Table 3. We observe the following: a slight gain in terms of achievable rates can be translated into a noticeable gain in terms of SNR saving. The maximum shaping gain increases with the constellation size. Thus, constellation shaping for the SM strategy seems more useful for high values of M. The analysis of the optimal matrix P_UX (results not reported) leads to the conclusion that X₁ and X₂ are not independent in general when using finite-size constellations. We observe also that the maximum shaping gain for ${SM}_{X, P_{UX}, P_{X}}$ versus ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ increases when δ_SNR decreases, independently of M. In particular, full optimization (vs. optimization of the symbol position) does not provide significant improvement for large SNR gap in the SM strategy.

Table 3 Comparison of ${SM}_{X, P_{UX}, P_{X}} (A)$ and ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}} (B)$ with respect to ${MG}_{{SNR}_{dB}}$ and ${MG}_{R_{1}}$

Full size table

5.2.2 Time sharing or superposition modulation?

This section compares two strategies (TS and SM) classically considered in broadcast systems. In Figures 6 and 7 (M=4), we observe that the achievable rate region can be split into two parts. Indeed, for small and large values of R₂, TS is better than SM. On the contrary, SM is better than TS for middle-range values of R₂. Under a given rate requirement for one user, we can thus determine the best transmission strategy. We can also observe that the region in which SM is better than TS becomes small for larger values of SNR₂. With M=8 (Figures 8 and 9), the area in which SM is better than TS increases (compared to M=4) by considering the union of the two possible configurations for SM: M₁=2,M₂=4 (case 1) and M₁=4,M₂=2 (case 2). This is particularly true when δ_SNR increases. We also observe that TS can achieve higher rates than SM (case 1) for good SNR₂ values. Indeed, the maximum rate of user 2 with SM is the maximum individual rate for a 4-PAM constellation, whereas it is the individual user rate that achieved using standard 8-PAM in the TS case. For low SNR₂ values, optimized 4-PAM may achieve higher rate than standard 8-PAM; thus, SM becomes better in this interval. For a 16-PAM constellation (Figures 10 and 11), SM is always better than TS for the studied pairs of (SNR₁,SNR₂). Table 4 shows the maximum percentage of improvement in achievable rate of user 1 by TS when using ${SM}_{X, P_{UX}, P_{X}}$ (full optimization) strategy in the interval, where ${SM}_{X, P_{UX}, P_{X}}$ is better than TS. Clearly, the maximum percentage of improvement increases when δ_SNR increases, and an important gain is obtained for high values of δ_SNR as in the case of SNR₁=δ_SNR=10 dB for a 4-PAM, where the percentage of gain on achievable rate of user 1 varies between 0% and 40.7%. For a 8-PAM constellation, the percentage of gain on achievable rate of user 1 varies between 0% and 30.21% when SNR₁=16 dB and δ_SNR=8 dB. For a 16-PAM, the percentages of improvements can be up to 35.08% when SNR₁=18 dB and δ_SNR=8 dB. We can conclude that SM is a better option than TS especially for large δ_SNR values. TS is optimal in the region, where we want to maximize the rate of user 2 for good values of SNR₂ because the single user rate achieved by TS is the rate achieved using standard M-PAM constellation (the constellation is split between users with SM). Thus, SM seems more gainful than TS when we want to serve users with very diverse SNRs.

Table 4 Comparison of ${SM}_{X, P_{UX}, P_{X}} (A)$ vs. TS (B) and comparison of ${SC}_{X, P_{UX}, P_{X}} (A)$ vs. $TS ⋃ {SM}_{X, P_{UX}, P_{X}} (C)$

Full size table

5.2.3 Is superposition coding necessary?

For the three constellations under consideration (M=4,8,16), the maximal achievable rate region obtained by the optimal general case of superposition coding when we consider the general form of P_UX (SC) can achieve, depending on M and user SNRs, a large region of rate pairs (R₁+R₂,R₂) that cannot be achieved neither by TS nor by SM. Even when we fully optimize SM ( ${SM}_{X, P_{UX}, P_{X}}$ ), we are far from maximal achievable rate region. Sometimes, the maximal achievable rate region curve is very close or even coincides with the ${SM}_{X, P_{UX}, P_{X}}$ achievable rate region in a pair of rates ( $R_{1}^{*} + R_{2}^{*}, R_{2}^{*}$ ). This is the case when ${SM}_{X, P_{UX}, P_{X}}$ is the optimal superposition coding in terms of achievable rates. We can see for example in Figure 6 that the pair of rates ( $R_{1}^{*} + R_{2}^{*} = 1.096, R_{2}^{*} = 0.531$ which corresponds to the optimal rate pair when we optimize the general case of SC for θ=0.23) is an intersection point with ${SM}_{X, P_{UX}, P_{X}}$ achievable rate region.

We are interested now in the numerical evaluation of the gain in rate of user 1 (R₁+R₂) when we use ${SC}_{X, P_{UX}, P_{X}}$ (full optimization) compared to the best strategy between TS and SM. This gain ( ${MG}_{R_{1}} ({SC}_{X, P_{UX}, P_{X}} | TS ⋃ {SM}_{X, P_{UX}, P_{X}}$ ) calculated in % is the distance between the limit of the maximal achievable rate region and the limit of the union of achievable rate regions of TS and ${SM}_{X, P_{UX}, P_{X}}$ .

The results are reported in Table 4. We observe that the part of the maximal achievable rate region which is unachievable by TS and SM is bigger when M is small because we observe that for the case of 4-PAM, we have one configuration for SM. However, we have two configurations of SM for 8-PAM constellation and three configurations for 16-PAM constellation. Thus, when M increases, the union of achievable rates for all SM cases tends to the sets of achievable rates by the general superposition coding. Asymptotically, we know that when $M \to \infty, {SM}_{X, P_{UX}, P_{X}}$ is the optimal superposition coding scheme because it allows the capacity region for two-user AWGN BC using Gaussian alphabet for each user to be achieved. Thus, the maximum gain in user 1 rate decreases when the constellation order M increases. We observe also that the gain in achievable rates is high for high values of δ_SNR. On the other hand, the experiments show that by using the general superposition coding strategy with the constraint that symbols should be equiprobable ( ${SC}_{X, P_{UX}, \bar{P_{X}}}$ ), the loss is limited compared to the full optimization ( ${SC}_{X, P_{UX}, P_{X}}$ ), 4.84%, 7.66%, and 3.94% for the simulated pairs of (SNR₁,SNR₂) when M=4, 8, and 16, respectively. This means that we can use equiprobable symbols with, in general, a small loss in achievable rates. However, ${SC}_{X, P_{UX}, \bar{P_{X}}}$ is not an interesting case when ${SM}_{X, P_{UX}, P_{X}}$ can achieve better rates since SM is less complex to implement than SC.

Moreover, with standard M-PAM symbols, the two possible configurations ( ${SC}_{\bar{X}, P_{UX}, P_{X}}$ (optimization of P_UX and P_X) and ${SC}_{\bar{X}, P_{UX}, \bar{P_{X}}}$ (optimization of P_UX only)) give very similar results in most considered pairs of SNR. We also observe that the loss in maximum achievable rate experienced by user 1 with ${SC}_{\bar{X}, P_{UX}, P_{X}}$ is less than 10% under the rate experienced with ${SC}_{X, P_{UX}, P_{X}}$ . Thus, we can use standard values of symbol positions without losing much on achievable rates.

In general, one can conclude that fixing constellations of users (i.e., assigning labels to the constellation so that we distinguish between the bits intended for each user) is not optimal for coding and may result in important loss in terms of rates for systems using finite-size constellations especially for low-order constellations. A better solution is to determine the optimal alphabet of the auxiliary alphabet U which is not necessarily a constellation and then to generate the codewords xⁿ which are not necessarily the sum of two codewords (see Section 3.4).

6 Application: coverage extension

We first consider a transmission over a broadcast channel with finite size input alphabet. For simplicity of the illustration and without loss of generality, let us assume that the existing user alphabet belongs initially to a standard constellation whose symbols are used with equal probability. We assume that the existing user is at distance d₀ from the sender achieving a rate R₀. Some information is also to be transmitted to an upgraded layer of users. The sender can use up to 16 symbols, then several transmission schemes can be used. We are interested in comparing the transmission schemes to serve the new user under two scenarios: either the new user is closer to the transmitter than the existing user or the new user is farther than the existing one. For a target rate R₀ that is fixed for the existing user and achievable using a standard M-PAM and equiprobable symbols, we are interested in determining the variation of the coverage’s diameter ratio between the two layer of users as a function of the achievable rate by the upgraded user for various broadcast transmission strategies. We assume that $SNR \propto \frac{1}{d^{2}}$ .

6.1 The sender can use up to 16 symbols

6.1.1 Scenario 1

In this scenario, the system consists initially of one layer of users. Now, assume that the data information is also to be transmitted to a second layer of users with higher SNR. In the following, we keep the notation from the preceding section, where the user with greater SNR is denoted by user 1. Thus, in this scenario, the legacy receivers are denoted by user 2 which is at a distance d₂ from the transmitter and achieving a rate R₀ when the data is modulated using standard 4-PAM constellation and equiprobable symbols. The upgraded receivers are denoted by user 1 (SNR₁>SNR₂). We intend that the good user receives more throughput than user 2 via the use of 16-PAM.

In this example SNR₂ is fixed to 10 dB. Initially, user 2’s alphabet belongs to a 4-PAM standard constellation (see section 3.1), and the rate transmitted to user 2 is R₀=1.582 bits/ch. use.

Now, a new layer of users called user 1 is introduced in the system with SNR₁>SNR₂. Our target is to provide the maximum bit rate to the new user without changing R₀ or d₀ and using a 16-PAM. By enlarging the constellation and optimizing the symbol positions and probability distribution, we ensure that the rate of the initial user will not decrease after introducing a new user.

Consider now the results for the following strategies which can achieve a positive private-message rate for user 1: time sharing using standard 16-PAM, ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}} M_{2} = 8 / M_{1} = 2$ (optimization of only), ${SM}_{X, P_{UX}, P_{X}} M_{2} = 8 / M_{1} = 2$ (full optimization) and ${SC}_{X, P_{UX}, P_{X}}$ (full optimization). Figure 12 illustrates the variation of d₁/d₂, which is the ratio of the diameter of the coverage area for user 1 over the diameter of the initial coverage area for user 2, as a function of the achievable rate for user 1 for a target rate R₀=1.582 for user 2.

Let us assume for example that the new user is midway between the transmitter and user 2 (d₁/d₂=0.5). Figure 12 shows that the most simple case of superposition modulation ( ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}} M_{2} = 8 / M_{1} = 2$ ) provides 16.3% more bit rate than time sharing for the new user. If we move immediately to a more complex case and optimize P_UX ( ${SM}_{X, P_{UX}, P_{X}} M_{2} = 8 / M_{1} = 2$ ), a gain of 21% is obtained on the bit rate of user 1 compared to time sharing. This gain on achievable rate for the new user is equivalent to a gain of 1 dB on SNR₁ compared to superposition modulation with uniform P_{UX x}. However, if we move to the most general case of superposition coding, it does not provide significant gain compared to superposition modulation.

Now, we assume that the new user is close to the transmitter such that d₁/d₂=0.2. We observe that the gain on the bit rate of user 1 using the simple case of superposition modulation increases to 45.7% compared to time sharing. By moving to a more complex case ( ${SM}_{X, P_{UX}, P_{X}} M_{2} = 8 / M_{1} = 2$ ), a gain of 47.8% is obtained on the bit rate of user 1 compared to time sharing. We observe also that it is relevant in this case to move to the most general case of superposition coding since it provides a gain of 61.8% on the bit rate of user 1 compared to time sharing.

Consequently, using superposition modulation provides always noticeable gain compared to time sharing. The general case of superposition coding ${SC}_{X, P_{UX}, P_{X}}$ is useful when user 1 is close to the transmitter, but not when it is close to user 2.

6.1.2 Scenario 2

Initially, consider a system of one layer of users, denoted by user 1, at a distance d₁ from the transmitter and achieving a rate R₀. Moreover, the alphabet of user 1 belongs to a standard 8-PAM constellation. In this example, SNR₁ is fixed to 18 dB. Thus, user 1 can achieve a rate R₀=2.73 bits/ch. use in the initial situation. In this scenario, we want to serve a second layer of users denoted by user 2 which is farther to the transmitter than the existing user, i.e., SNR₂x<SNR₁.

Achievable rates for user 2 are obtained at different distance d₂ from the transmitter and using various transmission strategies for a target rate of user 1 equal to R₀ and a coverage diameter for user 1 fixed to d₁. Figure 13 illustrates the variation of d₂/d₁, which is the ratio of the diameter of the coverage area for user 2 over the diameter of the initial coverage area for user 1, as a function of the achievable rate for user 2 when a target rate for user 1 is fixed to R₀=2.73 bits/ch. use.

We observe in Figure 13 that superposition modulation can always achieve better rates for user 2 than time sharing using 16-PAM. Let us assume first that we want to increase the diameter of the coverage area for the new user (user 2) such that d₂/d₁=4. Time sharing provides a bit rate less than 0.06 bits/ch. use. The most simple case of superposition modulation ( ${SM}_{X, P_{UX}, P_{X}} M_{2} = 2 / M_{1} = 8$ ) provides a significant improvement on the achievable rate for user 2 which is equal to 0.4 bits/ch. use in this case. If we increase the complexity by optimizing the joint probability distribution P_UX, we obtain 35% more bit rate for user 2 comparing to superposition modulation with uniform P_UX. If we move to the general case of superposition coding, we gain only 10% on the bit rate of the new user compared to superposition modulation (see Table 5). However, when the new layer of users is at distance d₂=2.25 d₁, the general case of superposition coding provides a significant gain of 41% on the achievable rate of user 2 comparing to superposition modulation.

Table 5 Comparison of ${SC}_{X, P_{UX}, P_{X}}$ and ${SM}_{X, P_{UX}, P_{X}} M_{2} - PAM / M_{1} - PAM w.r.t {MG}_{R_{2}}$ (%)

Full size table

Consequently, the general case of superposition coding can bring significant gains compared to superposition modulation, depending on the diameter of the coverage area for the new layer of users. For superposition modulation, optimizing the joint distribution of probability P_UX provides often significant shaping gains.

6.2 The cardinality of the existing user alphabet is kept fixed

In this section, we study scenario 1 (and 2) supposing that the legacy receivers will continue working as in the initial situation, still using 4-PAM (8-PAM). The system consists initially one layer of users at distance d₀ from the transmitter and achieves a rate R₀. Now, we want to change the transmitter, such that the upgraded receivers closer (farther) in range will be able to decode a refinement (coarse) layer and use a 16-PAM constellation. Thus, only time sharing with M₁=M₂=4(M₁=8,M₂=2) and superposition modulation strategies can be used. We aim to study how small the reduction in legacy coverage can be made, depending on the rate of the refinement (coarse) information achieved by the upgraded users. Thus, suppose that the legacy coverage can be reduced from d₀ to d₂ (from d₀ to d₁). We have studied this problem for SNR₀=12 dB and for SNR₁-SNR₂=4 dB in scenario 1 (and for SNR₀=16 and SNR₂=14 dB in scenario 2). Figures 14 and 15 represent the reduction in coverage d₂/d₀ (and d₁/d₀ respectively) as a function of the rate of the refinement R₁ (of the coarse R₂), while the rate achieved by the legacy receivers is kept fixed to its initial situation, i.e., R₀.

We observe in Figures 14 and 15 that the gain of superposition modulation strategies over time sharing becomes more important when d₂/d₀ (d₁/d₀) is small. These figures show that using superposition modulation when both symbol positions and P_UX are optimized, we gain around 5% from the initial coverage compared to the case of superposition modulation where symbols are used with equal probability. We can observe also that a reduction of only 10% and 20% in coverage area for the existing user can serve the upgraded user with a rate up to 20% and 35% (9% and 15%) from the rate achieved by the legacy users, using ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ . Consequently, by using ${SM}_{X, \bar{P_{UX}}, \bar{P_{X}}}$ , the legacy receivers still use 4-PAM (8-PAM in scenario 2), and we can serve a new layer of users with an acceptable rate, a small reduction in coverage area, and with less complexity compared to ${SM}_{X, P_{UX}, P_{X}}$ .

7 Conclusion

In this work we considered the problem of maximizing the achievable rate region for power-constrained AWGN broadcast channel of two users using M-PAM constellations. The achievable rate region is given for various transmission strategies. Maximal achievable rate region for superposition coding and superposition modulation is obtained using constellation shaping. An iterative algorithm was proposed to solve this optimization problem. Then, the efficiency of several strategies are compared. For superposition modulation, the results showed that constellation shaping seems more useful for high values of M. Moreover, the gain in using a complex case of superposition modulation increases when the SNR gap between users decreases. We observed also that superposition modulation outperforms time sharing in a large part of the achievable rate region. On the other hand, it is shown that using the general case of superposition coding can bring important gains compared to classical schemes. We observed also that in the case of finite input alphabet, superposition modulation is not the optimal strategy as in the case of Gaussian input alphabets. Finally, in order to make clear that this paper provides useful tools for the system designer, we considered two scenarios of coverage areas and user alphabets where the systems served initially one layer of users. Then, we propose to serve a second layer of users, and we evaluate the achievable rate of the new layer depending on the broadcast strategy. To improve the system performance compared to time sharing, we can optimize the joint probability distribution and symbol positions of the superimposed modulations or consider the general case of superposition coding. In this work, we showed that the optimization of probabilities was often useful, but not always. However, superposition coding brings sometimes significant gains compared to superposition modulation, depending on the diameter of coverage area for the new layer of users.

This work can also be extended to two-dimensional constellations like M-QAM and other channel models. The maximization achievable rates using various transmission strategies can be performed also using the proposed algorithm based on alternative maximization with respect to symbol positions and the joint distribution of probability.

References

Cover TM: Broadcast channels. IEEE Trans. Inform. Theory 1972, 18: 2-14. 10.1109/TIT.1972.1054727
Article MathSciNet MATH Google Scholar
Bergmans PP: Random coding theorem for broadcast channels with degraded components. IEEE Trans. Inform. Theory 1973, 19(2):197-207. 10.1109/TIT.1973.1054980
Article MathSciNet Google Scholar
Bergmans PP: A simple converse for broadcast channels with additive white Gaussian noise. IEEE Trans. Inform. Theory 1974, 20: 279-280. 10.1109/TIT.1974.1055184
Article MathSciNet MATH Google Scholar
Gallager RG: Capacity and coding for degraded broadcast channels. Probl. Infor. Transm 1974, 10(3):185-193.
MathSciNet MATH Google Scholar
Imai G, Hirakawa S: A new multilevel coding method using error correcting codes. IEEE Trans. Inform. Theory 1977, 23: 371-377. 10.1109/TIT.1977.1055718
Article MATH Google Scholar
Ungerboeck G: Channel coding with multilevel/phase signals. IEEE Trans. Inform. Theory 1982, 28: 55-67. 10.1109/TIT.1982.1056454
Article MATH Google Scholar
Bergmans PP, Cover TM: Cooperative broadcasting. IEEE Trans. Inform. Theory 1974, 20: 317-324. 10.1109/TIT.1974.1055232
Article MathSciNet MATH Google Scholar
European Telecommunications Standards Institute: EN 300 744: Digital Video Broadcasting (DVB)—framing structure, channel coding and modulation for digital terrestrial television. France,: European Telecommunications Standards Institute; 2004–2006)
European Telecommunications Standards Institute: ETSI TS 102: Digital Video Broadcasting (DVB)—system specifications for satellite services to handheld devices (SH) below 3 GHz. France: (European Telecommunications Standards Institute; 2008.
Google Scholar
Meric H, Lacan J, Amiot-Bazile C, Arnal F, Boucheret ML: Generic approach for hierarchical modulation performance analysis: application to DVB-SH. In Wireless Telecommunications Symposium. New York: ; 13–15 April 2011.
Google Scholar
Calderbank AR, Ozarow LH: Nonequiprobable signaling on the Gaussian channel. IEEE Trans. Inform. Theory 1990, 36(4):726-740. 10.1109/18.53734
Article MathSciNet Google Scholar
Sommer D, Fettweis G: Shaping by non-uniform QAM for AWGN channels and applications using turbo coding. In ITG Conference Source and Channel Coding. Munich, Germany: ; 17–19 Jan 2000.
Google Scholar
Fragouli C, Wesel RD, Sommer D, Fettweis GP: Turbo codes with non-uniform constellations. Proc. IEEE Int. Conf. Commun 2001, 1: 70-73.
Google Scholar
Varnica N, Ma X, Kavcic A: Capacity of power constrained memoryless AWGN channels with fixed input constellations. GLOBECOM 2002, 2: 1339-1343.
Google Scholar
Raphaeli D, Gurevitz A: Constellation shaping for pragmatic turbo-coded modulation with high spectral efficiency. IEEE Trans. Commun 2004, 52(3):341-345. 10.1109/TCOMM.2004.823564
Article Google Scholar
LeGoff SY, Khoo BK, Tsimenidis CC, Sharif BS: Constellation shaping for bandwidth-efficient turbo-coded modulation with iterative receiver. IEEE Trans. Wireless Commun 2007, 6(6):2223-2233.
Article Google Scholar
Ngo NH, Barbulescu SA, Pietrobon SS: Performance of nonuniform M-ary QAM constellation on nonlinear channels. In Australian Communications Theory Workshop. Australia: ; 2–4 Feb 2005.
Google Scholar
Zhang J, Chen D, Wang Y: A new constellation shaping method and its performance evaluation in BICM-ID. In Vehicular Technology Conference Fall (VTC 2009-Fall). Anchorage, AK: ; 20–23 Sept 2009.
Google Scholar
Valenti M, Xiang X: Constellation shaping for bit-interleaved LDPC coded APSK. IEEE Trans. Commun 2012, 60(10):2960-2970.
Article Google Scholar
Huppert C, Bossert M: On achievable rates in the two user AWGN broadcast channel with finite input alphabets. In ISIT. Nice: ; 24–29 June 2007.
Google Scholar
Cover TM, Thomas JA: Elements of Information Theory. Hoboken: Wiley; 2006.
MATH Google Scholar
Gledhill J, Macavock P, Miles R: DVB-T: Hierarchical Modulation. Geneva: DVB; 2000.
Google Scholar
Schertz A, Weck C: Technical Review: Hierarchical Modulation-the Transmission of Two Independent DVB-T Multiplexes on a Single Frequency. Switzerland: EBU; 2003.
Google Scholar
Singh VOn superposition coding for wireless broadcast channels. Master’s thesis, Royal Institute of Technology, Sweden (2005). www.ee.kth.se/php/modules/publications/reports/2005/IR-SB-EX-0507.pdf On superposition coding for wireless broadcast channels. Master’s thesis, Royal Institute of Technology, Sweden (2005).
Mheich Z, Duhamel P, Szczecinski L, Alberi-Morel ML: Constellation shaping for broadcast channels in practical situations. In 19th European Signal Processing Conference. Barcelona: ; 29 Aug–2 Sept 2011.
Google Scholar
Mheich Z, Alberi-Morel ML, Duhamel P: Optimization of unicast services transmission for broadcast channels in practical situations. Bell Labs Techn. J 2012., 17(5–24):
Mheich Z, Alberge F, Duhamel P: On the efficiency of transmission strategies for broadcast channels using finite size constellations. In 21st European Signal Processing Conference. Marrakech: ; 9–13 Sept 2013.
Google Scholar
Cover TM: Comments on broadcast channels. IEEE Trans. Inform. Theory 1998, 44(6):2524-2530.
Article MathSciNet MATH Google Scholar
Blahut RE: Computation of channel capacity and rate-distortion functions. IEEE Trans. Inform. Theory 1972, 18(4):460-473. 10.1109/TIT.1972.1054855
Article MathSciNet MATH Google Scholar
Yasui K, Matsushima T: Toward computing the capacity region of degraded broadcast channel. In ISIT. Austin, TX: ; 13–18 June 2010.
Google Scholar
Bertsekas DP: Nonlinear Programming. Nashua: Athena Scientific; 1999.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

University Paris-Sud, UMR8506, Orsay, F-91405, France
Zeina Mheich, Florence Alberge & Pierre Duhamel
CNRS, Gif-sur-Yvette, F-91192, France
Zeina Mheich, Florence Alberge & Pierre Duhamel
Supelec, Gif-sur-Yvette, F-91192, 3 rue Joliot-Curie, 91192, Gif-sur-Yvette, Cedex, France
Zeina Mheich, Florence Alberge & Pierre Duhamel

Authors

Zeina Mheich
View author publications
You can also search for this author in PubMed Google Scholar
Florence Alberge
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Duhamel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Florence Alberge.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Mheich, Z., Alberge, F. & Duhamel, P. Achievable rates optimization for broadcast channels using finite size constellations under transmission constraints. J Wireless Com Network 2013, 254 (2013). https://doi.org/10.1186/1687-1499-2013-254

Download citation

Received: 27 June 2013
Accepted: 20 October 2013
Published: 31 October 2013
DOI: https://doi.org/10.1186/1687-1499-2013-254

Achievable rates optimization for broadcast channels using finite size constellations under transmission constraints

Abstract

1 Introduction

2 AWGN broadcast channels

3 Broadcast transmission strategies

3.1 Time sharing

3.2 Hierarchical Modulation (HM)

3.3 Superposition modulation

3.4 Superposition coding

4 Achievable rate regions

4.1 Problem formulation

4.2 Numerical solution

5 Result analysis

5.1 Point to point channel

5.2 Broadcast channel

Definition 1

Definition 2

5.2.1 Superposition modulation

5.2.2 Time sharing or superposition modulation?

5.2.3 Is superposition coding necessary?

6 Application: coverage extension

6.1 The sender can use up to 16 symbols

6.1.1 Scenario 1

6.1.2 Scenario 2

6.2 The cardinality of the existing user alphabet is kept fixed

7 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords