Low complexity PAPR reduction methods for carrier-aggregated MIMO OFDMA and SC-FDMA systems

New generation wireless systems target to support much higher data rate which requires larger bandwidth than the existing systems. Lack of contiguous spectrum resources and the compatibility requirement with the legacy systems make the concept of carrier aggregation (CA) a practical means for supporting high data rate in next generation systems. One of the practical issues related to the CA is the dramatically increased peak-to-average power ratio of the time domain signal. This article proposes several low-complexity and no-overhead PAPR reduction methods for bandwidth aggregated systems with orthogonal frequency division multiplexing/multiple-access or single-carrier frequency division multiple access in multi-in multi-out system configurations. For the proposed phase-rotation based PAPR reduction methods, a reliable phase rotation detector at the receiver is developed to maintain the receiver performance. Performance evaluation under 3GPP LTE-Advanced environment shows the advantage of the proposed methods while not requiring any signaling overhead.


Introduction
The demand of higher data rate to support various wireless services and applications increases dramatically over the last decade. 3GPP has adopted carrier aggregation (CA) [1] where multiple LTE Release 8 [2,3] component carriers (CCs) are combined to provide the targeted or required data rate. One of the downsides that CA systems face is the significantly increased peak-to-average power ratio (PAPR) of their time domain signal. Hence, PAPR control is an important practical issue for carrier aggregated systems.
However, all of those works consider typical OFDM systems which are not carrier aggregated systems. In addition, some systems (e.g., uplink (UL) of LTE Release 8 and LTE-A) use single carrier frequency division multiple access (SC-FDMA) which has different PAPR characteristics [27] than OFDM or OFDMA systems [28]. The promising schemes with significant PAPR reduction such as PTS and SLM require substantial signaling overhead which grows linearly with the number of CCs. The complexity of such schemes in the CA systems increases exponentially with the number of CCs. Thus, PAPR reduction methods with reduced complexity and lower or no signaling overhead are needed for CA systems.
In this article, we propose several PAPR reduction methods which aim to reduce system complexity, eliminate the need of signaling overhead and provide good PAPR reduction performance in carrier-aggregated OFDM/OFDMA and SC-FDMA multi input multi output (MIMO) systems. The first three methods namely multi-symbol SLM (MSSLM), partial SLM (PSLM) and sequential SLM (SSLM) are based on the SLM technique with some modifications. The fourth method called partial linear precoding across time (PLPT) relies on several different representations of the signal through linear precoding of subcarrier data across time, and the transmitter selects the representation with the lowest PAPR to transmit.
In a recent work [29] which considers a singleantenna OFDM system, signaling overhead associated with PAPR reduction is eliminated by means of inserting pilot tones in each OFDM symbol. The receiver obtains the phase rotations applied in the PAPR reduction from the inserted pilots. While applying a similar concept from [29] in our first three methods, we aim to reduce the cost of pilot tones at each OFDM symbol. We propose to use the pilots inserted for channel estimation and tracking according to the standard specification, thus eliminating requirement of additional pilots at each multiple access (MA) symbol, i.e., either OFDMA or SC-FDMA symbol. This leads to the use of PAPR optimization intervals with multiple symbol durations for the first three methods. Another difference from [29] is that we consider MIMO carrier-aggregated systems which have their own characteristics and constraints.
In the LTE-A, there are several MIMO schemes specified, including space frequency block coding (SFBC), spatial multiplexing (SM) and beam forming. The later two schemes require the modifications made on the signal of each antenna for PAPR reduction to be identical; otherwise, the original MIMO properties will be destroyed. Hence, in our proposed PAPR reduction schemes, all phase changes or linear precodings are made to be the same across all antennas for all MIMO schemes. This will limit the PAPR reduction capability, but the MIMO properties can be preserved. Our PAPR reduction methods can be applicable to multi-user MIMO scenario (also a mode of LTE-A downlink (DL)) as will be discussed later in the article.
Regarding PSLM, MSSLM and SSLM, while adopting SLM concept, our contributions include (i) the concept of multi-symbol PAPR optimization intervals for signaling overhead reduction, (ii) the concept of partial group selection for complexity reduction, (iii) adaptation to CA systems, (iv) the extension to the MIMO scenarios, and (v) a reliable phase rotation detector at the receiver to remove the effects of PAPR reducing phase rotations on the receiver performance. These methods provide low-complexity and require no signaling overhead by the exploitation of existing reference symbols for signaling overhead reduction. Parts of these contributions were presented in [30].
The fourth proposed method PLPT offers an alternative means of reducing PAPR by a linear precoding of data on some subcarriers across two adjacent OFDMA or SC-FDMA symbols. Like SLM based methods, every different representation of the same signal will have different PAPR value and the transmitter selects a representation which has the minimum PAPR to transmit. While LPT and SLM based methods share some contributions together, PLPT does not need multi-symbol PAPR optimization intervals, each of which contains pilot signal for signaling overhead reduction, because PLPT does not use phase rotation vectors. For PLPT, a modified receiver can detect whether the precoding has been applied or not.
The rest of the article is organized as follows. Section 2 presents system model, and Section 3 describes the proposed methods. Simulation results are discussed in Section 4, and Section 5 concludes this article.

System model
In this article, we assume a general cellular system which adopt OFDMA for its DL and use SC-FDMA (or discrete Fourier transform (DFT) spread OFDM) in its UL. The aggregation of several CCs is introduced in order to utilize wider bandwidth (for higher data rate) while keeping backward compatibility with the older standard. Each DL/UL transmission subframe for a user on a CC contains several subcarriers over several MA symbols forming a resource grid (RG).
Let N t , N l and N symb denote the number of the transmit antennas, number of the MIMO layers and the subframe duration in unit of OFDM or SC-FDMA symbol duration, respectively. Define f i = f c i − f c 0 which represents the center frequency difference of the ith CC and the reference 0th CC in the considered link (DL or UL). Figure 1 shows the baseband model of two CCs where the top CC is the reference 0th CC and the bottom CC is shifted to center frequency f 1 . In general, there are two primary modes, SM and transmit diversity (TD) for DL. For UL, only SM mode is considered and a DFT precoding is applied between layer mapping and MIMO precoding stages.

DL model
In the DL MIMO SM scenario, the modulation symbols {d(n)} are divided into multiple layers denoted as is the nth modulation symbol in the vth layer of ith CC and W is a N t × N l MIMO precoding matrix (see [2]). Z i (n) = [z  For the DL MIMO TD scenario, the precoding for TD is defined for two or four antennas. For transmission on two antennas, space-frequency block code (SFBC) is adopted and the precoding operation is defined by

UL model
For the UL MIMO scheme, after the modulation symbols are layered, each layer b (v) (n) will be DFT precoded as where R is the number of assigned subcarriers to the UL per user and l ranges from 0 to N symb -1 except those slots for the reference signal. In UL, only SM is adopted in the MIMO scheme; therefore, we can write the MIMO precoded symbols Z i (n) in (1) as where y i (n) = [y increasing order) of the ith CC and F i (m) represents the mth index of F i . The final output z where m = 0, ..., M sc -1, l = 0, ..., N symb -1. The con- i,l (t) in the lth OFDMA or SC-FDMA symbol of the ith CC in a subframe of antenna p is given by for 0 ≤ t ≤ NT s (which will be shifted to the lth symbol interval later) and l = 0, ..., N symb -1, where N is the inverse DFT (IDFT) size without oversampling and T s is the corresponding sample duration. Note that the lowpass equivalent representation uses f c 0 , the center frequency of the 0-th CC, as the reference bandpass carrier frequency in the conversion; hence the term e j2π f i t appears in (7). We ignore cyclic prefix (CP) in the PAPR calculation since it is merely the cyclic extension of the signal itself.

Multi-symbol selective mapping (MSSLM)
In SLM, PAPR optimization is done for each OFDM symbol, therefore it requires high complexity and overhead. An intuitive approach for reducing complexity and overhead will be processing across multiple MA symbols, i.e., processing for PAPR reduction is not done at each symbol independently but rather applied the same across multiple symbols jointly. Therefore, we propose the concept of multi-symbol optimization interval denoted by the MA symbol index set T . The choice of the PAPR optimization interval (the number of OFDM/ SC-FDMA symbols over which PAPR is defined and minimized) is constrained by the tradeoff between PAPR reduction capability and control signaling overhead. The optimization interval of one symbol provides the best PAPR reduction capability at the cost of substantial control signaling overhead. On the other hand, using the interval with a very large number of symbols will substantially save the signaling overhead but the PAPR reduction might be negligible or unjustifiable for the associated complexity. Thus, a proper tradeoff is necessary. Based on our investigation, we recommend a PAPR optimization interval of around four symbols for SC-FDMA based systems while for OFDM DL systems a slightly larger interval may be selected if associated PAPR reduction is acceptable.
As pilots or reference symbols are typically transmitted in practical systems, we can take advantage of them in removing or reducing the control signaling overhead. By including reference symbols in the optimization interval, i. e., applying the same phase rotations for both reference symbols and data symbols, the effects of the phase rotations can be absorbed in the channel estimates, or the phase rotation can be detected (see Section 3.5) and its effect can be removed at the receiver. By this way, no separate signaling overhead is required to inform the receiver of the applied phase rotations.
A transmission frame may contain more than one reference symbols. In this case, we may have more than one PAPR optimization intervals. To illustrate, let us consider the DL of the LTE-A system. There are seven symbols (normal CP length) in a slot and reference symbols for DL are placed at either first and fourth OFDMA symbols or at second and fifth OFDMA symbols. Considering the former, we can form two PAPR optimization intervals of three symbols and four symbols, with the corresponding symbol index sets T 1 = {0, 1, 2} and T 2 = {3, 4, 5, 6} , in each slot as shown in Figure 2. As there is one reference symbol in each interval, we do not need any control signaling overhead at all in this case.
With the introduction of the optimization interval, we propose MSSLM to overcome the complexity and overhead burden of the original SLM. Consider a PAPR optimization interval T . All symbols X (p) i,l,k of the IDFT input in the ith CC are partitioned into S disjoint groups over the above optimization interval T , repre- applies phase rotations to all groups. The phase rotation j i,q which rotates the ith CC's qth group is from the set F = {Φ m : m = 0, ..., P -1}, where P is the number of phase choices applied to each group of subcarriers. Set M as the number of CCs. We define a phase combination v F SM which is a length-SM vector where its elements belong to F. Each different vector v refers to one set of possible phase rotations that can be applied onto the transmit signal. Therefore, the phase shifted signal corresponding to a phase combination v, denoted bỹ where the set J i,l,q contains indices of all subcarriers which belong to the qth group of ith CC at lth symbol. Note that we can set j 0,0 to be zero without affecting the performance, thus the search space would be F SM-1 . Each combination results in a different final PAPR value. The transmitter will simply choose the combination which provides the best result. Note that the PAPR reduction phase vectors are the same across antennas but the actual PAPR values are in general different from antenna to antenna. We need only one metric to decide which phase vector to be used for PAPR reduction. Denote V as the collection of all possible combinations vectors v where each vector represents one possible phase combination. We have tested two candidate metrics: (1) using the maximum PAPR across all antennas as (9) and (2) using the average PAPR across all antennas as v * = arg min where PAPR s We have found out that using the latter provides a better result, thus we have adopted it in our methods. Therefore, we can write the time-domain transmit signals which have the minimum PAPR as

Partial selective mapping (PSLM)
In MSSLM, since all S groups are selected for phase optimization, the complexity increases exponentially with S. To be practical, S will be limited to a small number. Alternatively, we propose PSLM which applies i,l,k of the IDFT input in the ith CC are partitioned into S' disjoint groups over the optimization interval T where S' can be larger S. Next, PSLM applies phase rotations to K selected groups out of all S'. Note that the phase rotations φ i,q for unselected groups are set to zero.
The total number of groups per CC, S', as well as the number of selected groups for processing per CC, K, provide a tradeoff between PAPR reduction capability and complexity (plus overhead) if no reference or pilot symbols are available. The underlying system parameters (e.g., resource block sizes) also shape the choice of those group numbers.
The other aspect that should be considered is how to address for different resource block sizes especially for UL. In order to limit the complexity regardless of the resource block size, we adopt to use a fixed number of groups while adjusting the size of each group to accommodate for the change in the resource block size.
The next step is to find out which groups should be selected. For example, in the LTE-A scenario with four selected groups out of six groups, there are 15 possible selections and it is unlikely that all will give the same PAPR reduction performance. As will be shown later in Section 4, a proper selection does yield a better PAPR reduction performance.
As DL contains data for all users, it has much larger resource size than the UL. The same procedure can be applied for DL but the values of S' and K can be chosen differently (especially, larger than UL case to gain more PAPR reduction) if desired. On the other hand, base stations typically have power amplifiers with large linearity range and energy consumption is of less concern. Hence, even using the same values of S' and K as in the UL may be sufficient for the DL.
Note that all the steps for the partial group selection can be done offline once, and hence there is no implementation complexity associated with it. The group numbers and selection are fixed across all CCs of all antennas. This provides simplicity and modularity in implementation.
With the concept of optimization interval and partial group selection, the partially phase shifted signal s (p) CA (v, t) corresponding to a phase combination v F KM for PSLM can be written as where q' = 0, ..., S' -1 and the selection of signal which has the minimum PAPR is similar to (12). The detection of the phase rotations can be done at the receiver in a similar manner as in MSSLM, and hence no signaling overhead is required for PAPR reduction.

Sequential selective mapping (SSLM)
This scheme aims to reduce the complexity of SLMbased approaches by adopting a sequential optimization. Define the phase combination set V r which contains all length-S' M vectors as where its (r+1)th to S' Mth elements are zeros i.e.  (15) which yields the same PAPR reduction characteristics based on our investigation results. In fact, we can set j 0,0 to be zero without affecting the PAPR reduction performance, thus we can start from r = 2.
In SSLM, we can represent the sequentially phase shifted signals where r = 2, ..., S' M. For every r, there are P possible phases to apply, therefore, P(S' M -1) combinations of phase rotations are available to apply to the RGs differently. At the rth iteration, the selected combination v' * r is given by where the definition of the PAPR is given by (11). The search continues in an increasing order of r until a predefined PAPR threshold PAPR threshold is reached bỹ for all antennas or until r = S' M. The phase rotation effect at the receiver can be addressed in the same way as in MSSLM, thus requiring no associated signaling overhead.

Partial linear precoding across time (PLPT)
This scheme aims to remove the requirement of explicit signaling (e.g., SLM) or pilot-based implicit signaling (e.g., MSSLM, SSLM and PSLM), thus providing overhead saving and flexibility in its application. It applies linear precoding of data of some subcarrier groups across every two MA symbols. The choice of which subcarrier groups is based on the minimum average PAPR as defined in (10). Denote w to be the group indices selected to be PLPT-processed and r be the index of the MA symbol pair corresponding to the MA symbol index l = 2r and 2r + 1. After the PLPT process, the subcarrier data of the rth MA symbol pair are given by X' where and The corresponding time domain signal over two MA symbol interval can be written as Similar to (12), the signal transmitted is the one with the optimal selection w* which yields the lowest average PAPR. An illustration of PLPT method is shown in Figure 3. PLPT does not require any reference signal to convey the information of the selection made by the transmitter. At the receiver, whether PLPT is applied on a selected group or not can be determined from the equalized received signals on those subcarriers. Denote those equalized signals corresponding to i,2r+1 . Then, the receiver can apply the same LPT processing on those equalized signals as (23) to get estimates of the original signal before precoding at the transmitter. Alternatively, the receiver can perform maximum likelihood (ML) detection over a larger signal space consisting of both LPT-processed and unprocessed symbols.

Phase rotation detection
Reference signal aids in acquiring and tracking channel knowledge for data detection at the receiver. The use of phase rotations for PAPR reduction over multiple intervals within a transmission frame should not affect the channel estimation and tracking. This issue can easily be addressed by detecting the phase rotations at the receiver based on the reference signal. There can be many ways to find out the phase rotations at the receiver. Here we provide a simple solution. For each reference signal on each CC at each receive antenna m, consider the same group partitions as at the transmitter with J q and L (q,l) denoting the subcarrier indexes of group q and of pilots for transmit antenna l in group q, respectively. Note that L (q,l) is different for different l as in LTE-A. Suppose the phase rotation applied for PAPR reduction at transmit antenna l is j q (same for all l). Then the received reference signal Y m (k) and the original reference signal R l (k) on pilot tone k ∈ L (q,l) are related by Y m (k) = H l,m (k)e jφq R l (k) + N m (k), k ∈ L (q,l) (24) where H l,m (k) is the channel gain on subcarrier k corresponding to transmit antenna l and receive antenna m and N m (k) is the noise term on tone k at receive antenna m.
The channel estimateĤ l,m (k) at pilot location which includes the effect of phase rotation can be easily obtained aŝH , k ∈ L (q,l) . (25) For each group q, based on Ĥ l,m (k) : k ∈ L (q,l) , the estimates of phase-rotated channel gains on all tones within the group q denoted by Ĥ l,m (k) : k ∈ J q are obtained by interpolation/extrapolation (we use cubic interpolation). Note that the last subcarrier of group q denoted by q 0 and the first subcarrier of group q + 1 denoted by q 1 are adjacent and hence the channel gains on these two tones will be almost the same. Once all groups have been processed for all transmit-receive antenna pairs, the phase differenceˆ φ (q),(q+1) between j q and j q+1 can be obtained from the estimates of phase-rotated channel gains on q 0 and q 1 aŝ where 〈⋅〉 represents a slicer which maps the angle in the argument to the nearest phase from the possible phase rotations. To retrieve {j q } from ˆ φ (q),(q+1) , a reference phase is needed. For MSSLM and SSLM, the phase of the first group of each CC can be set to zero and used as the phase reference. For PSLM, as some of the groups are not phase-rotated, their group phases are zero and can be used as the phase reference.
Once the phase rotations have been known at the receiver, the estimates of the channel gains without phase rotation can easily be obtained by simply removing the phase rotation effects from Ĥ l,m (k) . Next, further improvement of channel estimates (e.g. by exploiting limited channel delay spread) or/and channel tracking for time-varying channels can be done as in conventional systems. Thus, with reliable phase rotation detection as will be corroborated in the simulation section, the performance of the remaining receiver processing algorithms will not be affected.

Applicability to multi-user MIMO
In multi-user MIMO (MU-MIMO), data streams of multiple users occupy the same spectrum. In LTE-A DL, MU-MIMO mode supports two simultaneous users on the same set of subcarriers. MU-MIMO uses additional user specific pilot sequences so that MIMO channels of different users can be distinguished at each user. Any processing for PAPR reduction should not disturb the relationship among different user specific pilot sequences.
For the proposed MSSLM, PSLM, and SSLM methods, the size of a group is 18 resource blocks (i.e., 216 subcarriers). If the users of MU-MIMO mode occupy the resource blocks within a group of the PAPR reduction approach, then the phase rotation of the proposed methods will be the same for all subcarriers assigned to those users. Then there will be no effect on the interrelationship of user specific pilot sequences and the proposed methods can be applied to the MU-MIMO scenario. The required  Figure 3 An illustration of PLPT processing on two adjacent MA symbols.
condition for the applicability can easily be met by means of an appropriate resource allocation algorithm.
The PLPT method can be applicable to MU-MIMO scenario as follows. When we apply LPT processing on a group, we can skip LPT processing on the tones which contain user specific pilot sequences. Then those user specific pilot sequences are untouched, and MU-MIMO transmission is maintained. In this case, the resource blocks assigned to MU-MIMO users need not be limited within a group of the PAPR reduction approach.

System setup
System parameters are adopted from the current LTE-A specification up to date [1] where N sc = 1320 and Δf = 15 kHz. The oversampling factor for PAPR calculation is U = 8, and hence IDFT size of each CC is N × U = 2, 048 × 8 = 16, 384 for the reliable computation of PAPR. The predefined phase set is F = {0, π} for PSLM and MSSLM. We use = 0, π 2 , π , 3π 2 for SSLM because of its low complexity. Note that, unlike MSSLM and PSLM, in SSLM, the complexity increases only linearly with a larger P. To evaluate MIMO CA systems without complexity burden, we use two non-contiguous CCs (300 MHz apart) and N t = 2 antennas for both UL and DL system. We expect that the use of larger number of CCs would give larger PAPR reduction. Number of groups S' = 6 for PSLM, SSLM and PLPT and the number of selected groups K = 4 for both PSLM and PLPT. MSSLM uses S = K = 4 groups in each CC and SSLM uses all 6 groups without setting a PAPR threshold. The optimization interval for MSSLM, PSLM and SSLM is set to four symbols and PLPT uses a pair of MA symbols as its optimization interval. For the UL where SC-FDMA is used, the considered UL user occupies R = 72 subcarriers (contiguous) on each CC. In DL OFDM scenario, we assume 1,296 subcarriers are occupied by multiple users on each CC, where in LTE 1296 is the largest number that a base station can allocate to a user.
In evaluating PAPR reduction techniques, complementary cumulative distribution function (CCDF) of PAPR is commonly used. Fran et al. [27] reported a linear dependency between average PAPR and required power backoff for the range of power backoff up to 9 dB. On the other hand, [26,31] proposed that CCDF of instantaneous normalized power (INP) conveys a more relevant measure. Here, we will present both performance metrics. To have a fair comparison among different methods, in each result, PAPR calculation is done based on each MA symbol for each antenna.

PAPR performance
First, we evaluate the effect of group selection in LTE-A UL SISO scenario. 72 subcarriers are divided into 6 groups. We select 4 out of all 6 groups and all 15 possible selections are evaluated by simulation, and some selected results in terms of the CCDF of PAPR are shown in Figure 4. The CCDF curve of "Original" represents the time domain signal without any PAPR reduction method applied. Selection 1 curve represents the case where the four phase-rotated groups are located at the last four group locations. Selection 2 curve corresponds to the 2-5th group locations, while selection three curve chooses the first four group locations.
The results show that the group selection can make a difference in PAPR reduction. The group selection 1 shows the best performance and hence it is adopted in the rest of the simulations for PSLM and PLPT.
The CCDF performances for the UL MIMO SC-FDMA are shown in Figure 5. Note that the choice of SC-FDMA for UL is due to its low PAPR characteristics, yet the proposed techniques enable additional PAPR reduction gains. The MSSLM provides about 1.2 dB PAPR reduction over the original signal, while the PSLM achieves about 1.5 dB PAPR reduction at a CCDF level of 0.01. PLPT has better performance than PSLM, achieving about 2 dB PAPR reduction. SSLM performs similarly to MSSLM, however, it requires less complexity. These gains translate into better energy efficiency for handheld mobile devices. Figure 6 shows the CCDF of INP. As it conveys sample-wise average performance, the gaps between different curves are much reduced if compared to the PAPR curves in Figure 5. Noticeably, PLPT method outperforms all other methods.
The DL MIMO OFDMA results are shown in Figure 7. Although base stations can handle much larger PAPR than mobile devices, the use of multiple CCs results in relatively large PAPR even for the base stations as can be seen in the figure with only two CCs. PLPT reduces PAPR by 2.1 dB at a CCDF level of 0.01 while PSLM has 1.8 dB in PAPR reduction. When MSSLM produces 1.6 dB PAPR reduction, the low complexity SSLM decreases PAPR by 1.5 dB. Similarly, we provide Figure 8 to show the CCDF of INP of a time domain sample. The same discussion as for Figure 6 applies here as well.
We tested the performance of our proposed phase rotation detection for the DL 2 × 2 MIMO setup described above with four possible phases at SNR of 3 dB. Out of the results for 10,000 independent channel realizations, we observed no phase rotation detection error. This corroborates reliable performance of the proposed phase rotation detector and confirms the applicability of the proposed phase-rotation-based PAPR reduction methods without affecting the receiver performance.

Complexity and overhead
The PAPR reduction performance of original SLM will be better than the proposed methods as it applies PAPR 4.5 5 5.5 6 6.5 7 7.5 8 10    reduction on each symbol. However, it is too complex to simulate with the aforementioned simulation parameters, and it requires substantial signaling overhead while the scope of the proposed methods stays within the framework with low complexity and no signaling overhead. If the original SLM [18] were applied for UL scenario, it would require M × N symb × P SM = 2 × 4 × 2 6 × 2 = 32,768 IDFT operations with IDFT size N × U = 16,384, N symb × P SM = 4 × 2 6 × 2 = 16,384 PAPR calculations, each based on 16, 384 samples, and N symb × ⌈log 2 P SM ⌉ = 48 bits (not coded yet for error protection) for control overhead. Hence it is not practically appealing due to its high complexity and overhead.

Conclusions
This article has presented PAPR reduction schemes for carrier aggregated systems, using LTE-A standard as an example. Both OFDMA and SC-FDMA are considered and MIMO setup is included. The proposed MSSLM reduces the complexity and removes the signaling overhead required by the traditional SLM scheme with the introduction of the multi-symbol optimization interval together with the aid of a reference/pilot signal within that interval. Partial group selection is then introduced in PSLM as an additional means to ease the processing requirement. SSLM relieves more computation burden by reducing the size of the signal representation space in the PAPR optimization. PLPT method takes a different approach on generating different signal representation by linearly precoding two adjacent MA symbols groupwisely. PLPT would be better suited of handheld device (UL) as it offers more PAPR reduction gain and shifts some signal processing (detection of precoding) to the receiver side (base station). On the other hand, if a lower complexity receiver is needed at handheld devices, any one from MSSLM, PSLM and SSLM can be adopted for the DL. With a minor performance difference, SSLM offers the most complexity-efficient solution for both UL and DL. The proposed phase rotation detection at the receiver shows very reliable performance, confirming applicability of the proposed SLM based methods without affecting receiver performance. Overall, the proposed schemes provide appreciable PAPR reduction gains while requiring low complexity and no signaling overhead. Thus, they offer appealing solutions to the PAPR reduction for bandwidth-efficient carrier aggregated systems.