Design and implementation of an OFDMA-TDD physical layer for WiMAX applications
© Carro-Lagoa et al.; licensee Springer. 2013
Received: 19 April 2013
Accepted: 18 September 2013
Published: 17 October 2013
This work describes the design, implementation, and performance evaluation of an orthogonal frequency division multiple access (OFDMA) time-division duplexing (TDD) physical layer (PHY) compliant with the worldwide interoperability for microwave access (WiMAX) standard using a cost-effective software-defined radio (SDR) platform containing field programmable gate array (FPGA) and digital signal processor (DSP) modules. We show that the proposed SDR architecture is capable of supporting the wide variety of configuration options described in the WiMAX standard while fulfilling the stringent requirements of WiMAX OFDMA TDD PHYs. The architecture allows for the implementation of all TDD functionalities in the downlink and the uplink at both the base station and the mobile station. The proposed design is shown to efficiently use the available FPGA and DSP resources. We also carried out specific experiments that take into account the frame and the downlink map messages detection over ITU-R wireless channel models to illustrate the performance of the proposed design. Finally, we discuss the utilization of the proposed hardware architecture to implement the wirelessMAN-advanced air interface.
Worldwide interoperability for microwave access (WiMAX) is a wireless communication standard developed to provide broadband wireless access over large distances. The term WiMAX was adopted by the WiMAX Forum, an organization created to promote interoperability within the IEEE 802.16 family of wireless communication standards.
Together with long-term evolution (LTE), WiMAX is one of the competing radio access technologies to be used by the fourth generation (4G) of mobile communication systems. Although LTE currently seems to be the predominant technology for broadband mobile access, there exist over 150 WiMAX operators with more than 30 million users across the world. In 2013, the number of users is expected to grow by between 6 and 8 million, boosted by some industrial markets that cannot use conventional cellular systems.
Over the last years, several IEEE 802.16 amendments have been approved. For a complete survey of the IEEE 802.16 historical evolution up to 2010, see  and references therein. The most important 802.16 amendments are 802.16d, released in 2004 for point-to-point applications and commonly known as fixed WiMAX, and 802.16e released in 2005 and referred to as mobile WiMAX because it supports mobility and multiple users.
In 2011, the WiMAX standard evolved to amendment 802.16m [3, 4], which focuses on enhancements related to air interface specifications to fulfill the requirements and performance goals established by IMT-advanced while maintaining full backward compatibility with previous WiMAX versions. In August 2012, the latest revision of WiMAX was published and termed 802.16-2012. This revision consolidates material from amendments 802.16j-2009 and 802.16h-2010 and also incorporates 802.16m-2011 but excluding the wirelessMAN-advanced air interface, which is now specified in the IEEE Std 802.16.1-2012. The latest amendments to the standard are 802.16p-2012 and 802.16.1b-2012, which incorporate improvements to support machine-to-machine applications.
WiMAX supports several physical layer (PHY) modes. In particular, the most attractive PHYs are those that utilize OFDMA as the multiple access mechanism to support several users at the same time. OFDMA is based on orthogonal frequency division multiplexing (OFDM) modulation and allows WiMAX base stations (BSs) to divide the available subcarriers into subchannels that can be assigned to different users. OFDMA enables a large flexibility in the radio interface since users can be assigned different bandwidths, time durations, and modulation orders according to their demanded quality of service (QoS) and the available bandwidth.
There exist two basic methods to achieve two-way communication in a wireless communication system: frequency-division duplexing (FDD) and time-division duplexing (TDD). The FDD technique has been preferred by the first generation of wireless systems mainly optimized for symmetric voice traffic. However, with the advent of high-speed data services, TDD is more flexible in supporting variable and asymmetric data traffic. In addition, only TDD offers an efficient and flexible support for the ad hoc and multihop communication scenarios considered in 802.16-2012.
In the literature, most works focus on non-real-time deployments and performance evaluation of WiMAX PHY (see [9–11] as an example). Few of them tackle the complete real-time design, implementation, and performance analysis of the standard. References exist that present the results of several tests in real scenarios to compare the channel models to the obtained data. As an example, path-loss measurements conducted in a rural environment using fixed WiMAX commercial equipment are presented in , and tests in outdoor scenarios using commercial mobile WiMAX equipment are shown in . Another example is  where a performance analysis of the 802.16e OFDMA downlink in vehicular environments (ITU-R M.1225) is considered.
References also exist that consider the real-time implementation of the OFDMA-TDD PHY used in mobile WiMAX, but most of them only consider simplex communications, either downlink [15, 16] or uplink. Only two papers were found which consider a bidirectional OFDMA-TDD PHY: an implementation of 802.16e-2005 carried out in a system-on-chip (SoC) platform, and a SoC baseband implementation integrated in a USB device for high mobility scenarios.
It is possible to find implementations of individual processing blocks of the standard but not integrated in a complete system. As an example, the design of a channel encoder prototyped in a reconfigurable hardware architecture is presented in . Another example is an FPGA architecture of a fixed sphere decoder for a WiMAX system presented in . There are some implementations that aim to support two standards by extracting common signal processing between them and sharing hardware resources, like the study of a dual-mode baseband receiver for 802.11n and 802.16e, and an 802.16m and LTE downlink implementation.
This work differs from existing ones in the literature because it presents a hardware architecture for the implementation of both the downlink and the uplink of OFDMA-TDD PHY for WiMAX applications. We discuss a large number of practical issues and show how they can be solved to fit into the proposed hardware architecture. Although most of the work focuses on mobile WiMAX, we also explain how the proposed architecture can be used to implement the recently standardized wirelessMAN-advanced air interface.
The remainder of this article is organized as follows. Section 2 provides a brief description of the OFDMA-TDD mobile WiMAX PHY. Section 3 describes the proposed hardware architecture for the implementation of an OFDMA-TDD PHY compliant with the mobile-WiMAX standard. Section 4 presents the amount of FPGA and DSP resources consumed by an implementation made with Xilinx system generator while Section 5 is devoted to its experimental evaluation over ITU-R wireless channel models. Section 6 explains how the proposed hardware architecture can be used to implement the PHY of the WirelessMAN-Advanced Air Interface. Finally, Section 7 presents the concluding remarks.
2 Mobile WiMAX physical layer
This section describes the primary features of the mobile WiMAX PHY to be used in the ensuing sections. For a more detailed description, see .
Mobile WiMAX profiles
Uplink resources are shared among mobile stations (MSs), and their allocation and scheduling are centralized on the BS. The latter decides how many slots are assigned to each MS depending on their QoS parameters and bandwidth requirements. Additionally, rectangular time-frequency-shaped regions can be defined in the uplink to allow MSs to perform network entry, improve uplink synchronization parameters, or send special feedback messages, among other tasks.
Data and pilot carriers transmitted in either the uplink or the downlink go through a process of scrambling just before the inverse fast Fourier transform (IFFT) operation, and then a cyclic prefix (CP) is added to its output. The size of this CP is defined as a ratio of the FFT size and can be variable; the valid values are 1/4, 1/8, 1/16, and 1/32, although the WiMAX Forum only requires support for the 1/8 value.
The channel coding procedure has five steps: randomization, forward error correction (FEC), bit interleaving, repetition coding, and modulation. Variable coding rate and modulation are supported to enable adaptive modulation and coding (AMC) capabilities.
3 Mobile WiMAX physical layer design and implementation
This section describes the design and implementation of an OFDMA-TDD PHY compliant with the mobile WiMAX standard. We focus on the mandatory parts of the standard for both the BS and the MS, i.e., OFDMA frame structure, PUSC permutation scheme in downlink and uplink subframes, ranging, and channel coding with tail-biting convolutional codes (TBCC).
3.1 Hardware description
Both BS and MS were implemented using the same hardware elements, namely three commercial off-the-shelf (COTS) modules placed on a peripheral component interconnect (PCI) carrier board, as shown in Figure 2. The first module contains a Texas Instruments TMS320C6416 DSP together with a Xilinx Virtex-II XC2V2000 FPGA. The second module is a Xilinx Virtex-4 XC4VSX55 FPGA, and the third module contains a Xilinx Virtex-4 XC4VSX35 FPGA and an analog add-on module with two digital-to-analog converters (DACs) and two analog-to-digital converters (ADCs). The DACs are Texas Instruments DAC5686, with 16 bits of precision and a maximum sampling rate of 160 Msample/s. The ADCs are Texas Instruments ADS5500, with 14 bits of precision and a maximum sampling rate of 125 Msample/s. Both the Xilinx Virtex-4 XC4VSX55 and Xilinx Virtex-4 XC4VSX35 FPGAs are provided with a large number of embedded multipliers allowing for intensive signal processing operations.
Two kinds of buses were used for communicating between modules: data buses and control buses. The latter are exclusively used for configuration messages. The throughput of the data and control buses is 400 and 20 MB/s, respectively. The communication between the host PCs and their corresponding carrier boards is done through the PCI bus.
It is important to mention that all calculations in our implementation are done in fixed point with 16 bits of precision since there was no need to use fewer bits. On the one hand, no saving is obtained in the DSPs if fewer bits are used, and on the other hand, our design already fitted into the FPGAs when doing calculations with 16 bits of precision.
3.2 Digital up/downconversion
The digital up converter (DUC) and the digital down converter (DDC) are responsible for adapting the signal to the ADCs and DACs sampling rate and I/Q modulation/demodulation. During upconversion, the following tasks are done: upsampling, pulse shaping, and I/Q modulation to a configurable intermediate frequency. The downconverter performs the complementary operations in inverse order, i.e., I/Q demodulation, filtering, and downsampling.
In the proposed OFDMA-TDD WiMAX PHY layer design, the profiles selected by the WiMAX Forum are supported by means of five different bit streams to the FPGAs, each one with a different up/downsampling factor. The converters sampling frequency is fixed at 80 MHz. Hence, the up/downsampling factors to obtain profiles from #1 to #5 are 20, 100/7, 10, 8, and 50/7, respectively. In order to efficiently implement these sample-rate conversions, each FPGA bit stream has a different optimized combination of interpolation/decimation filters as explained in .
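The resampling factors quoted above follow directly from dividing the converter rate by the baseband sampling rate of each profile. The short sketch below reproduces that arithmetic. The baseband sampling rates and the bandwidth labels attached to profiles #1 to #5 are assumptions derived from the usual 802.16 sampling factors (8/7 or 28/25 times the channel bandwidth); they are not listed in this section.

```python
from fractions import Fraction

CONVERTER_RATE_HZ = 80_000_000  # fixed ADC/DAC sampling frequency stated in the text

# Assumed baseband sampling rates (Hz) for profiles #1..#5; the labels and
# rates are an assumption based on the standard 802.16 sampling factors.
PROFILE_RATES_HZ = {
    "3.5 MHz": 4_000_000,
    "5 MHz": 5_600_000,
    "7 MHz": 8_000_000,
    "8.75 MHz": 10_000_000,
    "10 MHz": 11_200_000,
}


def resampling_factors(converter_rate_hz=CONVERTER_RATE_HZ, rates=PROFILE_RATES_HZ):
    """Return the exact up/downsampling factor of each profile as a Fraction."""
    return {name: Fraction(converter_rate_hz, rate) for name, rate in rates.items()}


factors = resampling_factors()
for name, factor in factors.items():
    print(f"{name}: {factor}")
```

Keeping the factors as exact fractions makes it clear which profiles need a rational (interpolate-then-decimate) conversion rather than a plain integer one.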
3.3 Downlink synchronization
Since ADCs are not equipped with a programmable gain amplifier (PGA), normalization of the received signal is performed after the DDC stage. This is done by first computing the average power of the received signal and then applying the resulting value as a constant scale factor during the whole downlink subframe after synchronization. This normalization strategy has been selected because it provides a good compromise between clipping and quantization errors. The frame detection time is also fed to an uplink transmission control block which schedules the emission of the uplink subframe taking into account the subframes size and the transmit/receive transition gap (TTG) and receive/transmit transition gap (RTG) guard intervals.
The energy estimations computed during the first 1,024 samples after the preamble and during the RTG guard interval are stored in a configuration register to allow for their reading from the DSP. These values are eventually used to estimate the signal-to-noise ratio (SNR).
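One plausible way to turn these two energy readings into an SNR figure is sketched below. It assumes that the energy measured after the preamble contains signal plus noise, that the energy measured during the RTG contains noise only, and that both were accumulated over the same number of samples. The function name and the exact formula are illustrative assumptions, not the estimator used in the implementation.

```python
import math


def estimate_snr_db(energy_signal_plus_noise, energy_noise, n_samples=1024):
    """Estimate the SNR in dB from two energy accumulations.

    Assumption: both energies were accumulated over n_samples samples, the
    first one over signal plus noise and the second one over noise only.
    """
    power_rx = energy_signal_plus_noise / n_samples
    power_noise = energy_noise / n_samples
    if power_noise <= 0 or power_rx <= power_noise:
        raise ValueError("energies do not allow a meaningful SNR estimate")
    # Subtract the noise floor to isolate the signal power.
    return 10.0 * math.log10((power_rx - power_noise) / power_noise)
```

For instance, a received energy eleven times larger than the noise energy corresponds to a signal ten times stronger than the noise, i.e., about 10 dB.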
Quantization consists of mapping the input signal and the preamble into -1, 0, and 1 values to avoid the use of complex multipliers and reduce correlation calculation complexity.
Notice that since the received input signal is normalized, the maximum of this function can be easily determined as the sample time when this function exceeds a predefined threshold value.
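The quantized correlation and threshold detection described above can be illustrated with the following minimal sketch. The dead-zone quantization threshold, the squared-magnitude metric, and all function names are assumptions made for illustration; the exact metric of the implemented design is not reproduced here.

```python
def quantize(value, threshold):
    """Map a real value to -1, 0, or 1 using a dead zone of +/- threshold."""
    if value > threshold:
        return 1
    if value < -threshold:
        return -1
    return 0


def quantize_complex(samples, threshold):
    """Quantize the real and imaginary parts of each sample independently."""
    return [(quantize(s.real, threshold), quantize(s.imag, threshold)) for s in samples]


def quantized_xcorr(rx_q, ref_q, lag):
    """Squared magnitude of sum(rx[lag + n] * conj(ref[n])) on ternary data.

    Every product below has operands in {-1, 0, 1}, so a hardware version
    needs only additions and subtractions.
    """
    acc_re = acc_im = 0
    for n, (b_re, b_im) in enumerate(ref_q):
        a_re, a_im = rx_q[lag + n]
        acc_re += a_re * b_re + a_im * b_im
        acc_im += a_im * b_re - a_re * b_im
    return acc_re * acc_re + acc_im * acc_im


def detect_preamble(rx, ref, quant_threshold, detection_threshold):
    """Return the first lag whose metric exceeds the threshold, or None."""
    rx_q = quantize_complex(rx, quant_threshold)
    ref_q = quantize_complex(ref, quant_threshold)
    for lag in range(len(rx) - len(ref) + 1):
        if quantized_xcorr(rx_q, ref_q, lag) > detection_threshold:
            return lag
    return None
```

Because the input is normalized before quantization, a fixed detection threshold can be used instead of searching for the global maximum of the metric.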
where is the combined frequency offset estimation. In the above expression, the frequency offset estimation range of goes from to , while ranges from to . When is out of its range, its value should be adjusted by adding or subtracting multiples of 1/N until it matches the value obtained with the metric.
3.4 Ranging and uplink synchronization
In multiuser mobile environments, time and frequency estimations obtained at MSs cannot be directly used to construct the uplink signal because the relative distance and speed with respect to the BS are not known . In the IEEE 802.16e standard, this problem is solved with the so-called ranging process. In such a process, MSs transmit pseudonoise (PN) sequences generated from a shift register in specific regions of the uplink reserved for this purpose by the BS in a contention-based policy. At the receiver side, the BS must detect the arrival of a ranging code and estimate the synchronization parameters from it. Finally, these parameters are sent back to the MSs in a medium access control (MAC) management message and used to construct the synchronized uplink frames to be transmitted by the MSs.
Two types of ranging regions are defined: initial ranging, used during network entry, and periodic ranging, used when the MSs are already connected. In the case of initial ranging, OFDM symbols containing ranging codes must be transmitted by MSs in pairs, the first symbol with a CP and the second one with a cyclic postfix, hence allowing a wider time synchronization window. In our implementation, the mobile station has a special version of the IFFT block which can receive as a parameter the pattern of cyclic prefixes and postfixes of the sent symbols to accomplish this requirement.
Ranging codes, $p_c(k)$, are sequences of 144 BPSK symbols generated from the output of a pseudorandom binary sequence (PRBS) generator. Different sets of codes are used depending on the purpose of the MS: initial ranging, periodic ranging, bandwidth requests, or handover. When an MS decides to start a ranging process, it selects a code randomly from the corresponding set and then maps it to a ranging region. This mapping in a PUSC zone is done in a distributed fashion, and only groups of four symbols are guaranteed to be transmitted in contiguous subcarriers. The BS must identify the ranging code sent by the MS in order to estimate the uplink synchronization errors.
In the literature, several uplink frequency offset estimation algorithms can be found. These algorithms can be divided into three groups, from lower to higher computational complexity: subband, interleaved, and generalized allocation of subcarriers. Ranging in mobile WiMAX is an example of generalized allocation where the subcarriers reserved to the ranging process can take up any position in the available spectrum. The algorithms defined for this kind of structures are based on a joint maximum likelihood (JML) estimation of the channel response and the frequency offset but with a very high complexity . Notice that the uplink synchronization algorithms selected for our design avoid the complexity of JML algorithms by exploiting the redundancy present in the ranging codes.
Once the ranging code is known, frequency offset can be extracted through reconstruction of the transmitted signal sent by the mobile station. To do so, the received PN sequence is mapped back to the OFDM symbol. Since the initial ranging forces mobile stations to transmit the same ranging code twice in two consecutive symbols, this property can be used to extract the frequency offset through a correlation computation.
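The idea of exploiting the repeated symbol can be sketched as follows. The example assumes two time-domain copies of the same symbol spaced exactly one FFT length apart and a noiseless channel; the function name and the normalization to the subcarrier spacing are illustrative choices, not the estimator of the implemented design.

```python
import cmath
import math


def estimate_cfo(first_copy, second_copy):
    """Estimate the carrier frequency offset from two repeated symbols.

    Assumption: second_copy is the same transmitted waveform as first_copy,
    received exactly one FFT length later. The result is expressed in units
    of the subcarrier spacing and lies in the range (-0.5, 0.5].
    """
    correlation = sum(b * a.conjugate() for a, b in zip(first_copy, second_copy))
    # A frequency offset rotates the second copy by 2*pi*offset radians.
    return cmath.phase(correlation) / (2.0 * math.pi)
```

Offsets larger than half a subcarrier spacing alias into this range, so an estimator of this kind only resolves the fractional part of the offset.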
3.5 Subchannelization and channel equalization
Tasks related with the OFDM modulation are placed in the Virtex-4 SX55 FPGA module. The most important operation is the FFT, which has been implemented using the Xilinx LogiCORE IP fast Fourier transform , allowing for run-time configuration of the transform point size.
Subchannelization in WiMAX involves three operations: interleaving, randomization of subcarriers according to some permutation scheme, and pilot insertion. This structure is specified in the DL-MAP and UL-MAP messages sent by the BS in each frame. As described in Section 2, the DL-MAP message is always mapped on the first two symbols of the downlink subframe, hence providing a complete description of the permutation schemes used and bursts contained inside the subframe. At the receiver, the task of decoding DL-MAP messages showed itself as a critical one since most of the processing of the downlink subframe at the receiver cannot start until this message is completely decoded. On the other hand, the randomization of subcarriers in the uplink cannot be applied to the ranging bursts. As a consequence, this process depends entirely on the uplink burst scheme defined by the BS.
Taking these issues into account, we decided to implement the subchannelization and channel equalization processes in the DSPs to provide maximum flexibility regarding FFT sizes, burst mapping, and eventual support of other permutation schemes. In the MS, the extraction of DL-MAP messages is optimized through the different design layers to minimize the delay of the decoding pipeline rather than implementing a hardware low-level MAC for this purpose .
The selected channel estimation and equalization algorithms are piecewise linear channel coefficients interpolation and zero forcing, respectively. Several analyses of channel estimation and equalization algorithms for WiMAX can be found in the literature showing that the selected method offers an acceptable performance in terms of mean squared error (MSE) and bit error rate (BER) with a low complexity implementation [34, 35]. In the downlink, each symbol is equalized independently in frequency dimension, and in the uplink, all pilot subcarriers in a tile, made up of four subcarriers during three OFDM symbols, are used together to perform this task with a two-dimensional interpolation.
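A one-dimensional version of this scheme, as used per symbol in the downlink, can be sketched as below. Pilot positions, the handling of subcarriers outside the pilot span (nearest pilot value held constant), and the function names are assumptions for illustration only; the two-dimensional uplink tile interpolation is not covered.

```python
def interpolate_channel(pilot_indices, pilot_estimates, n_subcarriers):
    """Piecewise-linear interpolation of complex channel estimates.

    pilot_indices must be sorted in ascending order. Subcarriers outside
    the pilot span reuse the nearest pilot estimate (an assumption).
    """
    channel = []
    for k in range(n_subcarriers):
        if k <= pilot_indices[0]:
            channel.append(pilot_estimates[0])
        elif k >= pilot_indices[-1]:
            channel.append(pilot_estimates[-1])
        else:
            # Index of the last pilot located at or below subcarrier k.
            i = max(j for j, idx in enumerate(pilot_indices) if idx <= k)
            k0, k1 = pilot_indices[i], pilot_indices[i + 1]
            t = (k - k0) / (k1 - k0)
            channel.append(pilot_estimates[i] * (1 - t) + pilot_estimates[i + 1] * t)
    return channel


def zero_forcing(received, channel):
    """Equalize each subcarrier by dividing by its channel estimate."""
    return [y / h for y, h in zip(received, channel)]
```

Zero forcing amplifies noise on subcarriers where the channel estimate is close to zero, which is the usual price paid for its low complexity.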
3.6 Channel coding
Information bits received from higher layers are mapped into constellation points after a channel coding process that includes randomization and bit interleaving. Additionally, the repetition coding step is performed over the constellation-mapped data in a slot-by-slot manner. In the proposed design, channel coding is mainly implemented in the Virtex-II FPGA, although the optional repetition coding step and the processing control are carried out in the DSP, using the FPGA as a coprocessor. In this work, we focus on the TBCC coding scheme with variable rate and with QPSK, 16-QAM, and 64-QAM constellations, both in the downlink and in the uplink.
The encoder in a tail-biting scheme has a complexity similar to that of a zero-tail encoder. The encoder was implemented adding a CP to each FEC block with a size equal to the constraint length of the shift register (in the case of mobile WiMAX, this value is seven). The decoder has a higher complexity because the starting state of the trellis is unknown before decoding. Maximum likelihood (ML) decoding achieves optimum performance, but it requires decoding the received block starting with all the possible initial states, which increases decoding complexity to unacceptable levels . The implemented channel decoding process uses a suboptimal technique which provides a good compromise between decoding quality and complexity, where the first bits of the block are appended after the block, and the last bits at the beginning of the block . The size of the chunks added at the beginning and at the end of the blocks is equal to the traceback length configured in the Viterbi decoder. If a block is shorter than the traceback length, it is just sent three times to the decoder and only the output corresponding to the second repetition is taken into account.
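The block wrapping performed before the Viterbi decoder can be sketched as follows. The sketch only builds the extended input and reports which part of the decoder output corresponds to the original block; the Viterbi decoder itself is not modeled. The function name and the representation of the block as one list entry per trellis step are assumptions.

```python
def wrap_for_viterbi(block, traceback_len):
    """Build the extended decoder input for a tail-biting block.

    block holds one entry per trellis step (for example a tuple of soft
    bits). Returns (extended, start, stop) such that the decoder output
    for steps start..stop-1 corresponds to the original block.
    """
    if traceback_len <= 0:
        raise ValueError("traceback_len must be positive")
    n = len(block)
    if n < traceback_len:
        # Short block: send it three times and keep the middle repetition.
        return block * 3, n, 2 * n
    # Last steps go in front of the block, first steps go after it.
    extended = block[-traceback_len:] + block + block[:traceback_len]
    return extended, traceback_len, traceback_len + n
```

The extra steps at both ends let the decoder converge to the correct trellis state before and after the useful part of the block, at the cost of decoding 2 × traceback length additional steps.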
Additionally, the decoder performs a carrier-to-interference and noise ratio (CINR) estimation based on the demodulated data symbols by computing an error vector magnitude (EVM) measurement. This estimation was implemented in the soft decisor by mapping the soft bits back to symbols, hence obtaining a reliable estimation of the transmitted symbols. Then, the MSE between the received signal and the estimated transmitted symbols is calculated and saved in a register so that the DSP can read it. This algorithm provides an accurate estimation of the CINR as long as decision errors are kept at low levels. If this is not the case, an overestimation of the CINR will occur.
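A decision-directed estimate of this kind can be sketched as below for QPSK. The nearest-point hard decision stands in for the mapping of soft bits back to symbols, and the unit-energy QPSK constellation and function name are assumptions for illustration.

```python
import math

# Unit-energy QPSK constellation (assumed for this sketch).
QPSK = [complex(a, b) / math.sqrt(2) for a in (1, -1) for b in (1, -1)]


def estimate_cinr_db(rx_symbols, constellation=QPSK):
    """Decision-directed CINR estimate based on the error vector magnitude."""
    error_energy = 0.0
    signal_energy = 0.0
    for y in rx_symbols:
        # Hard decision: nearest constellation point to the received symbol.
        decided = min(constellation, key=lambda point: abs(y - point))
        error_energy += abs(y - decided) ** 2
        signal_energy += abs(decided) ** 2
    if error_energy == 0.0:
        return math.inf
    return 10.0 * math.log10(signal_energy / error_energy)
```

When a noisy symbol crosses a decision boundary, the measured error is the distance to the wrong (closer) point, which is why this kind of estimator overestimates the CINR at low signal quality.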
3.7 Physical layer control
The subframes structure is controlled from the higher layers in the BS using a service access point (SAP) protocol and is sent to the MS through MAC management messages (DL-MAP, UL-MAP, downlink channel descriptor (DCD), and uplink channel descriptor (UCD)). This SAP allows for defining the subframes structure, for sending and receiving data bursts, and for transmitting and detecting ranging codes.
The downlink subframe must follow some constraints regarding the permutation zone and burst definitions. First of all, bursts must be time-frequency rectangular-shaped and should always span a multiple of two symbols in time and a multiple of a subchannel size in frequency (this is the so-called slot unit according to WiMAX terminology). Moreover, several users can be grouped into a single burst to reduce overhead in the DL-MAP definition and to speed up the generation of bursts. Finally, the BS has to distribute the available resources between users taking into account their QoS parameters.
There are several solutions to face these problems , but in our implementation, the Ohseki algorithm  was chosen because of its good compromise between computational complexity and allocation losses. The general idea of this algorithm is to assign all users with equal burst profile to the same burst and to allocate its resources in a frequency-first policy, hence avoiding any burst overlapping in the frequency domain.
Resource management in the uplink is more flexible since it is only necessary to indicate the number of slots allocated to each station with no constraints regarding the time-frequency burst-shape. The allocation size is decided by the MAC layer taking into account the QoS parameters negotiated for connections and the bandwidth requirements sent by the MSs as signaling headers in the uplink.
4 Resource utilization
FPGA resource utilization
The FPGAs resource allocation shown in Table 2 considers separately the cases of the BS and the MS. The main difference between both designs lies in the synchronization block in the MS, which requires 58% of the slices of the Virtex-4 SX35. The quantized cross-correlation algorithm is the most demanding block inside this synchronization module. Another difference is caused by the ability of the MS to add cyclic postfixes to the output of the IFFT. This requirement is necessary for sending the initial ranging codes.
FPGA resource utilization of each processing block
Clock frequency (critical path)
80 MHz (12.47 ns)
Frame control (BS)
DUC 10 MHz
DUC 8.75 MHz
DUC 3.5 MHz
DDC 10 MHz
160 MHz (12.10 ns)
DDC 8.75 MHz
160 MHz (12.10 ns)
DDC 3.5 MHz
160 MHz (12.10 ns)
100 MHz (9.47 ns)
100 MHz (9.79 ns)
The Virtex-4 SX55 is a high-resource FPGA that allowed for the implementation of the FFT blocks without a resource-optimized design, hence a pipelined architecture was used allowing for continuous data processing. However, the Virtex-II V2000 is resource limited, which forced us to optimize the FEC design.
Regarding DSP resources, Table 4 shows the memory usage of each task and an estimation of the DSP cycles required for the processing performed inside each task. The estimation of the DSP cycles is obtained from a static analysis of the assembly code generated by the compiler. We also present an estimation of the time required to execute each task in the last column of the table. These time estimations were obtained making the following assumptions:
The 8.75-MHz profile is used with 1,024 subcarriers and a cyclic prefix length of 1/8.
The frame duration is 5 ms, with 25 symbols in the downlink and 18 symbols in the uplink.
The subframes are used entirely for data transmission.
Data subcarriers are modulated in 64-QAM and convolutional coding with rate 3/4.
Every 16 frames, there is a ranging burst of 30 subchannels and 3 symbols.
The tasks which use the internal DSP memory are executed at 600 MHz, while the tasks that only use ZBTRAM memory are executed at 100 MHz.
The data copy between the DSP tasks is performed at 800 MB/s. The communication with the FPGAs does not consume DSP time.
Static analysis of DSP resource utilization
DSP cycles per frame
Estimated time (µs)
BS+MS: PHY control (FEC)
3,000 K BS
0 K BS
5,500 K MS
0 K MS
Rep 2: 56 ×nslots
Rep 4: 83 ×nslots
Rep 6: 134 ×nslots
FEC data preprocessing
7 + nbunc × 2/4
FEC data postprocessing
9 + nbunc × 2/4
BS: TX PUSC DL
6 + n fft + (3 + N fft /4) × nsymb
Add burst to frame
147 × nsubc × nsymb/2
(11 + nused × 2) × nsymb
BS: RX PUSC UL
8,501 + (11 + Nused × 2) × nsymb
Extract burst from frame (equalization included)
6 + nslots × 1,140
BS: RX ranging
36,059 × nsubc/6 × nsrang/3
82 × nsubc/6 × nsrang/3
Extract ranging code
21,504 × nsubc/6 × nsrang/3
144 × nsubc/6 × nsrang/3
14,329 × nsubc/6 × nsrang/3
MS: RX PUSC DL
(11 + nused × 2) × nsymb
425 × nsubc × nsymb
Extract burst from frame
(16 + nsubc × 8) × nsymb/2
MS: TX PUSC UL
14 + nslots × 7
Add burst to frame
6 + nslots × 182
8,501 + (11 + Nused × 2) × nsymb
The estimated total DSP time is 958.56 and 927.89 µs for the BS and MS, respectively. This is an optimistic estimation since it does not take into account the time consumed by the kernel, context switches, and interrupt handling. Furthermore, the delay of the communication with the FPGA and the interdependence between the processing tasks can lead to long waiting times for FPGA data. This means that careful planning of concurrent processing is also needed to fulfill the 5-ms frame duration.
5 Experimental results
The channel emulator was implemented on a Xilinx Virtex-4 FPGA using the Xilinx XtremeDSP development kit. As shown in Figure 4, it consists of a channel coefficient generator, an interpolator, a channel filtering stage, and an additive white Gaussian noise (AWGN) generator. It accepts parameters like the average power and delay of each tap, the noise power, and the intermediate frequency of the input signal. The coefficient interpolation factor as well as the Doppler power spectrum are defined at compilation time, and they are fixed during the emulation.
ITU-R M.1225 channel models
Number of paths
Power of each path (dB)
Path delay (ns)
All channel models use the Jakes Doppler power spectrum density, and a 2.4-GHz carrier frequency was assumed for the Doppler spread calculations. The maximum delay of these channels (3,700 ns) does not exceed in any case the default 1/8 CP length (11,429 ns); hence, intersymbol interference (ISI) is avoided. It is important to note that the pedestrian A scenario stands out because it has a low multipath diversity. Multipath diversity is an inherent property of wireless channels that occurs whenever the channel power delay profile is rich enough to provide replicas of the transmitted signal at the receiver input. In time-varying scenarios, the amplitude and the phase of such replicas change over time. The pedestrian A channel model only contains four paths with the last two being rather attenuated. Furthermore, the path delay spread is rather small so the frequency selectivity of this channel is rather low, hence allowing for a good channel equalization. On the contrary, notice that the pedestrian B and vehicular A scenarios have higher multipath diversity and larger path delay spreads.
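The CP duration quoted above can be checked with a few lines of arithmetic. The sketch assumes a 1,024-point FFT and an 11.2-MHz baseband sampling rate; both are assumptions chosen because they reproduce the 11,429-ns figure, and they are not stated explicitly in this paragraph.

```python
def cp_duration_ns(fft_size, cp_ratio, sampling_rate_hz):
    """Duration of the cyclic prefix in nanoseconds."""
    return fft_size * cp_ratio / sampling_rate_hz * 1e9


ASSUMED_FFT_SIZE = 1024            # assumption, not stated in this paragraph
ASSUMED_SAMPLING_RATE_HZ = 11.2e6  # assumption matching the quoted CP length
MAX_CHANNEL_DELAY_NS = 3700        # maximum path delay quoted in the text

cp_ns = cp_duration_ns(ASSUMED_FFT_SIZE, 1 / 8, ASSUMED_SAMPLING_RATE_HZ)
print(round(cp_ns), cp_ns > MAX_CHANNEL_DELAY_NS)  # prints: 11429 True
```

Under these assumptions the CP is roughly three times longer than the longest channel delay, which is consistent with the absence of ISI noted above.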
As explained in Section 3, MSs estimate the SNR of the received signal using the values obtained during the synchronization process. These estimated SNR values were used to calibrate the AWGN generator, hence matching the noise power added in the emulator with the estimated SNR in the MS. This way, the SNR at the receiver is under control in all scenarios.
6 WirelessMAN-advanced air interface
The IEEE 802.16m standard introduces a completely new definition of the PHY known as advanced air interface (AAI). The configurability of the parameters is reduced to a large extent, but additional features like multiple-input multiple-output (MIMO) and hybrid automatic repeat request (H-ARQ) are now mandatory to accomplish the minimum requirements of the standard, and also, backward compatibility is mandatory. For a more detailed description, see .
A new profile with a channel bandwidth of 20 MHz and 2,048 subcarriers is added while the 3.5-MHz profile is discarded. To implement this new profile, the FFT size needs to support 2,048 subcarriers, and the DUC/DDC blocks have to support an additional up/downsampling factor of 25/7. The new frame structure is divided into superframes of 20 ms. Each superframe is made up of four 5-ms frames. The main difference with the old frame structure is the way the frames are subdivided into subframes to increase the flexibility of the allocation of downlink and uplink zones. Each subframe can be dynamically configured for downlink or uplink transmission. This dynamic behavior imposes the need to improve the Frame control block to be more flexible.
The synchronization mechanisms have been improved by defining two new preambles: the PA preamble, with a fixed number of pilot subcarriers regardless of the FFT size to be used by the advanced base station (ABS), and the SA preamble, with a structure and purpose similar to the preamble of the previous release.
The new subchannelization scheme is designed to simplify the channel estimation and to reduce the signaling overhead required for the burst placement, and it only depends on the MIMO scheme at use.
The AAI defines new MIMO configurations to support single user MIMO (SU-MIMO) and multiple user MIMO (MU-MIMO) schemes, both with adaptive and non-adaptive precoding. The WiMAX Forum defines the minimum number of ABS antennas as two, while the advanced mobile station (AMS) can operate with only one antenna. This leads to the need to replicate processing in transmit and receive chains only in the ABS.
For the initial ranging and handover mechanisms, new ranging preambles are added with extended length. Ranging preambles are transmitted with a subcarrier spacing which is a fraction of the regular frequency spacing. This behavior can be achieved with larger FFT sizes; hence, an adjustable FFT size in the corresponding processing blocks could be desirable.
Channel coding in 802.16m uses only two FEC schemes. On the one hand, convolutional turbo coding (CTC) is the scheme defined to transmit the data bursts. On the other hand, a TBCC encoder with rate 1/5 is used to encode the control information. Therefore, it would be necessary to implement two encoding and decoding algorithms inside the FEC processing block. The mandatory H-ARQ processing can be addressed inside the PHY Control task.
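The tail-biting principle used by the TBCC can be sketched as follows: the shift register is preloaded with the last message bits, so the encoder starts and ends in the same state and no tail bits are transmitted. This is only an illustrative encoder. The five generator masks are placeholder values, not the polynomials of IEEE 802.16m, and the bit-order convention (newest bit in the least significant position) is a choice of this sketch.

```python
def tbcc_encode(bits, generators, constraint_length):
    """Tail-biting convolutional encoder with rate 1/len(generators).

    bits: list of 0/1 message bits (at least constraint_length - 1 of them).
    generators: integer tap masks, constraint_length bits each.
    """
    memory = constraint_length - 1
    if len(bits) < memory:
        raise ValueError("message shorter than the encoder memory")
    mask = (1 << memory) - 1

    # Preload the register with the last `memory` message bits so that the
    # initial state equals the final state (tail-biting condition).
    state = 0
    for b in bits[-memory:]:
        state = ((state << 1) | b) & mask

    coded = []
    for b in bits:
        register = (state << 1) | b  # newest bit in the LSB
        for g in generators:
            # Each output bit is the parity of the tapped register bits.
            coded.append(bin(register & g).count("1") & 1)
        state = register & mask
    return coded


# Placeholder generators for a rate-1/5, constraint-length-7 code
# (hypothetical values for illustration only).
GENERATORS = [0o133, 0o171, 0o165, 0o117, 0o127]

message = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 0]
codeword = tbcc_encode(message, GENERATORS, 7)
print(len(message), len(codeword))
```

Because the trellis is circular, a cyclic shift of the message produces a cyclic shift of the codeword, which is a convenient sanity check for an implementation. The matching decoder, which must cope with the unknown starting state, is not covered by this sketch.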
We have addressed the design and implementation of real-time OFDMA-TDD PHYs compliant with the WiMAX standard. We have presented a cost-effective SDR hardware architecture made up of FPGA and DSP modules that allows for the real-time implementation of all OFDMA-TDD PHY functionalities in the downlink and in the uplink at both the BS and the MS of the mobile WiMAX standard. We explained in detail the different design decisions adopted to accomplish this stringent objective. The proposed design is shown to efficiently use the available FPGA resources. The downlink and the uplink of the implemented BS and MS were evaluated experimentally in real time using a hardware device that emulates AWGN and ITU-R wireless channel models. Specific performance metrics that take into account the detection of the frames and the DL-MAP messages were considered to illustrate the adequate performance of the proposed design. Finally, the utilization of the proposed hardware architecture to implement the WirelessMAN-advanced air interface is discussed.
a Set to 36 in the IEEE Std. 802.16e.
This work has been partially supported by Indra Sistemas S.A., the Spanish Ministry of Defence with the technical direction of PEC/ITM under grant DN8644-COINCIDENTE, MINECO of Spain under grant TEC2010-19545-C04-01 and Xunta de Galicia, Spain, under grant 2012/287. The authors wish to thank J. M. Camas-Albar from Indra Sistemas S.A. for his help.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.