# Adaptive antenna selection and Tx/Rx beamforming for large-scale MIMO systems in 60 GHz channels

- Ke Dong
^{1}, - Narayan Prasad
^{2}, - Xiaodong Wang
^{3}Email author and - Shihua Zhu
^{1}

**2011**:59

https://doi.org/10.1186/1687-1499-2011-59

© Dong et al; licensee Springer. 2011

**Received: **24 November 2010

**Accepted: **11 August 2011

**Published: **11 August 2011

## Abstract

We consider a large-scale MIMO system operating in the 60 GHz band employing beamforming for high-speed data transmission. We assume that the number of RF chains is smaller than the number of antennas, which motivates the use of antenna selection to exploit the beamforming gain afforded by the large-scale antenna array. However, the system constraint that at the receiver, only a linear combination of the receive antenna outputs is available, which together with the large dimension of the MIMO system makes it challenging to devise an efficient antenna selection algorithm. By exploiting the strong line-of-sight property of the 60 GHz channels, we propose an iterative antenna selection algorithm based on discrete stochastic approximation that can quickly lock onto a near-optimal antenna subset. Moreover, given a selected antenna subset, we propose an adaptive transmit and receive beamforming algorithm based on the stochastic gradient method that makes use of a low-rate feedback channel to inform the transmitter about the selected beams. Simulation results show that both the proposed antenna selection and the adaptive beamforming techniques exhibit fast convergence and near-optimal performance.

## Keywords

## 1 Introduction

The 60 GHz millimeter wave communication has received significant recent attention, and it is considered as a promising technology for short-range broadband wireless transmission with data rate up to multi-giga bits/s [1–4]. Wireless communications around 60 GHz possess several advantages including huge clean unlicensed bandwidth (up to 7 GHz), compact size of transceiver due to the short wavelength, and less interference brought by high atmospheric absorption. Standardization activities have been ongoing for 60 GHz Wireless Personal Area Networks (WPAN) [5] (i.e., IEEE 802.15) and Wireless Local Area Networks (WLAN) [6] (i.e., IEEE 802.11). The key physical layer characteristics of this system include a large-scale MIMO system (e.g., 32 × 32) and the use of both transmit and receive beamforming techniques.

To reduce the hardware complexity, typically, the number of radio-frequency (RF) chains employed (consisting of amplifiers, AD/DA converters, mixers, etc.) is smaller than the number of antenna elements, and the antenna selection technique is used to fully exploit the beamforming gain afforded by the large-scale MIMO antennas. Although various schemes for antenna selection exist in the literature [7–10], they all assume that the MIMO channel matrix is known or can be estimated. In the 60 GHz WPAN system under consideration, however, the receiver has no access to such a channel matrix, because the received signals are combined in the analog domain prior to digital baseband due to the analog beamformer or phase shifter [11]. But rather, it can only access the scalar output of the receive beamformer. Hence, it becomes a challenging problem to devise an antenna selection method based on such a scalar only rather than the channel matrix. By exploiting the strong line-of-sight property of the 60 GHz channel, we propose a low-complexity iterative antenna selection technique based on the Gerschgorin circle and the stochastic approximation algorithm. Given the selected antenna subset, we also propose a stochastic gradient-based adaptive transmit and receive beamforming algorithm that makes use of a low-rate feedback channel to inform the transmitter about the selected beam.

The remainder of this paper is organized as follows. The system under consideration and the problems of antenna selection and beamformer adaptation are described in Section 2. The proposed antenna selection algorithm is developed in Section 3. The proposed transmit and receive adaptive beamforming algorithm is presented in Section 4. Simulation results are provided in Section 5. Finally Section 6 concludes the paper.

## 2 System description and problem formulation

*N*

_{ t }transmit and

*N*

_{ r }receive antennas both of omni-directional pattern operating in the 60 GHz band. The radio wave propagation at 60 GHz suggests the existence of a strong line-of-sight (LOS) component as well as the multi-cluster multi-path components because of the high path loss and inability of diffusion [3, 4]. Such a near-optical propagation characteristic also suggests a 3-D ray-tracing technique in channel modeling (see Figure 1), which is detailed in [12]. In our analysis, the transceiver can be any device, defined in IEEE 802.15.3c [5] or 802.11ad [6], located in arbitrary positions within the room. For each location, possible rays in LOS path and up to the second-order reflections from walls, ceiling, and floor are traced for the links between the transmit and receive antennas. In particular, the impulse response for one link is given by

*A*

^{(i)},

*T*

^{(i)}, , , , , are called the inter-cluster parameters that are the amplitude, delay, departure, and arrival angles (in azimuth and elevation) of ray cluster

*i*, respectively, and

denotes the cluster constitution by rays therein, where *α*^{(i,k)}, *τ*^{(i,k)},
,
,
,
are the intra-cluster parameters for *k* th ray in cluster *i*. Some inter-cluster parameters are usually location related, e.g., the severe path loss in cluster amplitude; some are random variables, e.g., reflection loss, which is typically modeled as a truncated log-normal random variable with mean and variance associated with the reflection order [12], if linear polarization is assumed for each antenna. Besides, most intra-cluster parameters are randomly generated. On the other hand, for the short wavelength, it is reasonable to assume that the size of antenna array is much smaller than the size of the communication area, which leads to a similar geographic information for all links. It naturally accounts for the strong and near-deterministic LOS component and the independent realizations from reflection paths in modeling the overall channel response.

*i*= 1, 2, ...,

*N*

_{ r }and

*j*= 1, 2, ...,

*N*

_{ t }, where the entry

*h*

_{ ij }denotes the channel response between transmitter

*j*and receiver

*i*by aggregating all

*N*

_{rays}traced rays between them at the delay of the LOS component,

*τ*

_{0}; and is the amplitude of ℓth ray in the corresponding link. Analytically, we can further separate the channel matrix in (3) into H

_{LOS}and H

_{NLOS}accounting for the LOS and non-LOS components, respectively

where the Rician *K*-factor indicates the relative strength of the LOS component.

*N*

_{ t }and

*N*

_{ r }, are large. However, the numbers of available RF chains at the transmitter and receiver,

*n*

_{ t }and

*n*

_{ r }, are such that

*n*

_{ t }≪

*N*

_{ t }and/or

*n*

_{ r }≪

*N*

_{ r }. Hence, we need to choose a subset of

*n*

_{ t }

*× n*

_{ r }transmit and receive antennas out of the original

*N*

_{ t }

*× N*

_{ r }MIMO system and employ these selected antennas for data transmission (see Figure 2). Denote

*ω*as the set of indices corresponding to the chosen

*n*

_{ t }transmit antennas and

*n*

_{ r }receive antennas, and denote H

_{ ω }as the submatrix of the original MIMO channel matrix H corresponding to the chosen antennas.

_{ ω }, a transmit beamformer , with ||w

**||**= 1, is employed. The received signal is then given by

*s*is the transmitted data symbol; is the system signal-to-noise ratio (SNR) at each receive antenna;

*E*

_{ s }and

*N*

_{0}are the symbol energy and noise power density, respectively; is additive white Gaussian noise vector. At the receiver, a receive beamformer , with ||u|| = 1, is applied to the received signal r, to obtain

*ω*and known channel matrix H

_{ ω }, the optimal transmit beamformer w and receive beamformer u, in the sense of maximum received SNR, are given by the right and left singular vectors of H

_{ ω }corresponding to the principal singular value

*σ*

_{1}(H

_{ ω }), respectively. The optimal antenna subset is then given by the antennas whose corresponding channel submatrix has the largest principal singular value. Letting be a set each element of which corresponds to a particular choice of

*n*

_{ t }transmit antennas and

*n*

_{ r }receive antennas, we have

One variation to the above antenna selection problem is that instead of the numbers of available RF chains (*n*_{
t
} , *n*_{
r
} ), we are given a minimum performance requirement, e.g., *σ*_{1} ≥ *ν*. The problem is then to find the antenna subset with the minimum size such that its performance meets the requirement.

### Problem statement

Our problem is to compute the optimal antenna set
and the corresponding transmit and receiver beamformers w and u for a ray-traced MIMO channel realization H. However, for the system under consideration, H is not available to us, but rather, we only have access to the receive beamformer output *y*(*ω*, w , u ). This makes the straightforward approach of computing the singular value decomposition (SVD) of H_{
ω
}to obtain the beamformers impossible. Furthermore, the brute-force approach to antenna selection in (7) involves an exhaustive search over
possible antenna subsects, which is computationally expensive.

In this paper, we propose a two-stage solution to the above problem of joint antenna selection and transmit-receive beamformer adaptation. In the first stage, we employ a discrete stochastic approximation algorithm to perform antenna selection. By setting the transmit and receive beamformers to some specific values, this method computes a bound on the principal singular value of H_{
ω
}corresponding to the current antenna subset *ω*, and then iteratively updates *ω* until it converges. Once the antenna subset *ω* is selected, in the second stage, we iteratively update the transmit and receive beamformers w and u using a stochastic gradient algorithm. At each iteration, some feedback bits are transmitted from the receiver to the transmitter via a low-rate feedback channel to inform the transmitter about the updated transmit beamformer.

In the next two sections, we discuss the detailed algorithms for antenna selection and beamformer adaptation, respectively.

## 3 Antenna selection using stochastic approximation and Gerschgorin circle

### 3.1 The stochastic approximation algorithm

*y*(

*ω*, w , u ) in (6), which is a noisy function of the channel submatrix H

_{ ω }. On the other hand, the objective function to be maximized for antenna selection is the principal singular value of H

_{ ω }as in (7). If we could find a function

*ϕ*(

*·*) of

*y*such that it is an unbiased estimate of

*σ*

_{1}(H

_{ ω }), then we can rewrite the antenna selection problem (7) as

In [10], the stochastic approximation method is introduced to solve the problem of the form (8). The basic idea is to generate a sequence of the estimates of the optimal antenna subset where the new estimate is based on the previous one by moving a small step in a good direction towards the global optimizer. Through the iterations, the global optimizer can be found by means of maintaining an occupation probability vector π, which indicates an estimate of the occupation probability of one state (i.e., antenna subset). Under certain conditions, such an algorithm converges to the state that has the largest occupation probability in π. Compared with the exhaustive search approach, in this way, more computations are performed on the "promising" candidates, that is, the better candidates will be evaluated more than the others.

Due to the potentially large search space in the present problem, which not only limits the convergence speed but also makes it difficult to maintain the occupation probability vector, the algorithms in [10] can become inefficient. Here, we propose a modified version of the stochastic approximation algorithm that is more efficient to implement, and more importantly, it fits naturally to a procedure for estimating the principal singular value of H_{
ω
}based on the receive beamformer output *y*(*ω*, w , u ) only.

Specifically, we start with an initial antenna subset *ω*^{(0)} and an occupation probability vector *π*^{(0)} = [*ω*^{(0)}, 1] ^{
T
} , which has only one element, with the first entry serving as the index of the antenna subset and the other entry indicating the corresponding occupation probability. We divide each iteration into *n*_{
t
} + *n*_{
r
} subiterations, and in each sub-iteration, we replace one antenna in the current subset *ω* with a randomly selected antenna outside *ω*, resulting in a new subset
that differs from *ω* by one element. By comparing their corresponding objective functions, the better subset is updated as well as the occupation probability vector. This procedure is repeated until all *n*_{
t
} + *n*_{
r
} antennas are updated.

Instead of keeping records for all candidates, we dynamically allocate and maintain record in π for the new subset found in each iteration. If a subset already has a record in **π**, the corresponding occupancy probability will be updated. Otherwise, a new element is appended in π with the subset index and its occupation probability. Such a dynamic scheme avoids the huge memory requirement, since typically in practice, only a small fraction of the all possible subsets is visited.

We replace the selected subset with the current subset if the current subset has a larger occupation probabilities in **π**. Otherwise, keep the selected subset unchanged, thus completes one iteration.

In general, the convergence is achieved when the number of iterations goes to infinity. In practice, when it happens that one subset is selected in a large number, say 100, consecutive iterations, the algorithm is regarded as convergent and terminated, and the last selected subset is the global (sub)optimizer. Since most of the evaluations and decisions are generally made at the receiver, a low-rate and error-free feedback channel is assumed to coordinate the transmitter via feedback information. In each subiteration, the transmitter should know in advance which transmit antennas have been left in the current subset (i.e., *ω*^{(n)}) from last subiteration (because the current subset might have been changed in the previous subiteration), and then could generate a new subset by replacing the one with a random transmit antenna outside *ω*^{(n)}. Without feedback an invalid situation might happen such that a transmit antenna, which is already assigned to one RF chain in the current subset, is selected again for another RF chain. In other words, feedback is necessary only in subiterations in which the current subset has changed for the transmit antennas during the last update in the previous subiteration. This implies that the amount of feedbacks is rather limited.

The modified stochastic approximation algorithm for antenna selection is summarized in Algorithm 1. In what follows we discuss the form of the objective function *ϕ*(*·*) in (8) and its calculation.

### 3.2 Estimating the principal singular value using Gerschgorin circle

The Gerschgorin circle theorem [13] gives a range on a complex plane within which all the eigenvalues of a square matrix lie. In this section, we show that a good approximation to the largest eigenvalue can be calculated as long as the Rician *K*-factor is high enough. By calculating the G-circles, a simple estimator *ϕ*(·) of the objective function in (8) is developed and employed in the stochastic approximation algorithm for antenna selection, i.e., Algorithm 1.

*k*th transmit antenna and the

*n*

_{ r }receive antennas in the subset

*ω*. The correlation matrix of H

_{ ω }is then

_{ ω }in descending order as . Then, according to the Gerschgorin circles theorem [13], these

*n*

_{ t }eigenvalues lie in at least one of the following circles

*nt*circles are centered along the positive real axis. Since the correlation matrix R

_{ ω }is positive semi-definite, all eigenvalues are located along the positive real axis within these circles, as illustrated in Figure 3. Note that from (10) to (11), a circle with a larger center coordinate implies a larger channel gain for the corresponding transmit antenna; and a circle with a smaller radius implies a smaller channel correlation between the corresponding antenna and the other selected antennas. As seen from Figure 3, the right-most point among the

*n*

_{ t }circles is the upper bound for all eigenvalues and such a point can be used as the estimate of the largest eigenvalue of R

_{ ω }. That is,

*σ*

_{1}of H

_{ ω }is related to

*λ*

_{1}through , we can rewrite (7) as

*B*

_{1}is the maximum over the λ

_{1}norms of the rows of R

_{ ω }. In particular, letting R

_{ ω }= [

*r*

_{ ij }] we have

Next we prove a lemma that provides a useful bound on *B*_{1} and *λ*_{1}.

*n*

_{ t },

*r*}, it can be readily verified that

Combining (18) with (19) we have the desired result.

*y*(

*ω*, w , u ) in (6) is available. We will obtain an approximation to the lower bound on

*B*

_{1},

*λ*

_{1}given in the right-hand side (RHS) of (15) in the following way. For each transmit antenna in the subset

*ω*,

*k*= 1, ...,

*n*

_{ t }, we set the transmit and receive beamformers as

_{ k }is a length-

*n*

_{ t }column vector of all zeros, except for the

*k*-th entry which is one; and

**1**is a length-

*n*

_{ r }column vector of all ones. The transmitted symbol is set as

*s*= 1. Then by (5)-(6), we have the corresponding receive beamformer output given by

^{1}

Then, using Lemma 1 and its proof, we see that
is indeed a lower bound on *B*_{1} as well as *λ*_{1}(R_{
ω
}).

_{ k }, we will make multiple, say

*M*transmissions, and denote the corresponding receive beamformer outputs as

*y*(

*k*)

^{(m)},

*m*= 1, ...,

*M*. A smoothed version of the estimator

*β*(

*k*) is then given by

*M*→ ∞ we have

Recall that in the stochastic approximation algorithm for antenna selection, at each iteration, we sequentially update the transmit and receive antennas and compute the corresponding objective functions. The above approach for calculating the objective function fits naturally in this framework, since for each transmit antenna candidate, we only need to transmit a pilot signal from it and then compute the corresponding . The complete antenna selection algorithm is now summarized in Algorithm 1.

*Remark-1:*We note that a typical scenario in 60 GHz has a strongly LOS channel with

*K*≫ 1 and one dominant path, so that H

_{LOS}= ab

^{ H }is a rank one matrix. Moreover, in many applications, it is feasible to retain all receive antenna elements, so that the task reduces to selection of the optimal transmit antenna subset. In this case, neglecting H

_{NLOS}and the background noise (which holds for

*K*,

*M*≫ 1), it can be verified that

*the transmit antenna subset which maximizes*

*also results in the largest eigenvalue*.

*In particular*

*where we use*
*to denote the*
*evaluated for a particular subset and where the approximation becomes exact in the limit of large K*, *M*.

*Remark-2:*So far, we have assumed that only one receive beamformer is employed for a given choice of receive antenna subset. Suppose upto

*r*receive beamformers {u

_{1}, ..., u

_{ r }} (which are columns of a

*n*

_{ r }

*× n*

_{ r }unitary matrix) could be used for each transmit beamformer e

_{ k },

*k*= 1, ...,

*n*

_{ t }. Then, invoking Lemma 1 and defining , we see that a better approximation can be obtained as

Finally, we note that for a given *n*_{
t
} , *n*_{
r
} , *r*, the channel-independent constant can be omitted when computing the metric in (25) or (30).

## 4 Adaptive Tx/Rx beamforming with low-rate feedback

_{ ω }is chosen, the transmit and receive beamformers w and u will be computed. As mentioned in Section 2, w and u should be chosen to maximize the received SNR, or alternatively, to maximize the power of the receive beamformer output in (6),

*|y*(

*ω*, w , u )

*|*

^{2}, i.e.,

Since the channel matrix H_{
ω
}is not available, we resort to a simple stochastic gradient method for updating the beamformers.

### 4.1 Stochastic gradient algorithm for beamformer update

*K*

_{ t }perturbation vectors for the transmit beamformer, , and

*K*

_{ r }perturbation vectors for the receive beamformer, . Then for each of the normalized perturbed transmit-receive beamformer pairs

**Algorithm 1** Adaptive antenna selection using stochastic approximation and G-circle

INITIALIZATION:

*n* ⇐ 0;

Select initial antenna subset *ω*^{(0)} and set π^{(0)} = [*ω*^{(0)}, 1] ^{
T
} ;

Transmit pilot signals from each selected transmit antenna and obtain the received signals using the selected receive antennas {*y*(*k*)^{(m)}, *m* = 1, ..., *M*; *k* = 1, ..., *n*_{
t
} + *n*_{
r
} };

Compute the objective function *ϕ*(*ω*^{(0)}) using (24)-(25);

For *n* = 1, 2, ...

For *k* = 1, 2, ..., *n*_{
t
} + *n*_{
r
}

SAMPLING AND EVALUATION:

Replace the *k* th element in *ω*^{(n)}by a randomly selected antenna that is not in *ω*^{(n)}to obtain a new subset
that differs with *ω*^{(n)}by only one element;

For a newly selected transmit antenna, transmit pilot signals from it and obtain the received signals {*y*(*k*)^{(m)}, *m* = 1, ..., *M*};

For a newly selected receive antenna, sequentially transmit pilot signals from all transmit antennas and obtain the received signals;

Recalculate the objective function using (24)-(25).

ACCEPTANCE:

EndIf

Feed back *ω*^{(n)}if the update affects any transmit antenna therein

EndIf

ADAPTIVE FILTERING:

Set forgetting factor: *μ*(*n*) = 1*/n*;

π^{(n)}= [1 - *μ*(*n* + 1)] π^{(n)};

*π*^{(n)}(*ω*^{(n)}) = *π*^{(n)}(*ω*^{(n)}) + *μ*(*n* + 1);

EndFor (*k*)

SELECTION:

EndIf

*ω*^{(n+1)}= *ω*^{(n)}; π^{(n+1)}= π^{(n)};

EndFor (*n*)

where *β* is a step-size parameter, the corresponding received output power *|y|*^{2} are measured, and the effective channel gain *|u*^{
H
}*H*_{
ω
}*w|*^{2} can be used as a performance metric independent of transmit power. Finally, the beamformers are updated using the perturbation vector pair that gives the largest output power at the receiver. The transmitter is informed about the selected perturbation vector by a ⌈log *K*_{
t
} ⌉-bit message from the receiver. The algorithm is regarded as convergent, and the iteration terminates when the performance metric fluctuates below a tolerance threshold. The algorithm is summarized as follows.

**Algorithm 2** Stochastic gradient algorithm for beamformer update

INITIALIZATION:

Initialize w^{(0)} and u^{(0)}

For *n* = 0, 1, ...

PROBING:

Generate *K*_{
t
} and *K*_{
r
} new beamformer vectors using (32) based on w^{(n)}and u^{(n)}, respectively;

Evaluate the received power *|y|*^{2} for each one of the *K*_{
t
}*K*_{
r
} perturbed beamformer pairs;

UPDATE AND FEEDBACK:

Let p_{
j*
}and q_{
i*
}be the perturbation vectors that give the largest received power;

Feedback the index of the best transmit perturbation vector using *⌈* log *K*_{
t
} ⌉ bits;

EndFor

### 4.2 Implementation issues

We next discuss some implementation issues related to the above stochastic gradient algorithm for beamformer update.

#### Initialization

A good initialization can considerably speed up the convergence of the above stochastic gradient algorithm compared with random initialization. For the application considered in this paper, recall that the channel consists of a deterministic LOS component H_{LOS} and a random component. When the *K*-factor is high, the LOS component mostly determines the largest singular mode. Hence, we can initialize the transmit and receive beamformers as the right and left singular vectors of H_{LOS}, respectively, which we will call it a hot start.

#### Parameterization

^{ T }}, Im{w

^{ T }}]

^{ T }, it is equivalent to a point on the surface of the 2

*n*

_{ t }-dimensional unit sphere. Thus, v can be parameterized by (2

*n*

_{ t }- 1)-dimensional vector ψ as follows [15]

*ε*> 0 and generate i.i.d. random variables , which are uniformly distributed within and another independent uniform random variable . Then, new parameters are obtained within some predefined boundaries, given by

where
denotes that *x* is confined in the interval of [*a*, *b*], i.e.,
if *a* ≤ *x* ≤ *b*,
if *x > b* and
if *x < a*. As a result, uniform search for the better weight vector is confined within a fixed space defined by [*a*_{
i
} , *b*_{
i
} ], 1 ≤ *i* ≤ 2*n*_{
t
} - 1 and the range of the perturbation depends on the definition of {*δ*_{
i
} }. For example, given a hot start, the current weight vector maybe very close to the optimizer, and it is necessary to set a smaller search region and a finer perturbation.

#### Parallel reception

Since at each iteration, the best beamformer pair is chosen out of *K*_{
t
}*K*_{
r
} combinations based on the corresponding output powers *|y|*^{2}, it would require *K*_{
t
}*K*_{
r
} transmissions. In practice, instead of switching to different the receive beamformers and making the corresponding transmissions for each transmit beamformer, we can set up *K*_{
r
} parallel receiver beamformers to obtain *K*_{
r
} receiver outputs simultaneously. Then, only *K*_{
t
} transmissions are needed for each iteration.

#### Conservative update

If all candidate *K*_{
t
} + *K*_{
r
} beamformers at each iteration are generated anew, then the algorithm is termed aggressive. On the other hand, a conservative strategy keeps the best transmit and receive beamformers from the previous iteration and generates *K*_{
t
} -1 new transmitand *K*_{
r
} -1 new receive beamformers for the current iteration. With a fixed step size and a single feedback bit, the advantage of the aggressive update is the quicker convergence. But with multiple feedback bits, such an advantage is less significant. Therefore, the conservative update is preferable for a finer performance upon convergence.

## 5 Simulation results

We consider an empty conference room with dimension 4m(L) × 3m(W) × 3m(H) for analysis, in which a large-scale MIMO system with *N*_{
t
} = 32 and *N*_{
r
} = 10 transmit and receive antennas operating at the 60 GHz band is randomly located. All the antennas are omni-directional with 20 dBi gain and vertical linear polarized. There are 10 available RF chains at both the transmitter and the receiver, i.e., *n*_{
t
} = 10 and *n*_{
r
} = 10. To generate the channel realizations, 3-D ray tracing is performed between the transceiver using the inter- and intra-cluster parameters specified for the conference room scenario in [12]. By the result of ray tracing, the 32 × 10 channel matrix is gathered using (3). The channel remains static during antenna selection and beamformer update. Note that the channels simulated in the sequel are covered by Remark-1 in Section 3.2. Also, OFDM-based PHY is used as suggested in [5], where 512 subchannels divide total 2.16 GHz bandwidth. The default system SNR is assumed as *ρ* = 60dB. The insertion loss on signal power due to the switches between the RF chains and antennas is considered as an extra 5 dB increase in noise figure.

### Performance of antenna selection with fixed size

*M*= 20. Since the search space is quite large, i.e., , in the same figure, we also plot the largest eigenvalues of the best and the worst subsets among 1,000 randomly selected antenna subsets. Moreover, the single-run performance of the antenna selection algorithm in [10] is also shown. In Figure 5, the average performance of 100 runs for the above schemes is plotted in a larger span of iterations. Several observations are in order. First, it is seen that the G-circle estimates are quite close to the actual largest eigenvalues, which validates the use of G-circle as a metric for antenna selection in strong line-of-sight channels. Secondly, Algorithm 1 has a much faster convergence rate than the algorithm in [10], which at each iteration picks the next candidate subset randomly and independent of the current subset, whereas Algorithm 1 searches for the next candidate subset in the neighborhood of the current subset. Thirdly, Algorithm 1 can lock onto a near-optimal antenna subset very quickly, e.g., in 10-20 iterations, and it significantly outperforms the exhaustive search over a large number (e.g., 1,000) of subsets.

### Performance of antenna selection with variable size

*n*

_{ t }, Algorithm 1 is performed to generate a sequence of, e.g., 20, antenna subsets. If all of them meet the requirement, i.e.,

*λ*

_{1}≥ 0.05, we backup the current parameters (i.e., current iteration number, selected subset, probability vector, etc.), and then terminate the current iteration and set

*n*

_{ t }←

*n*

_{ t }-1. If again the condition is met, a new backup is performed to simply replace the previous one. As shown in Figures 6 and 7,

*n*

_{ t }keeps decreasing until the selected subsets do not meet the requirement for a number of iterations, e.g., 50, which means the last

*n*

_{ t }is the desired minimum size . Therefore, by restoring the last backup data, the terminated iteration in Algorithm 1 is resumed till the optimal antenna subset with size is found. In Figure 6, we show both the G-circle estimates and the exact largest eigenvalues of the selected subsets. Since the estimation provides a lower bound to the largest eigenvalue and G-circle, a margin should be taken into consideration when setting the minimum performance requirement in order to guarantee that the actual performance of the selected subset meets the requirement with minimum number of selected antenna.

### Performance of adaptive beamforming

*K*

_{ t }= 16 and

*K*

_{ r }= 16, respectively; hence, the number of feedback bits is log 16 = 4. The conservative update with step size 0.05 is used. The performance of the Algorithm 2 with a random initialization and hot start is plotted, as well as the exact largest eigenvalue of the channel obtained by SVD. It is seen that the hot start can significantly speed up the convergence. In Figure 9, we compare the performance of Algorithm 2 with different number of feedback bits, i.e.,

*K*

_{ t }= 2, 4, 8, 16 and fixed

*K*

_{ r }= 16. It is seen that by employing more feedback bits, the convergence rate can be significantly increased. Similar behavior can be seen if we fix

*K*

_{ t }and vary

*K*

_{ r }.

### Overall performance of adaptive antenna selection and beamforming

*|u*

^{ H }

*H*

_{ ω }

*w|*

^{2}, is a metric indicating the overall performance by associating the adaptive antenna selection with beamforming. In this simulation, the transceiver is dropped at 100 random locations with minimum distance 30 cm in the room independently, and we generate the channel realizations therein using 3-D ray-tracing technique. By running the proposed adaptive algorithms for these drops, Figure 10 shows the averaged effective channel gain against different system SNR. For comparison, the non-adaptive solutions, i.e., the best out of 1,000 random subsets and SVD are also investigated. We have several observations. First, for both beamforming algorithm (Algorithm 2 and SVD), Algorithm 1 outperforms the best out of random 1,000 subsets at the high SNR region, but its performance is inferior at the lower SNR. This is because when the SNR is low, the accuracy and reliability can not be guaranteed in estimating the objective function value and ranking the subsets, which prevents the adaptive algorithms from converging to better solutions. Second, for the same reason, Algorithm 2 is inferior to SVD at lower SNR, but approaching SVD at high SNR by using both antenna selection strategies. It implies that the accuracy in objective function estimation is a key factor that largely affects the convergence and overall performance. From (24), we see that it is feasible to increase

*M*in order to guarantee the estimation accuracy and maintain the overall performance in the low SNR region.

## 6 Conclusions

We have proposed a sequential antenna selection algorithm and an adaptive transmit/receive beamforming algorithm for large-scale MIMO systems in the 60 GHz band. One constraint of the system under consideration is that the receiver can only access a linear combination of the receive antenna outputs, which makes the traditional antenna selection schemes based on the channel matrix not applicable. The proposed antenna selection method uses a bound on the largest singular value of the channel matrix based on the Gerschgorin circle. The method is particularly useful over the 60 GHz channel, which has a strong line-of-sight component, and it employs a discrete stochastic approximation technique to quickly lock onto a near-optimal antenna subset. We have also proposed an adaptive joint transmit and receive beamforming technique based on the stochastic gradient method that makes use of a low-rate feedback channel to inform the transmitter about the selected beam. Simulation results show that both the proposed antenna selection and the adaptive beamforming techniques exhibit fast convergence and near-optimal performance.

## Note

^{1}Note that in obtaining (20) without loss of generality we have absorbed *ρ* into H.

## Declarations

### Acknowledgements

The authors wish to acknowledge financial support from the National Key Specialized Project of China (2009ZX03003-008-02) and the National Science Foundation of China (60902043).

## Authors’ Affiliations

## References

- Doan CH, Emami S, Sobel DA, Niknejad AM, Brodersen RW: Design considerations for 60 GHz CMOS radios.
*IEEE Commun Mag*2004, 42(12):132-140.View ArticleGoogle Scholar - Yong SK, Chong CC: An overview of multigigabit wireless through millimeter wave technology: potentials and technical challenges.
*EURASIP J Wirel Commun Netw*2007, 2007(1):50-50.Google Scholar - Seyedi A: On the capacity of wideband 60 GHz channels with antenna directionality.
*IEEE GLOBECOM 2007, (Washington D.C.)*2007, 4532-4536.Google Scholar - Nsenga J, Van Thillo W, Horlin F, Bourdoux A, Lauwereins R: Comparison of OQPSK and CPM for communications at 60 GHz with a nonideal front end.
*EURASIP J Wirel Commun Netw*2007, 2007(1):51-51.View ArticleGoogle Scholar - IEEE Standard for Information technology - Telecommunications and information exchange between systems - Local and metropolitan area networks - Specific requirements. Part 15.3: Wireless Medium Access Control (MAC) and Physical Layer (PHY) Specifications for High Rate Wireless Personal Area Networks (WPANs) Amendment 2: Millimeter-wave-based Alternative Physical Layer Extension IEEE Std 802.15.3c-2009 (Amendment to IEEE Std 802.15.3-2003) 2009.Google Scholar
- Draft standard for IEEE 802.11ad, IEEE P802.11ad/D0.1 2010.Google Scholar
- Gharavi-Alkhansari M, Gershman AB: Fast antenna subset selection in MIMO systems.
*IEEE Trans Sig Proc*2004, 52(2):339-347. 10.1109/TSP.2003.821099MathSciNetView ArticleGoogle Scholar - Gorokhov A, Gore D, Paulraj A: Receive antenna selection for MIMO flat-fading channels: Theory and algorithms.
*IEEE Trans Inform Theory*2003, 49(10):2687-2696. 10.1109/TIT.2003.817458MathSciNetView ArticleGoogle Scholar - Molisch AF, Win MZ, Choi YS, Winters JH: Capacity of MIMO systems with antenna selection.
*IEEE Trans Wirel Commun*2005, 4(4):1759-1772.View ArticleGoogle Scholar - Berenguer I, Wang X, Krishnamurthy V: Adaptive MIMO antenna selection via discrete stochastic optimization.
*IEEE Trans Sig Proc*2005, 53(11):4315-4329.MathSciNetView ArticleGoogle Scholar - Choi C, Grass E, Kraemer R, Derham T, Roblot S, Cariou L, Christin P: Beamforming training for ieee 802.11ad. doc.: IEEE 802.11-10/0493r1. 2010.Google Scholar
- Maltsev A: Channel models for 60 GHz WLAN systems. doc.: IEEE 802.11-09/0334r8. 2010.Google Scholar
- Horn R, Johnson C:
*Matrix Analysis*. Cambridge University Press, New York, NY; 1985.View ArticleGoogle Scholar - Banister BC, Zeidler JR: A simple gradient sign algorithm for transmit antenna weight adaptation with feedback.
*IEEE Trans Sig Proc*2003, 51(5):1156-1171. 10.1109/TSP.2002.808104MathSciNetView ArticleGoogle Scholar - Wang X, Krishnamurthy V, Wang J: Stochastic gradient algorithms for design of minimum error-rate linear dispersion codes in MIMO wireless systems.
*IEEE Trans Sig Proc*2006, 54(4):1242-1255.View ArticleGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.