 Research
 Open Access
 Published:
Deep learningbased BackCom multiple beamforming for 6G UAV IoT networks
EURASIP Journal on Wireless Communications and Networking volume 2021, Article number: 50 (2021)
Abstract
Combining unmanned aerial vehicles (UAVs) with 6G, Internet of Things (IoT) and other emerging communication technologies could better satisfy various IoT applications and create more innovative services. This paper develops a novel hierarchical 6G IoT network with UAVs in the sky and intelligent reflective surface (IRS) equipped. The system employs backscattering communication (BackCom) to transmit data in a freeride manner. Through beamforming, IRS enhances the energy of the reflectable signal, thereby improving the distance and performance of the BackCom. Simulation results reveal that our approach makes a significant improvement to the performance of the whole system and takes obvious advantage over traditional solutions.
Introduction
In recent years, with the development of 6G and Internet of Things (IoTs) technology [1,2,3,4,5,6,7], the integration of UAVs and cellular system has become a new network development trend. At present, UAVs are showing a vigorous development momentum in many industrial applications. It is expected that UAVs will bring significant economic benefits in many fields such as smart city construction, power and oil pipeline inspections, emergency communications, agriculture, forestry and plant protection, mineral exploration and disaster assessment and have broad application prospects. In a word, 6G UAV network could better satisfy various IoT applications and create more innovative services.
In the UAV cellular converged network, the UAV can act as an aerial base station or access node (AP) to collect information from a large number of IoT nodes distributed in a certain range and realize the connection with the 6G network. UAVs can also be equipped with IoT devices such as cameras and communication equipment to form a UAV IoT network. In many IoT applications, the collaborative work of multiple UAVs will be a common requirement. UAVs can not only communicate with ground cellular base stations, but also form a selforganizing cluster network through remote intelligent control platform. Multiple UAVs equipped with different IoT devices collect IoT data at different locations at the same time and transmit them directly to nearby ground base stations or communicate with ground base stations through a leader UAV with AP capability.
However, UAV flight and IoT devices are highly dependent on power supply, and they are all energy limited, which will seriously affect the promotion and popularization of UAVbased IoT applications. Moreover, under 6G millimeter wave communication, the system requires sophisticated radio frequency transceiver units and complex signal processing to achieve highperformance communication, which requires the system having sufficient energy. The cooperative work of multiple UAVs is also an important challenge to the use of radio spectrum resources. Therefore, it is necessary to carry out innovative research on spectrum use and energysaving technology under the premise of low hardware cost.
If IoT nodes can still perceive the world without being bound by batteries, the passive IoT without batteries is no longer a dream [8]. Backscatter technology brings hope for the IoTs to get rid of the battery shackles. When the node backscatters the incident signal, it can encode and modulate the sensed data by modifying the three parameters of the signal's amplitude, phase and frequency. Therefore, the backscattering system uses the incident electromagnetic waves to load the data that IoT nodes need to transmit in a freeriding manner to the scattered signal, and then transmits it to the receiver.
The backscatter system 'cuts' the powerconsuming RF circuit part and obtains energy from the incident signal. IoT devices can transmit data with extremely low power consumption and cost, and the energy consumption can be reduced to microwatts, which is an important feature of backscatter technology. At present, backscatter is mainly used in radar systems to measure the distance and azimuth of the target by using this reflected wave.
However, a fatal disadvantage of backscattering technology is that the reflective node obtains weak energy from the surrounding environment, resulting in a too short distance between the reflective node and the receiver, and it is difficult for the receiver to distinguish extreme weak reflection signals from the original signal and other noises. Intelligent reflective surface (IRS) [9, 10] uses beamforming technology to directionally gather signal energy, which may become an effective means for backscattering systems to increase the communication distance.
IRS is a surface that reflects incident signals in beam form. It consists of a large number of lowcost, reconfigurable passive components, each of which can be phase modulated independently to reflect the incident signal [11]. By cleverly adjusting the phase shift of all IRS passive devices, the incident signal received by the IRS can be beamformed and reflected to the receiving end, so that threedimensional (3D) passive beamforming can be achieved without any transmission radio. Through beamforming, IRS enhances the energy of the reflectable signal, thereby improving the distance and performance of the backscatter communication system [11,12,13].
IRS has no transmitter but reflects the received signal in the form of a passive array, so there is no transmission power consumption [14]. In the view of an implementation, IRS has low implementation cost, strong adaptability and very convenient deployment. Although IRS has been used in radar systems, remote sensing and satellite communications, it is rarely used in mobile wireless communications.
Combining unmanned aerial vehicles (UAVs) with 6G, Internet of Things (IoT) and other emerging communication technologies could better satisfy various IoT applications and create more innovative services. This paper develops a novel hierarchical 6G IoT network with UAVs in the sky and intelligent reflective surface (IRS) equipped. The system employs backscattering communication (BackCom) to transmit data in a freeride manner. Through beamforming, IRS enhances the energy of the reflectable signal, thereby improving the distance and performance of the BackCom.
The reminder of this paper is organized as follows. Sections 2 and 3 present the general system model and improved MIMO IRS model, respectively. Section 4 develops ALSTMbased trajectory prediction scheme, in a bid to handle the high speed mobility of UAVs. Section 5 gives numerical results to justify the performance of our proposed system, followed by Sect. 6 to conclude the paper.
System model
System architecture
In the face of the future 6G Internet of Things demanding for ultrahigh coverage, expanding communication coverage has become an inevitable trend in the development of smart cities. Due to the high cost and timeconsuming of adding additional base stations, drones are currently one of the most effective solutions to improve communication coverage. There are two main differences between UAV communication and traditional ground wireless communication. First of all, because UAVs usually have a strong line of sight connection with ground nodes, they provide better channel conditions than ground fading channels and can even predict the channel state information (CSI) of different UAVs in 3D positions based on the location information of ground nodes and communication performance. Secondly, UAV has fully controllable maneuverability 3D. The UAV can be used to adjust its height and horizontal position at any time to optimize its communication performance with ground nodes.
As shown in Fig. 1, the use of a cylindrical antenna array can effectively enhance the robustness of the communication link. To ensure the communication efficiency between the UAV and the base station, the base station needs to perform accurate beamforming and capture the specific position of the UAV. Using the traditional cylindrical antenna array DOA estimation algorithm, the current position information of the UAV may not be obtained. Because the UAV moves too fast, accompanied by a certain time delay, the traditional DOA estimation method to estimate the current position of the UAV is not enough to meet the needs of the entire system. Therefore, an angle predictor is needed to predict the position of the UAV at the next moment.
In order to accurately grasp the status information of the UAV, as shown in Fig. 1, a new backscatter communication scenario for drone communication is considered, which consists of a backscatter device (BD), a receiver and a UVA. It is assumed that the drone can freely adjust its heading movement at a fixed height \(H\), and the limited flying time of drone is \(T\). In order to make the problem easier to deal with, we divide the period \(T\) into \(N\) periods of duration \(\delta = \frac{T}{N}\). Therefore, the horizontal position of the recording UAV at time \(n\) is \(q_{n} ,n \in \left\{ {1, \ldots ,N} \right\}\). The horizontal coordinates of the BD and the receiver are fixed at \(w_{b}\) and \(w_{r}\), respectively. In this paper, we consider offline optimization, assuming that the UAV fully knows the location of the receiver and the channel propagation environment (channel parameters) to facilitate joint maneuvering and power control design. In the case of partial/incomplete understanding of location and channel information, this provides key insights and upper performance limits for actual design. However, the problem with these backscatter communication devices is the limited battery life. Therefore, if the battery runs out, most effective wireless communication protocols will not work. In the following, we consider the energy constraints of the UAV backscatter communication system.
Ambient BackCom: a solution to limited batterylife
The demand for high data rate and highfrequency spectrum of UVA communication network, as well as the goal of uninterrupted internet connection from drone to drone and drone to receiver, prompt us to explore the use of batteryfree equipment in emerging wireless communications [1]. Batteryfree devices can use the same bandwidth for continuous communication. The advantage of using passive devices in wireless communication lies in the uninterrupted exchange of information. In addition, achieving reliable communication with limited battery life is an important research area. In order to solve the problem of limited battery life, various measures have been taken, including the use of millimeter waves and energy harvesting in mobile networks, to design highly energyefficient network architectures. We will first discuss power issues in cellular, D2D and the IoTs, and then we will review various attempts to solve energy constraints.
Figure 1 shows the ambient backscatter communication (AmBC) system, the source \(S\) is a UAV, the receiver \(R\) is equipped with an \(M\) antenna, the antenna form is a uniform linear array (ULA), and the single antenna is a passive tag \(G\). \(R\) not only directly receives the signal from \(S\), but also collects the backscattered signal from \(G\). \(G\) first obtains energy from the drone signal. By deliberately changing the load impedance, \(G\) carries its information on the UAV carrier to disperse or absorb the received signal.
Let s(n) be the signals of the UAV source with power Ps and \(B(n) \in \left\{ {0,1} \right\}\) be the modulated signal at the tag which keeps unchanged during N consecutive UAV signals. Define \(\theta_{0} \in \left[ {  \frac{\pi }{2},\frac{\pi }{2}} \right]\) and \(\theta_{1} \in \left[ {  \frac{\pi }{2},\frac{\pi }{2}} \right]\) as the signal azimuth angles or direction of arrivals of paths S − R and G − R, respectively. Denote channel gains of S − R, S − G and G − R as h_{sr}, h_{sg} and h_{gr}, respectively. The attenuation factor inside the tag is denoted as \(\eta \in \left( {0,1} \right]\).
Let \(s(n)\) be the signals from the UAV source with power \(P_{S}\), and \(B(n) \in \{ 0,1\}\) is the modulated signal at \(G\), and \(G\) remains unchanged during \(N\) consecutive UAV signals. Define \(\theta_{0} \in \left[ {  \frac{\pi }{2},\frac{\pi }{2}} \right]\) and \(\theta_{1} \in \left[ {  \frac{\pi }{2},\frac{\pi }{2}} \right]\) as the signal azimuth or direction of arrival of the paths \(S  R\) and \(G  R\), respectively. Denote the channel gains of \(S  R\), \(S  G\) and \(G  R\) as \(h_{sr} ,h_{sg} ,h_{gr}\), respectively. The attenuation factor inside \(G\) is denoted as \(\eta \in (0,1]\). The signal of the reader is
where the environmental backscatter communication channel is
\(H_{sr} = h_{sr} \left[ {1,e^{{j2\pi d\sin \frac{{\theta_{0} }}{\lambda }}} \ldots e^{{j2\pi d\left( {M  1} \right)\sin \frac{{\theta_{0} }}{\lambda }}} } \right]T\),\(H_{gr} = h_{gr} \left[ {1,e^{{j2\pi d\sin \frac{{\theta_{1} }}{\lambda }}} \ldots e^{{j2\pi d\left( {M  1} \right)\sin \frac{{\theta_{1} }}{\lambda }}} } \right]T\), \(w(n) \sim CN\left( {0,\sigma 2I} \right)\) is circularly symmetric complex Gaussian noise vector distributed. \(d\) is the distance between two adjacent antennas, and \(\lambda\) is the wave length of the UAV signal. Compared with the first antenna, the delay distance at the mth antenna is assumed as \(\left( {M  1} \right)d\sin \theta_{i}\), \(i \in \left\{ {0,1} \right\}\). The equivalent channel at the mth antenna is
where h_{0} = h_{sr} and h_{1} = ηh_{sg}h_{gr}.
Remark 1
Since hm is a function of modulating bits at \(G\) and channels \(h_{sr} , h_{sg} , h_{gr}\), it may be different from traditional pointtopoint wireless communication systems. However, when \(G\) modulates the bit '0,’ the effective channel degenerates to a traditional communication channel.
Reconfigurable reflectarray
The reflectarray antenna is a directional antenna that behaves a bit like a parabolic reflector. Instead of relying on the physical shape of the antenna to determine the reflection characteristics, the reflected light is composed of many reflective elements. Since the components on the antenna can provide phase compensation, the antenna reflects the incident wave of the electromagnetic radiation source and finally forms the main beam in a specific direction. In this way, the reflected wave is beamformed, and the reflectarray antenna receives the input signal wave and reflects it to a predetermined spatial direction, as shown in Fig. 2. The reflectarray antenna is composed of an array of reflectarray elements and a power supply. It collimates the radiation wave from the power supply by adjusting the reflection phase of each reflecting light element. In the design of the reflected wave antenna, the key issue is how to change the reflection phase of the reflected wave element.
The reflectarray antenna usually works at a single frequency and has a fixed main beam. A reconfigurable technology, reconfigurable reflector antenna (RRA), is a combination of a parabolic antenna and a phased array antenna. It adopts plane structure and is easy to process. In addition, RRA is more flexible than traditional mirrors and realizes beam scanning through mechanical scanning. The feeding network is simple, the transmission loss is reduced, and the radiation efficiency is greatly improved. The component design is flexible, and different resonant components can be designed to achieve multibeam and beam scanning functions [9].
The topology of the novel slotcoupled digitally reconfigurable reflective array element is shown in Fig. 2a. The DC bias circuit controls the switch of the PIN diode on the phase delay line, thereby changing the propagation path of the electromagnetic wave, and finally achieving a phase difference of 180° degrees for beam scanning.
The reconfigurable reflectarrays can change the delay of each element and direct the reflected light in different directions at different time. These elements are represented by dots in Fig. 2. The elements of the reflective surface are called subatomic or reflective elements. In short, we can think of elements as antennas, who captures the radio signal, keeps it inside for a short time and then sends the signal. The reflectarrays can be regarded as a passive MIMO array.
From a conceptual point of view, the establishment of future networks is indeed an exciting prospect. However, researches on this topic is still in its infancy. The most important thing is to demonstrate practically important use cases of the reconfigurable reflectarrays.
Improved MIMO IRS system model
We considered IRSassisted downlink communication in a singlecell network, where IRS is deployed to assist communication from multiantenna APs to K singleantenna users on a given frequency band. The number of transmitting antennas at the AP and the number of reflecting units at the IRS are denoted by M and N, respectively. IRS is equipped with a controller to coordinate its switching between two working modes, namely the receiving mode for channel estimation and the reflection mode for data transmission [10]. Due to the high path loss, it is assumed that the power of the signal reflected twice or more by the IRS is negligible and therefore can be ignored. In order to characterize the theoretical performance gain brought by IRS, we assume that the AP fully understands the channel state information (CSI) of all involved channels. In addition, all channels use a quasistatic flat fading model. Since IRS is a passive reflection device, we consider time division duplex (TDD) protocol for uplink and downlink transmission and assume channel reciprocity for CSI acquisition in downlink based on uplink training.
We consider IRSassisted downlink communication in a singlecell network, where IRS assists communication from multiantenna APs to \(K\) singleantenna users on a given frequency band. The number of transmitting antennas at the AP and the number of reflecting units at the IRS are denoted as \(M\) and \(N\), respectively. IRS is equipped with a controller to coordinate its switching between two operating modes, namely the receiving mode for channel estimation and the reflection mode for data transmission [10]. Due to the higher path loss, it is assumed that the power of the signal reflected by the IRS two or more times is ignored. The AP is assumed to fully understand all CSI information. In addition, all channels use a quasistatic flat fading model. Since IRS is a passive reflection device, time division duplex (TDD) protocol for uplink and downlink transmission is adapted.
We consider performing linear transmission precoding on the AP. Therefore, the complex baseband transmission signal at the AP can be expressed as \(x_{k} = \sum\nolimits_{j = 1}^{K} {{\mathbf{w}}_{j} } s_{j}\), where \(s_{j}\) is the jth user transmission data and \({\mathbf{w}}_{j} \in {\mathbb{C}}^{M + 1}\) is the corresponding beamforming vector. It is supposed as an independent random variable, whose mean and variance are zero and 1, respectively. The system model of a single user in MIMO IRS is
where the baseband channels from AP to IRS, IRS to user \(k\) and AP to user \(k\) are denoted as \(G \in {\mathbb{C}}^{N \times M}\), \({\mathbf{h}}_{r,k}^{H} \in {\mathbb{C}}^{1 \times N}\) and \({\mathbf{h}}_{d,k}^{H} \in {\mathbb{C}}^{1 \times M}\), respectively, \(k = 1, \ldots ,K\) and \(n_{k} \; \sim CN(0,\sigma_{k}^{2} )\) denotes the additive white Gaussian noise (AWGN).
We denote \({\mathbf{S}} = \left[ {\begin{array}{*{20}c} {s_{1} } \\ \vdots \\ {s_{k} } \\ \vdots \\ {s_{K} } \\ \end{array} } \right]\), \({\mathbf{Y}} = \left[ {\begin{array}{*{20}c} {y_{1} } \\ \vdots \\ {y_{k} } \\ \vdots \\ {y_{K} } \\ \end{array} } \right]\), \({\mathbf{h}}_{r,k} = \left[ {\begin{array}{*{20}c} {h_{r,k,1} } \\ \vdots \\ {h_{r,k,n} } \\ \vdots \\ {h_{r,k,N} } \\ \end{array} } \right]\), \({\mathbf{h}}_{d,k} = \left[ {\begin{array}{*{20}c} {h_{d,k,1} } \\ \vdots \\ {h_{d,k,m} } \\ \vdots \\ {h_{d,k,M} } \\ \end{array} } \right]\), and \({\mathbf{w}}_{k} = \left[ {\begin{array}{*{20}c} {w_{k,1} } \\ \vdots \\ {w_{k,m} } \\ \vdots \\ {w_{k,M} } \\ \end{array} } \right]\). The parameters \({\mathbf{G}}\) and \({{\varvec{\Theta}}}\) are as follows
And \({{\varvec{\Theta}}}\) represent the reflection coefficient matrix of the IRS, where \(\theta_{n} \in [0,2\pi )\) and \(\beta_{n} \in [0,1][0,1]\), respectively, represent the phase shift and amplitude reflection coefficient of the nth element of the IRS. Therefore, the composite APIRSuser channel is modeled as a series connection of three components, namely the APIRS link, the IRS reflection with phase shift, and the IRSuser link.
Accordingly, the system model MIMO IRS is
where \(\mathop {{\mathbf{H}}_{r}^{H} }\limits_{{(K{\text{N}})}} = \left[ {\begin{array}{*{20}c} {{\mathbf{h}}_{r,1}^{H} } \\ \vdots \\ {{\mathbf{h}}_{{r,{\text{k}}}}^{H} } \\ \vdots \\ {{\mathbf{h}}_{{r,{\text{K}}}}^{H} } \\ \end{array} } \right]\), \(\mathop {{\mathbf{H}}_{r}^{H} }\limits_{{({\text{KM}})}} = \left[ {\begin{array}{*{20}c} {{\mathbf{h}}_{d,1}^{H} } \\ \vdots \\ {{\mathbf{h}}_{d,k}^{H} } \\ \vdots \\ {{\mathbf{h}}_{d,K}^{H} } \\ \end{array} } \right]\), \({\mathbf{N}} = \left[ {\begin{array}{*{20}c} {n_{1} } \\ \vdots \\ {n_{k} } \\ \vdots \\ {n_{K} } \\ \end{array} } \right]\), and \(\mathop {\mathbf{W}}\limits_{(MK)} = \left[ {{\mathbf{w}}_{1} \ldots {\mathbf{w}}_{k} \ldots {\mathbf{w}}_{K} } \right] = \left[ {\begin{array}{*{20}l} {w_{1,1} } \hfill & \ldots \hfill & {w_{K,1} } \hfill \\ \vdots \hfill & \ddots \hfill & \vdots \hfill \\ {w_{1,M} } \hfill & \cdots \hfill & {w_{K,M} } \hfill \\ \end{array} } \right]\).
The \({\text{SINR}}_{k}\) of the single user and the \({\text{SINR}}\) of the system are, respectively
Please refer to the ANNEX chapter for the detailed formula description of the system model.
Method of trajectory prediction
Detailed explanation of ALSTM
The structure of ALSTM model is similar to that of encoderdecoder model. ALSTM model composed of LSTM model and attention mechanism is widely used in time series prediction [15, 16], including machine translation, document extraction, question and answer system, etc.
LSTM, as a more complex recurrent neural network (RNN), is expert in time information processing and solves the problems such as the longterm dependence, gradient disappearance and gradient explosion in backpropagation through time (BPTT) through the truncated gradient and regularization of guided information flow [17]. Figure 3a shows the LSTM cell structure at time t. As shown in Fig. 3a, x_{t} stands for the input vector, c_{t} represents the cell, and h_{t} represents the hidden state at the current time. There are three gated units in the figure, including forget gate f, input gate i and output gate O. The forget gate controls that cell state information is to forget or pass useful information down. The intersection of new information and cell state are controlled by input gate. How much the current cell state will be treated as an output value will be judged by output gate.
Specifically, the mapping relationship between an input vector sequence x = (x_{1},x_{2},…,x_{T}) to an output sequence h = (h_{1},h_{2},…,h_{T}) is precisely specified by:
where f_{t}, i_{t} o and c_{t} represent the forget gate, input gate, output gate and cell state vectors, respectively, at the current time, and σ stands for the logistic function mapping between 0 and 1. W_{*} and b_{*} represent the weight matrixes and bias vectors, respectively [18, 19].
The ALSTM model [20] mainly deals with the problem of 'SeqtoSeq.' We have drawn the details of ALSTM model shown in Fig. 3b. The expressions of encoding, storage and decoding in graphs are sequence data x_{j}, relational vector C_{i} and sequence output y_{i}, respectively. As shown in Fig. 3b, x_{j} and h_{j} represent the input sequence data and hidden state in the encoder. y_{i} and S_{i} represent the output sequence data and hidden state in the decoder. e_{ij} represents the correlation between encoding hidden state information and decoding state. a_{ij} explains the weight vector of e_{ij} and the higher the value of a_{ij}, the greater the influence of x_{j} to y_{i}.
where f and g explain the activation function, x_{j} represents the input vector, j = 1,…,T_{x}, and y_{i} is the output data, i = 1,…,T_{y}.
In terms of automatic information generation, the part of encoder is adopted LSTM. As shown in Fig. 3b, each x_{j} represents the input vector of each time node. As time goes by, h_{j} of the LSTM is updated with the gradual input of x_{j}. We also defined the decoder as an LSTM that outputs sequence data y_{i}. The relational vector C_{i}, which is calculate through a series of function transformations of encoding input vector x_{j} and output sequence hidden state S_{i−1}, indicates the only correlation between the encoder and decoder. According to assigning weight to h_{j} by the function of softmax, vector C_{i} shows different concerns about h_{j}. The relational vector C_{i} including the total useful information of input sequence vector x_{j} has guiding significance for the output of the decoder. The results show that it is necessary to obtain useful sequence information in the training to effectually increase the precision of decoding prediction [21].
Through the above description, it mainly introduces LSTM and ALSTM that is the combination of LSTM and attention mechanism. Meanwhile, in the training process of ALSTM model, the network parameters can be continuously updated by loss function, so as to realize the prediction of time series.
ALSTM location prediction model
In Sect. 3, it will pay attention to ALSTM model prediction process structure in Fig. 3b. Since UAV moves fast and is susceptible to some external factors, it is necessary to obtain location of UAV by using the ALSTM location model at next time. First, according to the spatial spectrum of URA’s DoA, the current UAV communication location information can be obtained, which includes Azimuth information θ and φ, respectively. Afterwards, we will adopt preprocessing system to implement angle data preprocessing to get the redefinition angle information including θ^{*} and φ^{*}. Ultimately, the redefinition information as the input layer will be mapped to next time data through the ALSTM model. As the epoch of training increases, best ALSTM model parameters will be kept to predict pitch and horizontal angles.
Acquisition of ALSTM training samples
For ALSTM model, the acquisition of the training samples is crucial. Consequently, we also have researched DOA estimation. After the discussion above, the covariance matrix of x(t) can be defined as:
where R_{x} expresses the source covariance matrix, σ^{2} is the common variance and I denotes an MN*MN identity matrix.
In addition, the standard subspace method can be applied to convert the covariance matrix of x(t) to
where λ_{1} ≥ λ_{2} ≥ ··· ≥ λ_{P} > λ_{P+1} = ··· = λ_{MN} are the eigenvalue of R_{x} and e_{1},e_{2},…,e_{MN} are the associated eigenvectors of them. E_{s} represents the eigenvectors of P largest eigenvalue, and E_{n} stands for eigenvectors of MNP smallest eigenvalue. In addition, the number of sources P will be evaluated using the principle of minimum description length.
Further, the signal subspace of a(θ_{1}, φ_{1}),…, a(θ_{p}, φ_{p}) is the same as the E_{s} signal subspace and orthogonal to the noise subspace E_{n}. Thus, we have
where · explains the Kronecker product. So, the DoA estimate of URA will be obtained, and the spatial spectrum can be defined due to the multiple signal classification to define [22].
According to Eq. (23), two maximum value of spectrum search can be obtained, which corresponds to signal source incidence angles included pitch angle and horizontal angle of, respectively.
Meanwhile, the ALSTM location predictive model which consists of parameters and structure in this paper is considered as a mapping function. Once URA receive the signal of BS, BS can get the feedback of URA about the relevant received signal vector x(t). Afterwards, these two peak values of the spatial spectrum achieve the current location pitch angle and horizontal angle of UAV, which means that we can continuously obtain 2D DoA angle information. Consequently, it is possible to continuously acquire the received signals during the air communication of UAV, thereby continuously acquiring the 2D arrival angles at different moments, which helps us to obtain training samples.
ALSTM prediction system
For further enhancing the feasibility and stability of ALSTM prediction effect, a series of complex preprocessing is introduced for the acquired dataset. Data cleaning aims at detecting errors and inconsistencies in data, eliminating or correcting them to improve data quality, so that the uniformity of location information data can be achieved. Therefore, we first clean the angle data in the preprocessing system (Fig. 4).
Besides, the constancy of training set directly affects the error of entire training result of proposed ALSTM location predictive model. Therefore, augmented Dickey–Fuller test (ADF) is carried out after data cleaning. The critical value of ADF statistics and ADF statistics is full of guiding significance for stability of the system. Suppose that the ADF result is smaller than threshold level, the assumption that there is a unit root is rejected. Meanwhile, raw dataset shows stable. Therefore, it is assumed that there exits the unit root in the zero hypothesis of ADF, and the criterion is that test statistic value preferably is no more than 1%; the invalid hypothesis can be significantly negated, thereby determining the dataset stability. The detection result of ADF is shown in Table 1. As we can see from the table, the test statistic value is far less than 1% of the critical statistic, which is obviously less than 5% and 10% of the critical value. At the same time, the probability value (P value) of the detection is close to zero. Hence, we can conclude that the obtained angel data is stationary.
Since the position of UAV is various at different moments, some specific angle data at different moments will be generated. Therefore, data integration method is used to gather pitch angle θ and the horizontal angle φ of different time nodes, thus constituting the training angle database. Assuming that the pitch angle is regarded as X axis and the horizontal angle as Y axis, a coordinate system about angle is formed. In this way, the information from two unrelated perspectives can be transformed and given new meaning. In addition, data normalization can help us to solve the impact of single attribute in multiattribute sample data and ensure that the speed and accuracy of finding the optimal solution are accelerated when the gradient descends. Finally, data reconstruction is realized to adapt the data to the input data structure of ALSTM model.
For further enhancing the performance of ALSTM location predictive parameter model, we adopt the sliding window to guarantee the realtime prediction of the ALSTM model. If the length of sliding window is n, n + 1th data will be automatically predicted by ALSTM model. When UAV and BS communicate continuously, we will get the latest UAV azimuth information at every moment. Over time, we also import the latest data into the data structure to be predicted, while automatically deleting outdated data.
After the above processing, the training data of a specific structure are input into the structure of ALSTM. After training, the structure and parameters of the ALSTM model can guarantee the accuracy of prediction, so as to achieve azimuth prediction. By predicting the location of the next moment of the UAV, helping the BS to achieve accurate beamforming can improve the communication quality within the coverage of UAV. At the same time, we will also use the following experiments to express the reliability and stability of ALSTM the accuracy of the prediction.
The results verify that ALSTM model is suitable for trajectory prediction and performs well, which shows that ALSTM model pays more attention to the trajectory angle of UAV.
Results and discussion
The simulation scenario is shown in Fig. 1: Black dots represent base stations (BS); the red dot represents massive unmanned aerial vehicle (Muav); the blue dot represents small IoT unmanned aerial vehicle (Iuav); the hexagon represents the service area covered by the base station, which is composed of three sectors, and each sector corresponds to a phased array antenna. In this part, we discussed two types of links in the simulation: the first type of link is Iuav → Muav; the second type of link is BS → Muav.
For Iuav → Muav, Iuav is an IoT terminal. In order to ensure the power continuity of Iuav, Iuav uses the backscattering mode to transmit to Muav. Iuav environmental electromagnetic wave source comes from BS. The BS points to Iuav through beamforming to ensure that Iuav can receive enough energy for reflection. The direction of beamforming needs to be predicted with LSTM to ensure the accuracy of BS beamforming. Iuav is equipped with an intelligent reflective surface (IRS), which only reflects signals and does not emit signals. After LSTM prediction and calibration, the IRS beamforming points to Muav to ensure the strength of the backscattering link. For BS → Muav, Muav is the main unmanned aerial vehicle (UAV), and it is equipped with a phased array antenna, which has strong capabilities and belongs to the air center node. The BS → Muav link is an important backhaul fronthaul path and is the gateway between the ground network and the air network. Normally, only ray1 exists on the BS → Muav link. For this, the BS needs beamforming to point to Muav. Prior to this, the position of Muav was also predicted with LSTM to ensure the accuracy of beamforming. However, due to the existence of the first type of link (Iuav → Muav), Muav not only has the receiving path of ray1, but also has the receiving path of ray2 based on artificial reflection. Muav's signal receiving path is shown in Fig. 1. Therefore, full use of ray2 energy (precoding technology precoding) can increase the strength of the BS → Muav link.
The two links are compared, as shown in Fig. 5. The first type of link, Iuav → Muav, uses backscattering technology, IRS and LSTM to save resources (including energy, spectrum, and computing power) on Iuav and complete the transmission of the IoT. The second type of link, BS → Muav, uses beamforming, precoding and LSTM to increase the strength of the link. LSTM has the function of predicting the beam forming direction and increasing the antenna gain, as shown in Fig. 6.
In the simulation, the main configuration parameters of the system are shown in Table 2. We set: the number of base stations is 7, the radius of the cell is 450 m, the angle of each sector is 120°, the number of Muav in each sector is 1, the number of Iuav in each sector is 3, and the number of LSTM trainings is 250.
The simulation results are shown in Fig. 7. Link2 capacity (ray1): Under the condition of only ray1, the reception of link2 is CDF (lower), and SINR corresponds to C (upper). The capacity of Link2 (ray1 + ray2): The presence of ray1 and ray2 at the same time makes the received energy rise, so the curve is higher than the capacity of Link2 (ray1). Link2 capacity (ray1 + ray2 + backscatter): The addition of backscatter essentially adds noise to ray2 of link2, which will cause some performance loss. Therefore, this curve is between the capacity curve of Link2 (ray1) and the capacity curve of Link2 (ray1 + ray2). Link1 capacity (backscatter): The capacity generated by backscatter is essentially stolen from the capacity generated by link2ray2. The dotted line part: The imperfect LSTM prediction makes the beamforming of the entire system inaccurate and causes partial loss of antenna gain. Therefore, the dotted line will be a little worse than the solid line. Here, multiple sets of dotted lines can be added to correspond to different RNN algorithms and parameter configurations.
Conclusions
Driven by the market, UAV industries start pushing the digital transformation of their products and services. We are entering the era of ubiquitous IoT with all kinds of things equipped with computing and communication capabilities. This paper, in particular, develops a novel hierarchical 6G IoT network of UAVs equipped with BackCom IRS. We focus on deep learningbased BackCom multiple beamforming, in a bid to improve the energy of the reflective signal. Simulation results justify that our approach can not only save the precious spectrum but also promote the concept of green communication by cutting off the energy consumption.
Availability of data and materials
The author keeps the analysis and simulation datasets, but the datasets are not public.
Abbreviations
 IoT:

Internet of Things
 UAV:

Unmanned aerial vehicle
 IRS:

Intelligent reflective surface
 AP:

Access node
 BD:

Backscatter device
 BackCom:

Backscattering communication
 3 D:

Threedimensional
 CSI:

Channel state information
 ULA:

Uniform linear array
 RRA:

Reconfigurable reflector antenna
 BPTT:

Backpropagation through time
 ADF:

Augmented Dickey–Fuller
 TDD:

Time division duplex
 BS:

Base station
References
 1.
S. Sun, M. Kadoch, L. Gong, B. Rong, Integrating network function virtualization with SDR and SDN for 4G/5G networks. IEEE Netw. 29(3), 54–59 (2015)
 2.
N. Zhang, N. Cheng, A.T. Gamage, K. Zhang, J.W. Mark, X. Shen, Cloud assisted HetNets toward 5G wireless networks. IEEE Commun. Mag. 53(6), 59–65 (2015)
 3.
Y. Wu, B. Rong, K. Salehian, G. Gagnon, Cloud transmission: a new spectrumreuse friendly digital terrestrial broadcasting transmission system. IEEE Trans. Broadcast. 58(3), 329–337 (2012)
 4.
B. Rong, Y. Qian, K. Lu, H. Chen, M. Guizani, Call admission control optimization in WiMAX networks. IEEE Trans. Veh. Technol. 57(4), 2509–2522 (2008)
 5.
N. Chen, B. Rong, X. Zhang, M. Kadoch, Scalable and flexible massive MIMO precoding for 5G HCRAN. IEEE Wirel. Commun. 24(1), 46–52 (2017)
 6.
B. Rong, Y. Qian, K. Lu, Integrated downlink resource management for multiservice WiMAX networks. IEEE Trans. Mob. Comput. 6(6), 621–632 (2007)
 7.
S. Sun, L. Gong, B. Rong, K. Lu, An intelligent SDN framework for 5G heterogeneous networks. IEEE Commun. Mag. 53(11), 142–147 (2015)
 8.
Networking Index, C.V., Cisco visual networking index: forecast and methodology, 2016–2021; white paper; Cisco Systems, Inc.: San Jose, CA, USA, 2017
 9.
S.V. Hum, J. PerruisseauCarrier, Reconfigurable reflectarrays and array lenses for dynamic antenna beam control: a review. IEEE Trans. Antennas Propag. 62(1), 183–198 (2014)
 10.
L. Subrt, P. Pechac, Intelligent walls as autonomous parts of smart indoor environments. IET Commun. 6(8), 1004–1010 (2012)
 11.
C. Boyer, S. Roy, Backscatter communication and RFID: coding, energy, and MIMO analysis. IEEE Trans. Commun. 62(3), 770–785 (2014)
 12.
F. Sohrabi, W. Yu, Hybrid digital and analog beamforming design for largescale antenna arrays. IEEE J. Sel. Top. Signal Process. 10(3), 501–513 (2016)
 13.
C. Huang, A. Zappone, M. Debbah, C. Yuen, Achievable rate maximization by passive intelligent mirrors, in Proceedings of IEEE ICASSP (2018)
 14.
T.J. Cui, M.Q. Qi, X. Wan, J. Zhao, Q. Cheng, Coding metamaterials, digital metamaterials and programmable metamaterials. Light Sci. Appl. 3(10), e218 (2014)
 15.
Y. Heryadi, H.L.H.S. Warnars, Learning temporal representation of transaction amount for fraudulent transaction recognition using CNN, Stacked LSTM, and CNNLSTM, in 2017 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom), Phuket (2017), pp. 84–89. https://doi.org/10.1109/CYBERNETICSCOM.2017.8311689
 16.
C. Wang, D. Han, Q. Liu, S. Luo, A deep learning approach for credit scoring of peertopeer lending using attention mechanism LSTM. IEEE Access 7, 2161–2168 (2019). https://doi.org/10.1109/ACCESS.2018.2887138
 17.
Z. Wang, Y. Lou, Hydrological time series forecast model based on wavelet denoising and ARIMALSTM, in 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China (2019), pp. 1697–1701. https://doi.org/10.1109/ITNEC.2019.8729441
 18.
C. Guo, R. Lin, Analysis of weather and exercise steps based on LSTM neural network, in 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China (2019), pp. 474–478. https://doi.org/10.1109/ITNEC.2019.8729294
 19.
J. Li, L. Lu, C. Liu, Y. Gong, Exploring layer trajectory LSTM with depth processing units and attention, in 2018 IEEE Spoken Language Technology Workshop (SLT), Athens, Greece (2018), pp. 456–462. https://doi.org/10.1109/SLT.2018.8639637
 20.
X. Hu, X. Wei, Y. Gao, W. Zhuang, M. Chen, H. Lv, An attentionmechanismbased traffic flow prediction scheme for smart city, in 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco (2019), pp. 1822–1827. https://doi.org/10.1109/IWCMC.2019.8766639
 21.
Y. Yu, G. Liu, H. Yan, H. Li, H. Guan, Attentionbased biLSTM model for anomalous HTTP traffic detection, in 2018 15th International Conference on Service Systems and Service Management (ICSSSM), Hangzhou (2018), pp. 1–6. https://doi.org/10.1109/ICSSSM.2018.8465034
 22.
H. Zhao, M. Cai, H. Liu, Twodimensional DOA estimation with reduceddimension MUSIC algorithm, in 2017 International Applied Computational Electromagnetics Society Symposium (ACES), Suzhou (2017), pp. 1–2
Funding
This work was supported by National Natural Science Foundation of China (61971053).
Author information
Affiliations
Contributions
In this paper, Qi Fei conceived, designed and wrote the study. All authors read and revised the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Qi, F., Li, W., Yu, P. et al. Deep learningbased BackCom multiple beamforming for 6G UAV IoT networks. J Wireless Com Network 2021, 50 (2021). https://doi.org/10.1186/s13638021019324
Received:
Accepted:
Published:
Keywords
 UAV
 IoT
 6G
 Deep learning