Abstract
For high data rate wireless communication systems, developing an efficientchannel estimation approach is extremely vital for channel detection and signalrecovery. With the trend of highmobility wireless communications betweenvehicles and vehiclestoinfrastructure (V2I), V2I communications poseadditional challenges to obtaining realtime channel measurements. Deeplearning (DL) techniques, in this context, offer learning ability andoptimization capability that can approximate many kinds of functions. In thispaper, we develop a DLbased channel prediction method to estimate channelresponses for V2I communications. We have demonstrated how fast neural networkscan learn V2I channel properties and the changing trend. The network is trainedwith a series of channel responses and known pilots, which then speculates thenext channel response based on the acquired knowledge. The predicted channel isthen used to evaluate the system performance.
Quick Read (beta)
Learning the Wireless V2I Channels Using Deep Neural Networks
Abstract
For high data rate wireless communication systems, developing an efficient channel estimation approach is extremely vital for channel detection and signal recovery. With the trend of highmobility wireless communications between vehicles and vehiclestoinfrastructure (V2I), V2I communications pose additional challenges to obtaining realtime channel measurements. Deep learning (DL) techniques, in this context, offer learning ability and optimization capability that can approximate many kinds of functions. In this paper, we develop a DLbased channel prediction method to estimate channel responses for V2I communications. We have demonstrated how fast neural networks can learn V2I channel properties and the changing trend. The network is trained with a series of channel responses and known pilots, which then speculates the next channel response based on the acquired knowledge. The predicted channel is then used to evaluate the system performance.
I Introduction
Since the first generation (1G) wireless communication network entered the market in the 1980s, the world has been dramatically changed by the development of mobile communication technologies. It has gone through several evolutions in the past few decades, from 1G to 5G, and due to the huge potential demand all over the world, the technology will continue to upgrade rapidly [1]. While 2G, 3G and 4G were about connecting people and parts of things, 5G will connect everything and it can provide unlimited access to anywhere, anytime, anybody and anything [2].
In recent years, orthogonal frequency division multiplexing (OFDM) has become a popular choice for fastspeed and highquality communication systems. However, channel modelling and channel estimation are two major challenges affecting the performance of OFDM systems. In order to estimate the channel response, pilotbased channel estimation is commonly adopted, in which a training sequence composed of known data symbols (pilots) is transmitted, and the channel parameters are initially estimated using the received pilot signals [3]. Minimum meansquare error (MMSE) and least squares (LS) are two traditional estimation approaches.
VehicletoInfrastructure (V2I) communication is about the data transmission between vehicles and infrastructure on the roads. V2I communication system is normally wireless and twoway: infrastructure like traffic lights can provide information to cars and viceversa. This communication system can provide quantitative and realtime information that can be used for safety, mobility, and environmental benefits. When V2I communications are widely used, revolution of roadways will occur all over the world. For example, selfdriving vehicles will become reality and it is vital to make successful and safe autonomous cars. To make this huge change happen, the basic theory and techniques must now be developed urgently.
To enhance the performance of communication systems and to solve signal processing and communications problems, deep learning (DL) has recently drawn utmost popularity [4, 5, 6, 7]. A deep neural network (DNN) is an algorithm with learning ability and optimization capability that can approximate many kinds of functions. Particularly, problems without any precise numerical model can now be solved using DL methods [8]. In order to leverage the advantages of using a large group of data for communication performance improvement, several machine learning methods including supervised, unsupervised and reinforcement learning have been proposed based on the traditional approaches. The machine learning can be useful in analyzing communication environment variance, making decisions autonomously, transmission routing, network security, and system resource management [6, 8].
Recently, some works have concentrated in the area of channel estimation using DL methods [9, 10, 11, 12, 13]. The authors in [10, 11] proposed the back propagation (BP) learning algorithm to build a multilayered perceptron (MLP) neural network as an estimator for OFDM communication channels. In [11], a method of implicit channel state information (CSI) estimation and direct recovery of transmission symbols based on deep learning has been proposed. The authors in [12] proposed an approximate information passing network for millimetrewave massive multipleinput and multipleoutput (MIMO) systems based on learned denoising which is a deep learning model that can analyze channel structure and estimate channel from a big training database. In [13], the authors developed a CSI feedback model by spreading a new learned CSI perception and restoration architecture. By learning the spatial features directly and combining the correlation of samples in timevarying MIMO channels, the system greatly improves the quality of recovery and the tradeoff between the compression ratio (CR) and the recovery quality [13].
All of the above works [9, 10, 11, 12, 13] have developed DLbased channel estimation techniques in traditional quasistationery wireless communication systems. However, with the trend of highmobility wireless communications between vehicles and vehiclestoinfrastructure (V2I), many rising problems appear because of the high variances in communication environments, which are fundamentally different from traditional wireless communication problems. For the communication with large Dopplershift and fastvarying channels, the above methods may not work because in fast timevarying fading environment channel response is difficult to obtain in real time and the outdated CSI exhibits a significant negative impact on the performance. The critical point to fit highmobility networks is to develop future informatized and intelligent vehicles combined with machine learning to benefit the communication networks [14].
In this paper, we address the adverse effects of imperfect CSI on V2I communication systems by developing a DLbased channel prediction approach. Two major challenges facing V2I channel estimation addressed in this paper are i) the way to find CSI in realtime with the knowledge of vehicle information including position and velocity, ii) the technique to estimate the fast timevariant channel properly and recover the information with minimum error. The major contributions of this paper are as follows:

•
Firstly, an OFDM modulation based V2I channel estimation technique is introduced as a baseline to verify effectiveness of the DLbased approach.

•
Secondly, DLbased channel prediction method is proposed to deal with the channel estimation problem in V2I communication system in highmobility environments.

•
Thirdly, the performance of the proposed DLbased approach is compared with other common algorithms through extensive numerical simulations.
II System Model
We consider a highmobility vehicular communication system (cf. Fig. 1) in which a vehicle intends to communicate with a roadside infrastructure, which can be a traditional cellular base station (BS), traffic lamppost, building or any other fixed structure. At both BS and the vehicle, only one antenna for transmitting and receiving is considered. For simplification, only 2D horizontal plane is considered where the vehicle travels at a constant speed along a straight road. Hence, at the same position the channel is highly similar even at different time slots, and it can be considered as position related channel estimation which means that change of channel is related to the position of vehicle. The channel power gain between the vehicle at position $n$ and the BS is defined as [15]
$h(n)=G(n)\beta (n)Ad{(n)}^{\gamma},$  (1) 
where $G(n)$ is assumed to be exponentially distributed fast fading power gain, $\beta (n)$ is the log normal shadow fading component, $A$ is the constant pathloss, $d(n)$ is the distance between the vehicle and the BS, and $\gamma $ is the pathloss exponent. For a frequencyselective multipath fading channel, the channel impulse response (CIR) is given by
$h(n)$  $={\displaystyle \sum _{l=0}^{L1}}{C}_{l}{e}^{j{\varphi}_{l}(n)}\delta (n{\tau}_{l})$  (2)  
$={\alpha}_{l}\delta (n{\tau}_{l}),$  (3) 
where $L$ is the total number of multipath component and ${\varphi}_{l}$, ${\tau}_{l}$, and ${\alpha}_{l}={\sum}_{l=0}^{L1}{C}_{l}{e}^{j{\varphi}_{l}(n)}$ are phase, time delay, and the timedependent complex path gain of the $l$th multipath component. For the V2I channel, the timevariant phase associated with the $l$th path is defined as [16, Chapter 2]
${\varphi}_{l}(n)={\varphi}_{l}2\pi c{\tau}_{l}/{\lambda}_{\mathrm{c}}+2\pi {f}_{\mathrm{D},l}n,$  (4) 
where $c={f}_{c}{\lambda}_{c}$ is the speed of light, ${\lambda}_{c}$ is the wavelength of the arriving plane wave (carrier signal), ${\tau}_{l}={d}_{l}/c$, ${d}_{l}$ is the length of the $l$th path, and ${f}_{\mathrm{D},l}$ is the maximum Doppler shift of the $l$th propagation path. For OFDM transmission, the $m$th modulated time domain symbols can be expressed as
$$\begin{array}{c}{x}_{m}(n)=\frac{1}{\sqrt{N}}\sum _{l=1}^{N1},{X}_{m}(l){e}^{j2\pi nl/N},n=0,\mathrm{\cdots},N1,\hfill \end{array}$$  (5) 
where ${X}_{m}(l)$ is the $m$th modulated symbol, $n$ is the subcarrier index and $N$ is the total number of subcarriers. Cyclic prefix is then appended to the symbols to prevent intersymbol interference (ISI). The signals are then transmitted through the wireless channel and the signal received at the BS can be expressed as
$${y}_{m}(n)={h}_{m}(n)\otimes {x}_{m}(n)+{w}_{m}(n),$$  (6) 
where $\otimes $ indicates the convolution operation and ${w}_{m}(n)$ is the additive white Gaussian noise for the $m$th subcarrier. At the receiver, the cyclic prefix is removed first from the received signal, followed by parallel conversion to frequency domain by applying fast Fourier transform:
$${Y}_{m}(k)=\frac{1}{N}\sum _{n=1}^{N1}{y}_{m}(n){e}^{j2\pi kn/N},k=0,\mathrm{\cdots},N1.$$  (7) 
Thus in the frequency domain, the inputoutput relationship can be expressed as
$Y(k)=H(k)X(k)+W(k),k=0,\mathrm{\cdots},N1.$  (8) 
Consequently, the system can be described as a set of $N$ independent parallel Gaussian channels:
${y}_{k}={h}_{k}{x}_{k}+{w}_{k},k=0,\mathrm{\cdots},N1.$  (9) 
For convenience, we rewrite (10) using matrix notations as [3]
$\mathbf{y}=\mathrm{\mathbf{X}\mathbf{F}\mathbf{g}}+\mathbf{w},$  (10) 
where $\mathbf{X}$ is a $N\times N$ diagonal matrix containing $\mathbf{x}\triangleq {\left[{x}_{m}(0){x}_{m}(1)\mathrm{\cdots}{x}_{m}(N1)\right]}^{T}$ as the main diagonal, $\mathbf{g}\triangleq {\left[{g}_{m}(0){g}_{m}(1)\mathrm{\cdots}{g}_{m}(N1)\right]}^{T}$, $\mathbf{w}\triangleq {\left[{w}_{0}{w}_{1}\mathrm{\cdots}{w}_{N1}\right]}^{T}$ and
$\mathbf{F}\triangleq \left[\begin{array}{ccc}\hfill {F}_{0,0}\hfill & \hfill \mathrm{\cdots}\hfill & \hfill {F}_{0,N1}\hfill \\ \hfill \mathrm{\vdots}\hfill & \hfill \mathrm{\ddots}\hfill & \hfill \mathrm{\vdots}\hfill \\ \hfill {F}_{N1,0}\hfill & \hfill \mathrm{\cdots}\hfill & \hfill {F}_{N1,N1}\hfill \end{array}\right],$  (11) 
is the DFT matrix with ${F}_{n,k}=\frac{1}{\sqrt{N}}{e}^{j2\pi nk/N}$.
The objective of this study is to estimate the CIR ${H}_{m}(k)$ from the observation of ${Y}_{m}(k)$ with known pilot signal ${X}_{m}(k)$. In the following, we will first discuss the conventional channel estimation approaches, and then develop a deep learning based channel estimation algorithm for the V2I system (7).
III Conventional Channel Estimation Technique
In conventional communication systems, both blind and nonblind channel estimation techniques have been considered for estimating the CIR. Popular channel estimation algorithms include maximum likelihood (ML), least mean square (LMS), minimum mean square error (MMSE) and least square (LS) methods. These methods have been studied thoroughly to estimate CSI within a certain time and frequency range. Although the LS algorithm has a worse performance in timevarying environments compared to the other approaches, its implementation is very simple. On the other hand, the MMSE algorithm can be wellbehaved in all general fading channels with both frequency and time selectivity.
IIIA The LS Algorithm
The LS algorithm tries to minimize the squared error (i.e., Euclidian distance) between the transmitted and the received signals which is expressed as
$$\begin{array}{c}{\parallel \mathbf{y}\mathrm{\mathbf{X}\mathbf{F}\mathbf{g}}\parallel}^{2}={\left(\mathbf{y}\mathrm{\mathbf{X}\mathbf{F}\mathbf{g}}\right)}^{H}\left(\mathbf{y}\mathrm{\mathbf{X}\mathbf{F}\mathbf{g}}\right).\hfill \end{array}$$  (12) 
Taking the derivative of (12) and equating the derivative to $0$, the LS estimate of the channel frequency response is given by
${\widehat{\mathbf{h}}}_{\mathrm{LS}}=\mathbf{F}{\left({\mathbf{F}}^{H}{\mathbf{X}}^{H}\mathrm{\mathbf{X}\mathbf{F}}\right)}^{1}{\mathbf{F}}^{H}{\mathbf{X}}^{H}\mathbf{y}.$  (13) 
Since the optimal training sequence is orthonormal [17, 18], (13) eventually reduces to
${\widehat{\mathbf{h}}}_{\mathrm{LS}}={\mathbf{X}}^{1}\mathbf{y}.$  (14) 
IIIB The MMSE Algorithm
The MMSE channel estimator takes the effect of channel noise into consideration. Meanwhile, the MMSE estimate of $\mathbf{g}$ is given by
${\widehat{\mathbf{g}}}_{\mathrm{MMSE}}={\mathbf{R}}_{gy}{\mathbf{R}}_{yy}^{1}\mathbf{y}$  (15) 
where
${\mathbf{R}}_{gy}$  $=\mathrm{E}\left\{{\mathrm{\mathbf{g}\mathbf{y}}}^{H}\right\}={\mathbf{R}}_{gg}{\mathbf{F}}^{H}{\mathbf{X}}^{H},$  (16)  
${\mathbf{R}}_{yy}$  $=\mathrm{E}\left\{{\mathrm{\mathbf{y}\mathbf{y}}}^{H}\right\}={\mathrm{\mathbf{X}\mathbf{F}\mathbf{R}}}_{gg}{\mathbf{F}}^{H}{\mathbf{X}}^{H}+{\sigma}_{w}^{2}{\mathbf{I}}_{N}$  (17) 
are the corresponding covariance matrices and ${\sigma}_{w}^{2}$ is the noise variance $\mathrm{E}\left\{{\left{w}_{k}\right}^{2}\right\}$. Since the columns of $\mathbf{F}$ in (11) are orthonormal, the frequencydomain MMSE estimate of $\mathbf{h}$ is given by [3]
$$\begin{array}{c}{\widehat{\mathbf{h}}}_{\mathrm{MMSE}}=\mathbf{F}{\widehat{\mathbf{g}}}_{\mathrm{MMSE}}={\mathrm{\mathbf{F}\mathbf{R}}}_{gg}{\left[{\left({\mathbf{F}}^{H}{\mathbf{X}}^{H}\mathrm{\mathbf{X}\mathbf{F}}\right)}^{1}{\sigma}_{w}^{2}+{\mathbf{R}}_{gg}\right]}^{1}\hfill \\ \hfill \times {\left({\mathbf{F}}^{H}{\mathbf{X}}^{H}\mathrm{\mathbf{X}\mathbf{F}}\right)}^{1}{\mathbf{F}}^{H}{\mathbf{X}}^{H}\mathbf{y}.\end{array}$$  (19) 
IV Proposed Machine Learning Approach
It has been shown in [11] that a multilayer neural network (MLNN) can provide a very good approximation of channels. Hence in the following, we propose a deep learning based V2I channel estimation algorithm using back propagation technique.
IVA Defining the Neural Network
As shown in Fig. 2, the neural network contains two inputs and two outputs and ${n}_{\mathrm{h}}$ hidden nodes. The inputs to the network are the received signals, and the estimated channel response parameters are the outputs. The two inputs and outputs of the MLNN are connected to the real and imaginary components of the corresponding complex numbers, since neural networks work only with real numbers.
For the machine learning based channel estimation scheme, we consider a supervised learning approach which estimates the CIRs using a fully connected neural network. Note that we do not consider any bias inputs to the neurons at any layer. We apply sigmoid function for the activation of the hidden neurons, while the activation function at the output layer is linear. The output of each neuron is generated by computing the weighted sum of the inputs coming into that node and then applying the activation function.
Let us now assume that ${w}_{i,j}^{(1)}$ is the $i$th input to $j$th hidden neuron weight and ${w}_{j,k}^{(2)}$ is the $j$th hidden node to the $k$th output node weight. The hidden layer activation function is the sigmoid function
$\mathrm{\Phi}(z)={\displaystyle \frac{1}{1+{e}^{z}}}={\displaystyle \frac{{e}^{z}}{{e}^{z}+1}}.$  (20) 
Thus, the hidden layer output is defined as
${O}_{j}^{\mathrm{h}}=\mathrm{\Phi}\left({\text{in}}_{j}\right)={\displaystyle \frac{1}{1+{e}^{{\text{in}}_{j}}}},j=1,\mathrm{\cdots},{n}_{\mathrm{h}},$  (21) 
where ${\text{in}}_{j}={\sum}_{i=1}^{{n}_{\mathrm{in}}}{X}_{i}^{\text{in}}{w}_{i,j}^{(1)}$, ${n}_{\mathrm{in}}$ is the number of input layer nodes and ${X}_{i}^{\text{in}}$ is the $i$th input to the DNN. Similarly, each output node computes its net output as
${O}_{k}^{\mathrm{out}}=\mathrm{\Phi}\left({\text{in}}_{k}\right)={\displaystyle \frac{1}{1+{e}^{{\text{in}}_{k}}}},k=1,2,$  (22) 
where ${\text{in}}_{k}={\sum}_{j=1}^{{n}_{\mathrm{h}}}{O}_{j}{w}_{j,k}^{(2)}$. Then, the sum of weighted inputs to the nodes is applied to the output layer activation function which is assumed to be a linear function. Thus the network outputs can be calculated as
${O}_{k}^{\mathrm{out}}=\mathrm{\Phi}\left({\displaystyle \sum _{j=1}^{{n}_{\mathrm{h}}}}{w}_{j,k}^{(2)}\mathrm{\Phi}\left({\displaystyle \sum _{i=1}^{{n}_{\mathrm{in}}}}{w}_{i,j}^{(1)}{X}_{i}\right)\right),k=1,2.$  (23) 
IVB Training the Neural Network
In training process, the weights of all the layers are adjusted according to the mismatches between the outputs and the targets. In each epoch, the direction of changes are that tending to minimize the meansquared error (MSE). Accordingly, we define the cost function as the MSE between DNN output and the target output as
$\epsilon ={\displaystyle \frac{1}{2}}{\displaystyle \sum _{k=1}^{{n}_{\mathrm{o}}}}{\left({t}_{k}{O}_{k}^{\mathrm{out}}\right)}^{2},$  (24) 
where ${t}_{k}$ is the $k$th desired output and ${n}_{\mathrm{o}}$ is the number of outputs (2 in this case). The gradient descent based back propagation learning rule is exploited to optimize the weights for improving the DNN performance. The gradient descent method minimizes the cost function $\epsilon $ by updating the weights in the opposite direction of the gradient of the objective function w.r.t. to the weights [19]. Thus the weight update in each iteration is given by
$\nabla w=\eta {\displaystyle \frac{\partial \epsilon}{\partial w}},$  (25) 
where $$ is the learning rate that determines the size of the steps in each iteration. Thus the update of hiddentooutputlayer weights can be expressed as
$\nabla {w}_{jk}=\eta \left({t}_{k}{O}_{k}^{\mathrm{out}}\right){\displaystyle \frac{\partial {O}_{k}^{\mathrm{out}}}{\partial {w}_{jk}}}{O}_{j}^{\mathrm{h}}.$  (26) 
Note that the gradient of output layer activation function is always unity. Similarly, the back propagated updates of the inputtohiddenlayer weights can be expressed as
$\nabla {w}_{jk}=\eta \left[{\displaystyle \sum _{k=1}^{{n}_{\mathrm{o}}}}{w}_{jk}\left({t}_{k}{O}_{k}^{\mathrm{out}}\right){\displaystyle \frac{\partial {O}_{k}^{\mathrm{out}}}{\partial {w}_{jk}}}\right]{\displaystyle \frac{\partial {O}_{j}^{\mathrm{h}}}{\partial {w}_{ij}}}{X}_{i}.$  (27) 
For training the DNN, we first generate a large set of pilot symbols following a particular modulation scheme. We then use a series of historical channel responses between moving vehicle and BS for optimizing the weights of the neural network. The training process continues until the target accuracy is reached. The overall training procedure is summarized in Fig. 3.
V Numerical Simulations
In this section, we perform numerical simulations to demonstrate the effectiveness of the proposed machine learning based channel estimation method for wireless V2I communications. Throughout this section, we compare the performance of the proposed approach against the conventional LS and MMSE based estimation schemes in [3]. We first demonstrate the estimation accuracy of the DNN method considering a lowmobility environment. We then test the trained DNN performance for V2I channel estimation.
We assume that the multipath V2I channel has time delays and data is transmitted using 4QAM modulation. The neural network is designed to have a single hidden layer with $10$ neurons and single output layer with two nodes. Sigmoid and linear activation functions has been utilized in the hidden layer and the output layer, respectively. The DNN algorithm is implemented in two ways. The first one is built from sketch using the structure illustrated in Section IV (identified as ‘build’ in the figures), and the second one is implemented based on the ANN toolbox in MATLAB, whose performance has also been compared in simulations where applicable.
Fig. 4 shows the performance of the proposed DNN approach in the training process according to normalized MSE at different SNR environment. We run for 20 epochs for each training data set and It is clear that the error between network outputs and the targets is being minimized as the training progresses. It can also be found that with higher SNR, which means the noise power is low compared with signal power, the error of the estimated channel response becomes lower.
Then, Fig. 5 compares the bit error rate (BER) performances of all the methods. It can be seen that the LS algorithm results in the worst behaviour compared to the others and it cannot correctly estimate the channel especially when SNR is very low. From this figure it is clear that the performance of proposed back propagation DNN approaches are much better than the LS algorithm at low SNR values as well as high SNR values. The results also show that both DNN approaches perform close to each other in a lowmobility environment, and are very close to the MMSE algorithm. Obviously, the MMSE algorithm performs the best according to the simulation results.
Next, simulations of a V2I wireless communication system in a fast timevarying fading channel are carried out to demonstrate the performances of different channel estimation methods. Figs. 6 to 9 show the training process of DNN for V2I channel prediction. In these simulations, the entire data set is divided into three subsets namely: training (70%), validation (15%) and testing (15%), respectively. During the training process, the MSE reduces sharply and the best estimate is achieved at the 49th epoch with the minimum error distribution. Fig. 8 shows the histogram of each subset. From Fig. 9, it can also be seen that the outputs fit the targets very well.
Finally, we plot the BER performance of the algorithms in Fig. 10. It can be seen from the results that the BER performance of the traditional DNN methods is significantly worse than the other methods. The reason for this is that the channel estimation in V2I communication is influenced by many factors, such as fast variation of channels, large Dopplershift, high interference and noise. These results further justify the effectiveness of the proposed DLbased approach for V2I channel estimation.
VI Conclusion
We have introduced a deep learning based channel estimation approach for V2I wireless communication systems. We have demonstrated that the channels in highmobility environment can be estimated using DNN based prediction methods with a group of historical CIR to solve the outdated CSI problem. Extensive simulation results illustrate that the proposed channel prediction method is able to dramatically improve the performance of channel estimation in particular in highmobility environment. Considering the nonuniform movement of vehicles, including variant position and changing velocity in the training process of DNN can be an interesting future work.
References
 [1] F. Tariq, M. R. A. Khandaker, K.K. Wong, M. Imran, M. Bennis, and M. rouane Debbah, “A speculative study on 6G,” IEEE Commun. Magazine, June 2019 (submitted). Available: https://arxiv.org/pdf/1902.06700.pdf.
 [2] J. G. A. et al., “What will 5G be?” IEEE J. Sel. Areas Commun., vol. 32, pp. 1065–1082, June 2014.
 [3] J. J. van de Beek, O. Edfors, M. Sandell, S. K. Wilson, and P. O. Borjesson, “On channel estimation in OFDM systems,” in Proc. IEEE 45th Veh. Technol. Conf., vol. 2, Chicago, IL, USA, 1995, pp. 815–819.
 [4] C. Jiang, H. Zhang, Y. Ren, Z. Han, K.C. Chen, , and L. Hanzo, “Machine learning paradigms for nextgeneration wireless networks,” IEEE Wireless Commun., vol. 24, pp. 98–105, 2016.
 [5] C. Zhang, P. Patras, and H. Haddadi, “Deep learning in mobile and wireless networking: A survey,” IEEE Commun. Surveys & Tutorials, 2019.
 [6] J. Gao, M. R. A. Khandaker, F. Tariq, K.K. Wong, and R. T. Khan, “Deep neural network based resource allocation for V2X communications,” in IEEE 90th Vehicular Technology Conference: VTC2019Fall, 2225 Sep. 2019, Honolulu, Hawaii, USA (submitted). Preprint arXiv:1906.10194.
 [7] H. Ye and G. Y. Li, “Deep reinforcement learning for resource allocation in V2V communications,” in Proc. IEEE Int. Conf. Commun. (ICC), Kansas City, MO, 2018.
 [8] O. Simeone, “A very brief introduction to machine learning with applications to communication systems,” Available online: https://arxiv.org/abs/1808.02342, 2018.
 [9] M. Mehrabi, M. Mohammadkarimi, M. Ardakani, and Y. Jing, “Decision directed channel estimation based on deep neural network kstep predictor for MIMO communications in 5G,” Available: https://arxiv.org/abs/1901.03435, Jan. 2019.
 [10] N. Taspinar and M. N. Seyman, “Back propagation neural network approach forchannel estimation in OFDM system,” in Proc. Wireless Commun., Netw. Inf. Security (WCNIS), June 2010, pp. 265–268.
 [11] H. Ye, G. Y. Li, and B.H. Juang, “Power of deep learning for channel estimation and signal detection in OFDM systems,” IEEE Wireless Commun. Lett., vol. 7, pp. 114–117, Feb. 2018.
 [12] H. He, C. Wen, S. Jin, and G. Y. Li, “Deep learningbased channel estimation for beamspace mmWave massive MIMO systems,” IEEE Wireless Commun. Lett., vol. 7, pp. 852–855, Oct. 2018.
 [13] T. Wang, C. Wen, S. Jin, and G. Y. Li, “Deep learningbased CSI feedback approach for timevarying massive MIMO channels,” IEEE Wireless Commun. Lett., vol. 8, pp. 416–419, Apr. 2019.
 [14] L. Liang, H. Ye, and G. Y. Li, “Toward intelligent vehicular networks: A machine learning framework,” IEEE IoT Journal, vol. 6, pp. 124–135, Feb. 2019.
 [15] L. Liang, G. Y. Li, and W. Xu, “Resource allocation for D2Denabled vehicular communications,” IEEE Trans. Commun., vol. 65, pp. 3186–3197, July 2017.
 [16] G. L. Stüber, Principles of Mobile Communication, 4th ed. Springer, 2017.
 [17] Y. Rong, M. R. A. Khandaker, and Y. Xiang, “Channel estimation of dualhop MIMO relay systems using parallel factor analysis,” IEEE Trans. Wireless Commun., vol. 11, pp. 2224–2233, Jun. 2012.
 [18] Y. Rong and M. R. A. Khandaker, “Channel estimation of dualhop MIMO relay system via parallel factor analysis,” in Proc. 17th AsiaPacific Conf. Commun. (APCC’2011), Sabah, Malaysia, Oct. 25, 2011.
 [19] S. Ruder, “An overview of gradient descent optimization algorithms,” arXiv preprint. Available: arXiv:1609.04747, June 2017.