From: Deep reinforcement learning-based beam training with energy and spectral efficiency maximisation for millimetre-wave channels
Beam training algorithm   Time (s)   Reward
DRL-3                     7.12       −0.17
DRL-5                     7.11
DRL-7                     7.22       −0.18
MAB                       7.07       −0.20
RAND                      7.39       −0.60
MR                        46.56      −1.43