Fig. 6
From: Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services

The reward curves at various loads. The average and maximum total reward curves of the RL-based hybrid spectrum and power resource allocation scheme at loads of 0.86, 1.03, and 1.28. The x-axis is the number of training iterations, and the y-axis is the total reward defined in Eq. (11).