Fig. 14From: Q-learning-based dynamic joint control of interference and transmission opportunities for cognitive radioInterference ratio, transmission opportunity loss ratio, and reward of the fixed case and Q-learning @ − 12 dB. a Interference radio. b Transmission opportunity loss ratio. c Reward. The performance of the interference ratio, transmission opportunity loss ratio, and reward is compared for cases where the sensing time and the reply time are fixed and the proposed Q-learning is used for SNR of − 12 dB. In a representing the interference ratio and b representing the transmission loss ratio, the Q-learning operates within a stable range. For the case where the sensing time and the reapply time are fixed, the interference ratio is low, but the transmission opportunity loss ratio increases significantly in most cases. In c indicating a reward, the Q-learning shows overwhelming performance difference compared to other cases. Thus, it can be shown that the system load can be reduced by using Q-learningBack to article page