Fig. 11
From: Reinforcement learning-based dynamic band and channel selection in cognitive radio ad-hoc networks
(a) Q-values of the Q-table when DDR = 40 kbps, and (b) number of state visits. As in Figs. 9 and 10a, the Q-values show the channel movement for the low-DDR case. The visit count is high for states with low DRE and low CBG because the stable domain lies in that region.