Fig. 13From: Reinforcement learning-based dynamic band and channel selection in cognitive radio ad-hoc networksThe a Q-value of Q-table when DDR = 90 kbps, and b number of state visits. Similar to Figs. 9 and 10b, the Q-value of Q-table shows the channel movement of moderate DDR case. The visit count of a state is high in the state of low DRE and high CBG because the stable domain is in that regionBack to article page