Fig. 5From: Reinforcement learning-based dynamic band and channel selection in cognitive radio ad-hoc networksProposed Q-table structure. The column of the Q-table represents the action tuple of the band group bq (q-th band group) and channel \( {c}_m^{b_q} \) (m-th channel of bq). The row of the Q-table is the state tuple of the i-th geographic location zone (li), j-th time zone (tj), k-th band group (bk), and l-th data rate efficiency level (dl)Back to article page