Skip to main content

Table 2 Weight parameters

From: Reinforcement learning-based dynamic band and channel selection in cognitive radio ad-hoc networks

Weights vector (Default) Q-learning parameters Reward parameters DRE parameters
w1 = 0.3, w2 = 0.3, Learning rate (α) = 0.3, overhead (η) = 0.01, r1 = 1/6,
w3 = 0.3, w4 = 0.1 Discount factor (γ) = 0.7 δ = 2 r2 = 5/6