From: Q-learning-based dynamic joint control of interference and transmission opportunities for cognitive radio
Learning rate (α)
Discount factor (γ)
Random selection probability(ε)
0.5
0.3→0.1