Figure 10From: Sensing time and power allocation for cognitive radios using distributed Q-learningDistance d t between the secondary SINRs generated by the Q-learning algorithm and the optimal secondary SINRs when using different exploration strategies in the Q-learning implementation. The learning frequency is f = 100 and a cooperative cost function is used.Back to article page