Figure 8From: Sensing time and power allocation for cognitive radios using distributed Q-learningDistance d t between the secondary SINRs generated by the Q-learning algorithm and the optimal secondary SINRs when using different cost functions in the Q-learning implementation. The randomness of exploration ϵ is constant and the frequency of the learning algorithm f = 100.Back to article page