Fig. 8
From: Q-learning-based dynamic joint control of interference and transmission opportunities for cognitive radio

Q-learning mechanism of the proposed scheme. In the proposed Q-learning system, the sensor acts as the agent, and its action is a choice of sensing time and reply time. In an environment where the PU alternates between busy and idle states, the agent obtains the interference ratio and the transmission opportunity loss ratio that result from the action, and uses them to update the state and calculate the reward.
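
The loop described in the caption can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' implementation: the action grid, the state discretization, the reward formula, and the names `ACTIONS`, `discretize`, `reward`, `q_update`, and `choose_action` are all assumptions made for illustration. The PU busy/idle environment is not modeled here; the two ratios are taken as given inputs.

```python
import random

# Hypothetical action set: (sensing_time, reply_time) pairs in arbitrary units.
ACTIONS = [(s, r) for s in (1, 2, 3) for r in (1, 2)]


def discretize(interference_ratio, loss_ratio, bins=4):
    """Map the two observed ratios in [0, 1] to a discrete state (assumed scheme)."""
    i = min(int(interference_ratio * bins), bins - 1)
    j = min(int(loss_ratio * bins), bins - 1)
    return (i, j)


def reward(interference_ratio, loss_ratio, weight=0.5):
    """Assumed reward: negative weighted sum of the two ratios."""
    return -(weight * interference_ratio + (1 - weight) * loss_ratio)


def q_update(q, state, action, r, next_state, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a dict-based Q-table."""
    best_next = max(q.get((next_state, a), 0.0) for a in ACTIONS)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (r + gamma * best_next - old)
    return q[(state, action)]


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection over the hypothetical action set."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
```

In use, the agent would call `choose_action`, apply the chosen sensing and reply times, measure the two ratios, and then pass `reward(...)` and `discretize(...)` of those measurements to `q_update`.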