Fig. 7

General Q-learning mechanism. In the general Q-learning mechanism, the system takes the optimal action in its current state to move to the next state, then selects the optimal action in that new state and transitions again.
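The mechanism described in the caption can be illustrated with a minimal tabular sketch. It is not the implementation behind the figure: the three-state chain environment (`chain_step`), the learning rate `alpha`, the discount factor `gamma`, and all function names are assumptions chosen for illustration. The update shown is the standard tabular Q-learning rule, and "optimal action" is interpreted here as the action with the highest Q-value in the current state.

```python
from typing import Callable, List, Tuple

QTable = List[List[float]]  # q[state][action]


def greedy_action(q_row: List[float]) -> int:
    """Return the highest-valued action; ties go to the lowest index."""
    return max(range(len(q_row)), key=lambda a: q_row[a])


def q_update(q: QTable, s: int, a: int, r: float, s_next: int,
             alpha: float = 0.5, gamma: float = 0.9) -> None:
    """Standard tabular Q-learning update (alpha and gamma are assumed values)."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def chain_step(s: int, a: int) -> Tuple[int, float]:
    """Hypothetical 3-state chain: action 1 moves right, action 0 stays.

    Entering state 2 from another state pays a reward of 1.
    """
    s_next = min(s + a, 2)
    reward = 1.0 if (s_next == 2 and s != 2) else 0.0
    return s_next, reward


def greedy_rollout(q: QTable, step: Callable[[int, int], Tuple[int, float]],
                   start: int, terminal: int, max_steps: int = 10) -> List[int]:
    """Repeat: pick the best action in the current state, then transition."""
    path = [start]
    s = start
    for _ in range(max_steps):
        if s == terminal:
            break
        s, _reward = step(s, greedy_action(q[s]))
        path.append(s)
    return path


# Two hand-picked updates, then a greedy walk from state 0.
q = [[0.0, 0.0] for _ in range(3)]
q_update(q, 1, 1, 1.0, 2)   # moving right from state 1 reached the goal
q_update(q, 0, 1, 0.0, 1)   # value propagates back to state 0
path = greedy_rollout(q, chain_step, start=0, terminal=2)
```

After the two updates, moving right has the highest Q-value in states 0 and 1, so the greedy walk visits each state in turn until it reaches the terminal state. A practical agent would usually mix in exploration (for example, epsilon-greedy selection) during learning; that is left out here to keep the state-action-transition loop in view.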