Fig. 5
From: A Q-learning-based distributed queuing Mac protocol for Internet-of-Things networks
The agent-environment interaction in a Markov decision process