Skip to main content
Fig. 4 | EURASIP Journal on Wireless Communications and Networking

Fig. 4

From: Dynamic handoff policy for RAN slicing by exploiting deep reinforcement learning

Fig. 4

Cumulative reward. This figure shows the overall cumulative reward of the system as the handoff decision progresses. In the figure, as the number of decision steps increases, the cumulative return of each strategy gradually increases. Among them, the DQN-based handoff algorithm performs better than other algorithms. And as the number of decision steps increases, the advantages of the algorithm become more apparent

Back to article page