Fig. 8From: Dynamic handoff policy for RAN slicing by exploiting deep reinforcement learningParameter convergence. This figure shows the relationship between the weight of the connection between the input layer neurons and the hidden layer neurons and the number of trainings. As shown in the figure, after 100,000 trainings, the weight parameters tend to be stableBack to article page