Fig. 7From: Survivability-aware routing restoration mechanism for smart grid communication network in large-scale failuresAverage rewards variation vs. no. of Episodes. This figure shows the average reward of the agent received from the environment varies with the number of episodes for the three algorithms, I-DQN, Q-learning, and Natural-DQN, in the scenario of 3 link failures and 15 affected servicesBack to article page