Fig. 9
From: A second-order dynamic and static ship path planning model based on reinforcement learning and heuristic search algorithms
Comparison of ship reward value for each iteration. The performance comparison of ship reward value achieved by the proposed model and baselines for each iteration.