
Table 3. The new value function for each state of DP

From: Trajectory optimization for UAV-assisted relay over 5G networks based on reinforcement learning framework

  1. Bold values indicate the optimal action for each state
  2. The optimal actions are highlighted in green, orange, blue, and yellow for the Suburban, Urban, Dense Urban, and Highrise Urban environments, respectively