Fig. 4
From: Using location semantics to realize personalized road network location privacy protection
Interaction process between the "Agent" and the "Environment." The agent is the learner, and the environment is the setting in which the learner acts. A_t, S_t, and R_t denote the current action, current state, and current reward, respectively; S_{t+1} and R_{t+1} denote the state and the reward at the next moment. The dotted line in the graph marks the time boundary.
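
The agent-environment loop in the figure can be illustrated with a minimal sketch. This is not the article's implementation: `ToyEnvironment`, its two-state dynamics, its reward rule, and `run_episode` are all hypothetical names and assumptions chosen only to show how a state, an action, and the resulting next state and reward follow one another across a time step.

```python
class ToyEnvironment:
    """Hypothetical two-state environment, for illustration only."""

    def __init__(self):
        self.state = 0  # initial state S_0

    def step(self, action):
        """Apply action A_t; return the next state S_{t+1} and reward R_{t+1}."""
        # Assumed reward rule: 1.0 when the action matches the current state.
        reward = 1.0 if action == self.state else 0.0
        # Assumed dynamics: the state alternates between 0 and 1.
        self.state = 1 - self.state
        return self.state, reward


def run_episode(env, policy, steps):
    """Run the interaction loop and record (S_t, A_t, R_{t+1}, S_{t+1}) tuples."""
    state = env.state
    trajectory = []
    for _ in range(steps):
        action = policy(state)                 # agent picks A_t from S_t
        next_state, reward = env.step(action)  # environment answers
        trajectory.append((state, action, reward, next_state))
        state = next_state                     # cross the time boundary
    return trajectory
```

For example, `run_episode(ToyEnvironment(), lambda s: s, 4)` runs four steps with a policy that simply echoes the observed state. Each loop iteration corresponds to one crossing of the dotted time boundary in the figure.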