Fig. 2From: Dynamic spectrum access and sharing through actor-critic deep reinforcement learningPart of structure of twin delayed deep deterministic policy gradient algorithmBack to article page