Figure 1
From: A projected primal-dual gradient optimal control method for deep reinforcement learning
Schematic representation of the policy