_total_steps assignment position bug #42

MichelangeloConserva · 2022-09-19T07:49:44Z

The _total_steps should be increased after the agent takes an action, as in the jax implementation, or during the update function call, as for DQN. If the assignment is done after computing the gradient, as in the current implementation, the agent will not be trained for values of the sgd_period values higher than one.

The _total_steps should be increased after the agent takes an action as in the jax implementation of this agent. For values of the sgd_period higher than one, this bugs prevents the agent from training.

_total_steps assignment position bug

70442c1

The _total_steps should be increased after the agent takes an action as in the jax implementation of this agent. For values of the sgd_period higher than one, this bugs prevents the agent from training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

_total_steps assignment position bug #42

_total_steps assignment position bug #42

Uh oh!

MichelangeloConserva commented Sep 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

_total_steps assignment position bug #42

Are you sure you want to change the base?

_total_steps assignment position bug #42

Uh oh!

Conversation

MichelangeloConserva commented Sep 19, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant