Meta-Gradient RL A2C #207
Unanswered
RobvanGastel
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi!
I have been working with your package to test your meta-gradient RL example. As I have it implemented now, the algorithm converges on the cartpole environment, however, the meta-parameter gamma only trends downwards. Do I use torchopt incorrectly in the code sample below?
Any help would be much appreciated! Thank you!
Beta Was this translation helpful? Give feedback.
All reactions