-
Notifications
You must be signed in to change notification settings - Fork 516
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Nan Problems for SAC, TQC, for AntBulletEnv-v0, HalfCheetahBulletEnv-v0 #427
Comments
This may be due to a learning rate too high, see #156 (comment); do you use the default hyperparams? Also related (and probably duplicate): DLR-RM/stable-baselines3#1401 and DLR-RM/stable-baselines3#1418 |
yes, I use the default hyperparams, I try different learning rate later. |
Hello, I also assume you are using pybullet gymnasium repo? I'll try to reproduce the issue in the meantime. Also related: DLR-RM/stable-baselines3#1372 changing to AdamW might solve the problem too. |
I have tried TD3, SAC, TQC on some pybullet envs. And it only happens for the task I mention, the others is fine. I can upload some log file. sac-AntBulletEnv-v0.zip |
Thanks =) Looking at the log it seems to be due to an explosion of std (and you are using a much larger budget that the one we were using by default). I would appreciate a PR that adds this parameter =) Hmm, for TD3 it is weird if it happens as it doesn't rely on any distribution. EDIT: I guess the issue is similar to Stable-Baselines-Team/stable-baselines3-contrib#146 by @qgallouedec |
Bug already encountered in openrlbenchmark, |
For TD3, I only found two runs where you have an explosion of the losses, but this didn't lead to the bug: |
after I change the hyperparams from
to
this problem never happens again, so let's close this issue |
Thanks for trying out =) |
🐛 Bug
Hello. I am trying to reproduce some algorithms or experiments, to record some data. But some expectation happens, nan is generated for some unknown reasons. Any advice to solve?
To Reproduce
Relevant log output / Error message
System Info
Checklist
The text was updated successfully, but these errors were encountered: