Releases: vwxyzjn/cleanba

v1.0.0b3 Refactor PPO

24 Aug 14:38
Pre-release

Full Changelog: v1.0.0b2...v1.0.0b3

v1.0.0b2 pre-release (bug fix)

17 Aug 20:42
Pre-release

Full Changelog: v1.0.0b1...v1.0.0b2

v1.0.0b1 pre-release (many refactor)

17 Aug 14:43
6367b2c
Pre-release

What's Changed

  • Improvement by @vwxyzjn in #2
  • fix hts-rl paper link by @ChufanSuki in #3
  • Remove the legacy last action rewards script by @vwxyzjn in #7
  • add grad accum for ppo + envpool + impala atari wrapper by @51616 in #8
  • Add actor threads example by @vwxyzjn in #6

New Contributors

  • @vwxyzjn made their first contribution in #2
  • @ChufanSuki made their first contribution in #3
  • @51616 made their first contribution in #8

Full Changelog: v0.0.1...v1.0.0b1

Dummy Release

22 Feb 17:06
v0.0.1

Add acknowledgement