SB3-Contrib v2.4.0: New algorithm (CrossQ), Gymnasium v1.0 support
Breaking Changes:
- Upgraded to Stable-Baselines3 >= 2.4.0
New Features:
- Added
CrossQalgorithm, from "Batch Normalization in Deep Reinforcement Learning" paper (@danielpalen) - Added
BatchRenormPyTorch layer used inCrossQ(@danielpalen) - Added support for Gymnasium v1.0
Bug Fixes:
- Updated QR-DQN optimizer input to only include quantile_net parameters (@corentinlger)
- Updated QR-DQN paper link in docs (@corentinlger)
- Fixed a warning with PyTorch 2.4 when loading a
RecurrentPPOmodel (You are using torch.load with weights_only=False) - Fixed loading QRDQN changes
target_update_interval(@jak3122)
Others:
- Updated PyTorch version on CI to 2.3.1
- Remove unnecessary SDE noise resampling in PPO/TRPO update
- Switched to uv to download packages on GitHub CI
New Contributors
- @corentinlger made their first contribution in #252
- @jak3122 made their first contribution in #259
- @danielpalen made their first contribution in #243
Full Changelog: v2.3.0...v2.4.0