ML-Agents Beta 0.9.2
Pre-release
Pre-release
New Features
- Added multi-gpu implementation for PPO. You can turn this on by using
--multi-gpu
option inmlagents-learn
. In our own experiments we've observed a 38% boost on the update speed of Resnet using the Obstacle Tower Environment. (#2288)
Fixes and Improvements
- Improved the training performance by 20-30% when curiosity is used. We achieved this by updating the reward signal in parallel with the policy. (#2362)
- Changed the Crawler example environment observation space to include relative observations. Increases performance in
CrawlerDynamic
scene. (18f661) - Fixed a bug with online Behavior Cloning. (#2384)
- Fixed a compile error on DemonstrationImporter.cs. (b617de)
Acknowledgements
Thanks to everyone at Unity who contributed to v0.9.2, as well as: @RunSwimFlyRich, @niskander.