
Releases: cpnota/autonomous-learning-library

VPG

31 May 21:47
021f0a0


VPG Pre-release

This release contains two small changes:

  1. Rename REINFORCE to VPG to stay consistent with other libraries. VPG can now also average gradients over multiple episodes, which drastically improves performance in some cases.
  2. Tweak A2C to align better with other implementations. In particular, add a new, more accurate n-step buffer, and make some small changes to ensure that feature gradients are computed correctly.
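
To illustrate what an n-step buffer computes, here is a minimal sketch of truncated n-step return targets for a single trajectory segment. It is not the library's implementation: the function name `n_step_returns` and its parameters are hypothetical, chosen only for this example, and the sketch ignores batching across parallel environments.

```python
def n_step_returns(rewards, values, last_value, n, gamma=0.99):
    """Compute truncated n-step return targets (illustrative sketch only).

    rewards[t]  -- reward received after acting in state s_t
    values[t]   -- critic estimate V(s_t)
    last_value  -- V(s_T) for the state after the final step (0.0 if terminal)
    n           -- maximum number of rewards to sum before bootstrapping
    """
    T = len(rewards)
    # Index T holds the bootstrap value for the state after the last step.
    bootstrap = list(values) + [last_value]
    targets = []
    for t in range(T):
        # Look ahead at most n steps, stopping at the end of the segment.
        end = min(t + n, T)
        g = 0.0
        for k in range(t, end):
            g += gamma ** (k - t) * rewards[k]
        # Bootstrap from the value estimate of the state where we stopped.
        g += gamma ** (end - t) * bootstrap[end]
        targets.append(g)
    return targets


if __name__ == "__main__":
    print(n_step_returns([1.0, 1.0, 1.0], [0.5, 0.5, 0.5], 0.0, n=2, gamma=0.5))
```

Steps near the end of the segment see fewer than n rewards, so they bootstrap from `last_value` instead; handling that truncation consistently is one place where n-step implementations commonly differ.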

A2C

14 Apr 18:44


A2C Pre-release
  • Add the A2C algorithm
  • Support running parallel environments
  • Record the loss function and other information
  • Simplify the installation process
  • Re-enable Travis CI tests

CUDA

06 Apr 22:11
efb3f5d


CUDA Pre-release
  1. Add CUDA support
  2. Refactor list handling in function approximators
  3. Tweak the organization of presets
  4. Improve the benchmarks API

DQN

22 Mar 23:09
589ee9e


DQN Pre-release

Includes a fully working DQN implementation and a partial but working Rainbow implementation. Also includes Actor-Critic, Sarsa, and REINFORCE implementations for classic control environments.