A Soft Actor-Critic (SAC) implementation based on the Haarnoja et al. paper (https://arxiv.org/pdf/1801.01290), with a learnable temperature (alpha) instead of a fixed value.
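
Below is a minimal, hypothetical sketch of how a learnable alpha can be trained, following the automatic temperature adjustment from Haarnoja et al.; the names (`log_alpha`, `target_entropy`, `update_alpha`) are illustrative and may differ from this repo's actual code, which is assumed here to use PyTorch.

```python
import torch

# Target entropy heuristic from Haarnoja et al.: -|A| (negative action dimension).
# 17 is the Humanoid-v4 action dimension, used here purely as an example.
action_dim = 17
target_entropy = -float(action_dim)

# Optimize log(alpha) rather than alpha directly so the temperature stays positive.
log_alpha = torch.zeros(1, requires_grad=True)
alpha_optimizer = torch.optim.Adam([log_alpha], lr=3e-4)

def update_alpha(log_probs: torch.Tensor) -> torch.Tensor:
    """One temperature update step, given log-probs of actions sampled from the policy."""
    alpha_loss = -(log_alpha * (log_probs + target_entropy).detach()).mean()
    alpha_optimizer.zero_grad()
    alpha_loss.backward()
    alpha_optimizer.step()
    # Current alpha, to be plugged into the actor and critic losses.
    return log_alpha.exp().detach()
```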
Uses prioritized experience replay backed by a sum-tree for efficient, priority-proportional sampling. Trialled with normalization pre-processing functions tailored to the target environment.
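
For reference, here is a minimal sum-tree sketch of the kind typically used for prioritized replay; it is not the repo's exact implementation, only an illustration of why sampling and priority updates are O(log n).

```python
import numpy as np

class SumTree:
    """Leaves hold transition priorities; internal nodes hold the sum of their children,
    so the root is the total priority and sampling descends the tree in O(log n)."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.tree = np.zeros(2 * capacity - 1)
        self.write = 0  # next leaf to overwrite (ring buffer)

    def update(self, leaf: int, priority: float):
        idx = leaf + self.capacity - 1
        change = priority - self.tree[idx]
        self.tree[idx] = priority
        while idx > 0:                      # propagate the change up to the root
            idx = (idx - 1) // 2
            self.tree[idx] += change

    def add(self, priority: float) -> int:
        leaf = self.write
        self.update(leaf, priority)
        self.write = (self.write + 1) % self.capacity
        return leaf

    def sample(self, value: float) -> int:
        """Return the leaf whose cumulative-priority range contains `value` in [0, total)."""
        idx = 0
        while idx < self.capacity - 1:      # descend until a leaf is reached
            left, right = 2 * idx + 1, 2 * idx + 2
            if value <= self.tree[left]:
                idx = left
            else:
                value -= self.tree[left]
                idx = right
        return idx - (self.capacity - 1)

    @property
    def total(self) -> float:
        return self.tree[0]
```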
Designed to work with OpenAI Gym: supports running multiple environments in parallel to speed up training, and tested on OpenAI Gym Humanoid-v4.
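
A rough sketch of parallel data collection with Gym's vectorized API is shown below; it assumes gym >= 0.26 (five-value `step` return) and stands in for, rather than reproduces, the repo's own environment setup and normalization wrappers.

```python
import gym

# Run several Humanoid-v4 instances in parallel for faster experience collection.
num_envs = 4
envs = gym.vector.make("Humanoid-v4", num_envs=num_envs)

obs, infos = envs.reset(seed=0)
for _ in range(1000):
    # Random actions as a placeholder for actions sampled from the SAC policy.
    actions = envs.action_space.sample()
    obs, rewards, terminateds, truncateds, infos = envs.step(actions)
envs.close()
```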
