Skip to content

Soft Actor Critic - Reinforcement Learning With Trainable Alpha

License

Notifications You must be signed in to change notification settings

ThursdaysMan/SAC-AlphaLearn

Repository files navigation

Soft Actor Critic - Reinforcement Learning With Trainable Alpha

SAC-FinalResultVideo

A soft actor critic implementation, based on the Haarnoja etc al. paper (https://arxiv.org/pdf/1801.01290), with a learnable alpha value instead of the regular set value.

Utilizes priority replay with a sum-tree approach for faster retrieval. Trialled with normalization pre-processing functions specific for the intended environment.

Designed to work with OpenAI Gym - supports multiple environments to parallelize training and tested on OpenAI Gym Humanoid V4.

About

Soft Actor Critic - Reinforcement Learning With Trainable Alpha

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages