-
Huazhong University of Science and Technology
- Wuhan, China
Pinned
-
awesome-RLHF (Public, forked from opendilab/awesome-RLHF)
A curated list of reinforcement learning with human feedback resources (continually updated)
-
Defocus-blur-detection (Public)
Defocus blur detection for computational photography, developed at Huazhong University of Science and Technology.
Python
-
Discovery-of-Optimal-Reward-function (Public)
Official implementation of the paper "Discovery of the Optimal Reward Function for Embodied RL Agents".
-
imitation (Public, forked from HumanCompatibleAI/imitation)
Clean PyTorch implementations of imitation and reward learning algorithms
Python
-
stable-baselines3 (Public, forked from DLR-RM/stable-baselines3)
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Python