-
Huazhong University of Science and Technology
- Wuhan, China
-
20:12
- 8h ahead
Pinned Loading
-
awesome-RLHF
awesome-RLHF PublicForked from opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
-
Defocus-blur-detection
Defocus-blur-detection PublicDefocus Blur Detection for computational photography, work done at the HUST.
Python
-
Discovery-of-Optimal-Reward-function
Discovery-of-Optimal-Reward-function PublicOfficial implementation of the paper "Discovery of the Reward Function for Embodied RL Agents".
Python 2
-
stable-baselines3
stable-baselines3 PublicForked from DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Python
-
BPref
BPref PublicForked from rll-research/BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
Python
-
LiRE
LiRE PublicForked from chwoong/LiRE
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
Python
If the problem persists, check the GitHub status page or contact support.