Founder of FAR.AI @AlignmentResearch
- Berkeley, California
- http://gleave.me
Pinned Loading
-
hill-a/stable-baselines
hill-a/stable-baselines PublicForked from openai/baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
-
HumanCompatibleAI/adversarial-policies
HumanCompatibleAI/adversarial-policies PublicFind best-response to a fixed policy in multi-agent RL
-
HumanCompatibleAI/imitation
HumanCompatibleAI/imitation PublicClean PyTorch implementations of imitation and reward learning algorithms
-
HumanCompatibleAI/seals
HumanCompatibleAI/seals PublicBenchmark environments for reward modelling and imitation learning algorithms.
-
HumanCompatibleAI/population-irl
HumanCompatibleAI/population-irl Public(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
-
73 contributions in the last year
Day of Week | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More