Skip to content

Latest commit

 

History

History
28 lines (23 loc) · 2.21 KB

RLAIF: Reinforcement Learning from AI Feedback

File metadata and controls

28 lines (23 loc) · 2.21 KB