anmol6 / learning-from-human-preferences
This project is forked from mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
License: MIT License