View Code? Open in Web Editor
NEW
본 repository는 서울대학교 2022_1 Data Science and Reinforcement Learning 수업 최종 과제입니다.
Python 34.97%
Jupyter Notebook 65.03%
reinforcement_learning_lava's Introduction
- --team : your team number
- --envType : 0 for chain mdp, 1 for lava grid
- --evalType : 0 for evaluating performance, 1 for evaluation sample efficiency
- --seeds : the list of seeds
- chain mdp & performance
- python test.py --team 1 --envType 0 --evalType 0 --seeds 1 10 100 1000 10000
- chain mdp & sample efficiency
- python test.py --team 1 --envType 0 --evalType 1 --seeds 1 10 100 1000 10000
- lava grid & Performance
- python test.py --team 1 --envType 1 --evalType 0 --seeds 1 10 100 1000 10000
- lava grid & sample efficiency
- python test.py --team 1 --envType 1 --evalType 1 --seeds 1 10 100 1000 10000
- For chain mdp, chain-pf.txt(or chain-se.txt) will be created for peformance(or sample efficiency).
- For lava grid, lava-pf.txt(or lava-se.txt) will be created for peformance(or sample efficiency).
reinforcement_learning_lava's People
Contributors
Watchers