Each algorithm consists of three parts:
-
algorithm
-
config file (
.yaml
) -
main function
No inheritance is used, making the algorithm clearer.
Trade-off between performance and readability.
- Reinforce
-
Actor Critic (AC)
-
Adavantage Actor Critic (A2C)
- Deep Q Network (DQN)
- Double Deep Q Network (DoubleDQN)
- Dueling Double Deep Q Network (DuelingDoubleDQN)
-
Python
-
Numpy
-
Gym
-
Pytorch
-
Matplotlib