Here I will list the algorithm I implimented.
Here I will list the thing I am learning or have learned to implement the algorithms.
- David Silver's RL lectures
- Lecture 1 - Intro RL
- Lecture 2 - MDP
- Lecture 3 - Dynamic Programming
- Lecture 4 - MC-TD
- Lecture 5 - Control
- Lecture 6 - Function Approximation
- Lecture 7 - Policy Gradient
- Lecture 8 - Integrating Learning and Planning
- Lecture 9 - Exploration and Exploitation
- Lecture 10 - Classic Games
- Sutton and Barto's "An Introduction to Reinforcement Learning"
- Csaba Szepesvari's "Algorithms for Reinforcement Learning"