Implements simple reinforcement learning policy gradient methods, such as actor-critic methods, on OpenAI Gym environments
-
One-Step Actor-Critic
(Reinforcement Learning: An Introduction, 2nd Edition, by Richard S. Sutton and Andrew G. Barto)
Simulation environment: CartPole-v0
- TensorFlow 2.0
- Keras 2.2.4
- OpenAI Gym
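
For orientation, the one-step actor-critic update from Sutton and Barto can be sketched on a single transition as below. This is a minimal illustration, not the code in this repository: it assumes linear function approximation and a softmax policy in plain Python instead of TensorFlow/Keras networks, and the names `actor_critic_step`, `dot`, and `softmax`, as well as the step sizes, are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def actor_critic_step(theta, w, x, action, reward, x_next, done,
                      gamma=0.99, alpha_theta=0.01, alpha_w=0.1, I=1.0):
    """One update of one-step actor-critic (assumed linear actor and critic).

    theta  : list of per-action weight vectors (actor parameters)
    w      : weight vector of the linear state-value estimate (critic)
    x      : feature vector of the current state
    x_next : feature vector of the next state
    I      : accumulated discount, gamma**t within the episode
    Returns (new_theta, new_w, delta) without modifying the inputs.
    """
    # TD error: delta = R + gamma * v(S') - v(S), with v(terminal) = 0
    v = dot(w, x)
    v_next = 0.0 if done else dot(w, x_next)
    delta = reward + gamma * v_next - v

    # Critic update: for a linear critic, grad_w v(S) is just x
    new_w = [wi + alpha_w * delta * xi for wi, xi in zip(w, x)]

    # Actor update: for a linear softmax policy,
    # grad_{theta_b} ln pi(a|s) = (1[a == b] - pi(b|s)) * x
    probs = softmax([dot(t, x) for t in theta])
    new_theta = []
    for b, t in enumerate(theta):
        indicator = 1.0 if b == action else 0.0
        scale = alpha_theta * I * delta * (indicator - probs[b])
        new_theta.append([tj + scale * xj for tj, xj in zip(t, x)])

    return new_theta, new_w, delta
```

In an episode loop, `I` would start at 1.0 and be multiplied by `gamma` after every step. For CartPole, the observation has 4 components and there are 2 actions, so `theta` would hold two weight vectors of length 4 if the raw observation is used as the feature vector.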