Adapted from http://ai.berkeley.edu/project_overview.html
Learning resource: https://gibberblot.github.io/rl-notes/intro.html
In this project, you will implement value iteration and Q-learning. You will test your agents first on Gridworld, then apply them to a simulated robot controller (Crawler) and Pacman.
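
As a rough orientation, the two algorithms named above can be sketched on a toy problem. This is a minimal illustration under stated assumptions, not the project's actual agent API: the three-state chain MDP, the `transitions` dictionary layout, and the function names `value_iteration` and `q_learning_update` are all hypothetical and chosen only for this example.

```python
# Hypothetical toy MDP (not part of the project code): a 3-state chain.
# State 2 is terminal; moving right from state 1 into state 2 pays reward 1.
STATES = [0, 1, 2]
ACTIONS = ["left", "right"]

# (state, action) -> list of (probability, next_state, reward)
TRANSITIONS = {
    (0, "left"): [(1.0, 0, 0.0)],
    (0, "right"): [(1.0, 1, 0.0)],
    (1, "left"): [(1.0, 0, 0.0)],
    (1, "right"): [(1.0, 2, 1.0)],
    # State 2 has no entries, so it is treated as terminal.
}


def value_iteration(states, actions, transitions, gamma=0.9, iterations=100):
    """Batch value iteration: V_{k+1}(s) = max_a sum_s' T(s,a,s') [R + gamma V_k(s')]."""
    values = {s: 0.0 for s in states}
    for _ in range(iterations):
        new_values = {}
        for s in states:
            # One expected return per action that is legal in state s.
            q_values = [
                sum(p * (r + gamma * values[s2]) for p, s2, r in transitions[(s, a)])
                for a in actions
                if (s, a) in transitions
            ]
            # Terminal states (no legal actions) keep a value of 0.
            new_values[s] = max(q_values) if q_values else 0.0
        values = new_values
    return values


def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.5, gamma=0.9):
    """One tabular Q-learning step from an observed (s, a, r, s') sample."""
    # Unseen (state, action) pairs default to a Q-value of 0.
    best_next = max((q.get((next_state, a), 0.0) for a in actions), default=0.0)
    sample = reward + gamma * best_next
    q[(state, action)] = (1 - alpha) * q.get((state, action), 0.0) + alpha * sample
    return q[(state, action)]


if __name__ == "__main__":
    print(value_iteration(STATES, ACTIONS, TRANSITIONS))

    q_table = {}
    q_learning_update(q_table, 1, "right", 1.0, 2, ACTIONS)
    print(q_table)
```

Value iteration needs the transition model up front, while the Q-learning update only needs observed samples, which is why the latter suits settings such as Crawler and Pacman where the agent learns by acting.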