Hands-On Reinforcement Learning with Python, published by Packt

License: MIT License

Jupyter Notebook 100.00%

hands-on-reinforcement-learning-with-python's Introduction

Hands-On-Reinforcement-Learning-with-Python

This is the code repository for Hands-On-Reinforcement-Learning-with-Python, published by Packt.

Master reinforcement and deep reinforcement learning using OpenAI Gym and TensorFlow

What is this book about?

Reinforcement Learning (RL) is the trending and most promising branch of artificial intelligence. Hands-On Reinforcement learning with Python will help you master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms.

This book covers the following exciting features:

Understand the basics of reinforcement learning methods, algorithms, and elements
Train an agent to walk using OpenAI Gym and Tensorflow
Understand the Markov Decision Process, Bellman’s optimality, and TD learning
Solve multi-armed-bandit problems using various algorithms
Master deep learning algorithms, such as RNN, LSTM, and CNN with applications

If you feel this book is for you, get your copy today!

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

policy_iteration():
Initialize random policy
for i in no_of_iterations:
Q_value = value_function(random_policy)
new_policy = Maximum state action pair from Q value

Following is what you need for this book: If you’re a machine learning developer or deep learning enthusiast interested in artificial intelligence and want to learn about reinforcement learning from scratch, this book is for you. Some knowledge of linear algebra, calculus, and the Python programming language will help you understand the concepts covered in this book.

With the following software and hardware list you can run all code files present in the book (Chapter 1-15).

Software and Hardware List

Chapter	Software required	OS required
1-12	anaconda	Ubutnu or mac
	chrome	Ubutnu or mac

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Get to Know the Author

Sudharsan Ravichandiran is a data scientist, researcher, artificial intelligence enthusiast, and YouTuber (search for Sudharsan reinforcement learning). He completed his bachelors in information technology at Anna University. His area of research focuses on practical implementations of deep learning and reinforcement learning, which includes natural language processing and computer vision. He used to be a freelance web developer and designer and has designed award-winning websites. He is an open source contributor and loves answering questions on Stack Overflow.

Suggestions and Feedback

Click here if you have any feedback or suggestions.

hands-on-reinforcement-learning-with-python's People

Contributors

Stargazers

Watchers

Forkers

godfatherace vijayravik duoan jamac22 carlosandres12 jtwangtw wizquierdo huzichunjohn chengjiacheng taheral jidvei amimul justrypython aiedward irvinec airob pemar expert68 gvakhrushev swissvoc equalll brynl tianyikenan fuxianh angleboy8 piquepei beerbottle awesome-archive aust-hansen mojia yishuihanhan pyhustsong davidzhangdw yuanmengzhixing ringwraith emiya-lin awsomename zzzz123321 william0523 henrydeng abdullahmohammadkhan bookcollections jesselivingston atomwrite kevinguom zuiwanting pherse cybort leefree-git linghanqian mx597014232 0-avalon-0 cxyac freesia-sherry tedrepo njulug vinwan haoxuu sharan-shastri cxxxxy xiaohui-victor-li zhangdats makulu1987 luojianp shanliwa1 xclu hivewang lzjtt2017 jdc08161063 allensmile jiaodaxiaozi liwzhi edunuke raivien davechenxy mingkin yibit nofeetbird0321 georgethrax ozilxu machao0322 chloe4d ikol172q qianrenjian lei3li zyq11223 noahs33 ahaoboy adrianhust lrjxaint tuanthng wintogo hyk13213036 liuzhiyong01 onepiece808 kwaika zhang921210 landiaokafeiyan lemonxy fullstackdevil

hands-on-reinforcement-learning-with-python's Issues

Building a Video Game Bot using OpenAI Universe -Error

Hello! I'm reading a book now.

I try to run the tasks that are given in the second chapter. The task where the robot learns to walk turned out to run, it works.

But when launching a program with a bot for the video game "Auto-Racing", an error occurs:

KeyError: 'flashgames.NeonRace-v0'

During handling of the above exception, another exception occurred:

UnregisteredEnv Traceback (most recent call last)
in
3 import random
4
----> 5 env = gym.make('flashgames.NeonRace-v0')
6 env.configure(remotes = 1) # автоматически создает локальный контейнер docker
7 observation_n = env.reset()

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in make(id, **kwargs)
140
141 def make(id, **kwargs):
--> 142 return registry.make(id, **kwargs)
143
144 def spec(id):

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in make(self, path, **kwargs)
84 else:
85 logger.info('Making new env: %s', path)
---> 86 spec = self.spec(path)
87 env = spec.make(**kwargs)
88 # We used to have people override _reset/_step rather than

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in spec(self, path)
126 raise error.DeprecatedEnv('Env {} not found (valid versions include {})'.format(id, matching_envs))
127 else:
--> 128 raise error.UnregisteredEnv('No registered env with id: {}'.format(id))
129
130 def register(self, id, **kwargs):

UnregisteredEnv: No registered env with id: flashgames.NeonRace-v0

When installing, I had a problem related to mujoco-py. Could the fact that the program does not start be related to this?
Installed OpenAi on a macbook with MacOS Catalina ver. 10.15.2.
I will be grateful for the help.

env = gym.make('CartPole-v0')报错

PkgResourcesDeprecationWarning: Parameters to load are deprecated. Call .resolve and .require separately.
result = entry_point.load(False)

One question about FrozenLake env rewards

I'm actually reading the 3rd chapter when we try to implement a way to resolve the fronzenlake environnement,

As it is written : "We give +1 point as a reward to the agent if it correctly walks on the frozen lake and 0 points if it falls into the hole"

So my question is, why would the agent want to reach the exit of the Lake if the environnement is not giving him negative reward for each step past on the Lake ?

I thought the agent would want to maximize his rewards by walking infinitely on frozen place which gives him +1 reward.

Thx in advance

packtpublishing / hands-on-reinforcement-learning-with-python Goto Github PK