Git Product home page Git Product logo

hands-on-reinforcement-learning-with-python's Introduction

Hands-On-Reinforcement-Learning-with-Python

Hands-On Reinforcement Learning with Python

This is the code repository for Hands-On-Reinforcement-Learning-with-Python, published by Packt.

Master reinforcement and deep reinforcement learning using OpenAI Gym and TensorFlow

What is this book about?

Reinforcement Learning (RL) is the trending and most promising branch of artificial intelligence. Hands-On Reinforcement learning with Python will help you master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms.

This book covers the following exciting features:

  • Understand the basics of reinforcement learning methods, algorithms, and elements
  • Train an agent to walk using OpenAI Gym and Tensorflow
  • Understand the Markov Decision Process, Bellman’s optimality, and TD learning
  • Solve multi-armed-bandit problems using various algorithms
  • Master deep learning algorithms, such as RNN, LSTM, and CNN with applications

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

policy_iteration():
Initialize random policy
for i in no_of_iterations:
Q_value = value_function(random_policy)
new_policy = Maximum state action pair from Q value

Following is what you need for this book: If you’re a machine learning developer or deep learning enthusiast interested in artificial intelligence and want to learn about reinforcement learning from scratch, this book is for you. Some knowledge of linear algebra, calculus, and the Python programming language will help you understand the concepts covered in this book.

With the following software and hardware list you can run all code files present in the book (Chapter 1-15).

Software and Hardware List

Chapter Software required OS required
1-12 anaconda Ubutnu or mac
chrome Ubutnu or mac

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Related products

Get to Know the Author

Sudharsan Ravichandiran is a data scientist, researcher, artificial intelligence enthusiast, and YouTuber (search for Sudharsan reinforcement learning). He completed his bachelors in information technology at Anna University. His area of research focuses on practical implementations of deep learning and reinforcement learning, which includes natural language processing and computer vision. He used to be a freelance web developer and designer and has designed award-winning websites. He is an open source contributor and loves answering questions on Stack Overflow.

Suggestions and Feedback

Click here if you have any feedback or suggestions.

hands-on-reinforcement-learning-with-python's People

Contributors

packt-itservice avatar packtutkarshr avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

hands-on-reinforcement-learning-with-python's Issues

Building a Video Game Bot using OpenAI Universe -Error

Hello! I'm reading a book now.

I try to run the tasks that are given in the second chapter. The task where the robot learns to walk turned out to run, it works.

But when launching a program with a bot for the video game "Auto-Racing", an error occurs:


KeyError: 'flashgames.NeonRace-v0'

During handling of the above exception, another exception occurred:

UnregisteredEnv Traceback (most recent call last)
in
3 import random
4
----> 5 env = gym.make('flashgames.NeonRace-v0')
6 env.configure(remotes = 1) # автоматически создает локальный контейнер docker
7 observation_n = env.reset()

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in make(id, **kwargs)
140
141 def make(id, **kwargs):
--> 142 return registry.make(id, **kwargs)
143
144 def spec(id):

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in make(self, path, **kwargs)
84 else:
85 logger.info('Making new env: %s', path)
---> 86 spec = self.spec(path)
87 env = spec.make(**kwargs)
88 # We used to have people override _reset/_step rather than

/opt/anaconda3/envs/universe/lib/python3.6/site-packages/gym/envs/registration.py in spec(self, path)
126 raise error.DeprecatedEnv('Env {} not found (valid versions include {})'.format(id, matching_envs))
127 else:
--> 128 raise error.UnregisteredEnv('No registered env with id: {}'.format(id))
129
130 def register(self, id, **kwargs):

UnregisteredEnv: No registered env with id: flashgames.NeonRace-v0

When installing, I had a problem related to mujoco-py. Could the fact that the program does not start be related to this?
Installed OpenAi on a macbook with MacOS Catalina ver. 10.15.2.
I will be grateful for the help.

env = gym.make('CartPole-v0')报错

PkgResourcesDeprecationWarning: Parameters to load are deprecated. Call .resolve and .require separately.
result = entry_point.load(False)

One question about FrozenLake env rewards

I'm actually reading the 3rd chapter when we try to implement a way to resolve the fronzenlake environnement,

As it is written : "We give +1 point as a reward to the agent if it correctly walks on the frozen lake and 0 points if it falls into the hole"

So my question is, why would the agent want to reach the exit of the Lake if the environnement is not giving him negative reward for each step past on the Lake ?

I thought the agent would want to maximize his rewards by walking infinitely on frozen place which gives him +1 reward.

Thx in advance

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.