32 projects implementing Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C, and others. Each project comes with a detailed training log.
Hi, I've been looking through your code as a reference to figure out how to solve CarRacing-v0.
Mine works up to a point, then suffers a catastrophic performance collapse.
The only difference I can find between my version and yours is that when the unwrapped environment is done (i.e., the episode fails), the agent receives a large negative reward.
You removed this in your wrapper, and I don't understand why.
What's the significance of offsetting the reward there?
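For reference, the kind of terminal-penalty shaping being described can be sketched like this (a minimal sketch: `DummyEnv`, `PenaltyWrapper`, and the penalty value are illustrative stand-ins, not the repo's actual code):

```python
class DummyEnv:
    """Stand-in for CarRacing-v0: gives +1.0 per step, ends after max_steps."""
    def __init__(self, max_steps=5):
        self.max_steps = max_steps
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0  # dummy observation

    def step(self, action):
        self.t += 1
        done = self.t >= self.max_steps
        return 0.0, 1.0, done, {}  # obs, reward, done, info


class PenaltyWrapper:
    """Adds a large negative reward when the wrapped episode terminates."""
    def __init__(self, env, failure_penalty=-100.0):
        self.env = env
        self.failure_penalty = failure_penalty

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        if done:
            reward += self.failure_penalty  # shaping applied on termination
        return obs, reward, done, info


env = PenaltyWrapper(DummyEnv())
env.reset()
total, done = 0.0, False
while not done:
    _, r, done, _ = env.step(None)
    total += r
print(total)  # 5 steps * 1.0 reward, minus the 100.0 penalty = -95.0
```

The shaped return (-95.0 here) can look very different from the environment's raw return (5.0), which is exactly why it matters whether the shaping is active when scores are reported.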
The figure and log in the README show scores above 1000, which, given CarRacing's reward design, should not be possible.
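For context: per the Gym documentation, CarRacing-v0's reward is -0.1 every frame plus 1000/N for each of the N track tiles visited, so even a perfect run stays below 1000. A quick back-of-the-envelope check (the frame count is illustrative):

```python
# CarRacing-v0 reward structure (per the Gym docs):
#   -0.1 per frame, +1000/N for each of the N track tiles visited.
frames = 1000                             # illustrative episode length
tile_reward = 1000.0                      # sum over all N tiles: N * (1000 / N)
best_return = tile_reward - frames / 10   # time penalty of 0.1 per frame
print(best_return)  # → 900.0
```

So any reported score above 1000 necessarily includes something besides the environment's own reward.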
It turns out that the reward shaping in Wrapper.step() is not removed during evaluation, which inflates the reported results.
After commenting out the relevant lines, I got an average score of 820 over 100 episodes.
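One way to avoid this class of bug is to gate the shaping behind an explicit training flag, so evaluation always sees the environment's raw return (a minimal sketch; `ShapingWrapper`, `OneStepEnv`, and the penalty value are illustrative, not the repo's code):

```python
class ShapingWrapper:
    """Applies a terminal penalty only when `training` is True."""
    def __init__(self, env, training=True, failure_penalty=-100.0):
        self.env = env
        self.training = training
        self.failure_penalty = failure_penalty

    def eval(self):
        """Switch to evaluation mode: report raw environment rewards."""
        self.training = False

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        info["raw_reward"] = reward  # keep the unshaped reward for logging
        if self.training and done:
            reward += self.failure_penalty
        return obs, reward, done, info


class OneStepEnv:
    """Toy env: a single step that immediately terminates with reward 1.0."""
    def step(self, action):
        return 0.0, 1.0, True, {}


train_env = ShapingWrapper(OneStepEnv(), training=True)
eval_env = ShapingWrapper(OneStepEnv(), training=False)
_, r_train, _, _ = train_env.step(None)
_, r_eval, _, _ = eval_env.step(None)
print(r_train, r_eval)  # -99.0 1.0
```

Stashing the unshaped reward in `info` also lets you log the true score even during training, so a discrepancy like this one shows up immediately.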