Git Product home page Git Product logo

Comments (3)

gorgitko avatar gorgitko commented on June 24, 2024 1

Hi Kjell!

I had had the same questions as you and everything has been answered by keras-rl author: https://github.com/matthiasplappert/keras-rl/issues/38

In short:
Basically, in your game class (like board_gym.py) you need to implement step and reset methods. These are described in this abstract class: https://github.com/matthiasplappert/keras-rl/blob/master/rl/core.py#L533

To get info from the learning process you need some callback object. In this object (class) you implement several methods which are called by keras-rl on specific points during the learning process (or during testing/playing by agent). These methods are defined here: https://github.com/matthiasplappert/keras-rl/blob/master/rl/callbacks.py#L14
For example on_action_begin is called when agent does some chosen action.
You can use the default *Logger classes from https://github.com/matthiasplappert/keras-rl/blob/master/rl/callbacks.py . However I needed to get and print some specific information about my game (like real score) so I implemented my own Logger in callbacks.py.

from mi-mvi_2016.

Kjell-K avatar Kjell-K commented on June 24, 2024

Yes I see. I will just have to wrap my head around that.
The unclear part is maybe, that I do not have discrete states and observations like in a board game, but continues. In AirSim, I am extracting images and then take an action.
Therefore I do not really now how to time/clock the procedure.

from mi-mvi_2016.

gorgitko avatar gorgitko commented on June 24, 2024

Well, I don't have any experience with continuous environments. It's rather a question for keras-rl author -- I see he implemented some continous reinforcement learning methods, so ask him there :)

But I think I read something about this problem and the solution is to somewhat convert the continous space to discrete by averaging the extracted images.
Or maybe you can use recurrent NN (it should be implemented in keras-rl): each time step would be one extracted image.

from mi-mvi_2016.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.