Describe the bug A clear and concise deion of what the bug is.<

Trying to solve the simple chain environment using LSPI about mushroom-rl HOT 1 CLOSED

mushroomrl commented on September 21, 2024

Trying to solve the simple chain environment using LSPI

from mushroom-rl.

Comments (1)

boris-il-forte commented on September 21, 2024

This is not a bug, the behavior is expected. LSPI automatically computes the action features from the state features. You just need to set use input_size=1, as the state vector has only one component.

Remember also that for the simple chain there is no visualization, so you have to turn off the rendering.

However, you are not supposed to use the polynomial features for finite states, as they are specifically designed for vectors of continuous state-actions.
In our framework the index of the discrete state is not meant to have a meaning, it is just a label for the state. Of course, in a chain, this label is meaningful as it represents the order in the chain.
In general, if you really want to create features for finite-state MDPs, you can just create your own class, implementing the FeaturesImplementation (in features._implementations) interface. This will let you choose a reasonable set of features to the MDP, if you have previous information about the state distribution.

I hope this solves your issue.

from mushroom-rl.

Trying to solve the simple chain environment using LSPI about mushroom-rl HOT 1 CLOSED

Comments (1)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent