Pierre-Luc Bacon The project description suggests that RLPy is mai

Including Policy Gradient Techniques (rlpy issue, open)

rlpy commented on September 24, 2024

Including Policy Gradient Techniques

Comments (6)

smcgregor commented on September 24, 2024

Any updates on Policy Gradient methods? I am considering implementing a human-interpretable policy class in rlpy and Policy Gradient would likely match my needs.

I know this is a mirror of the Bitbucket repository (https://bitbucket.org/rlpy/rlpy/issues/25/including-policy-gradient-methods). Should we comment on the issue there? It is currently closed.

alborzgeramifard commented on September 24, 2024

We don’t have plans to add policy gradient techniques at the moment, but you should be able to extend the framework to support them.

Best,

Alborz Geramifard
Research Scientist | Amazon Echo
people.csail.mit.edu/agf

smcgregor commented on September 24, 2024

Thanks! If my implementation meets rlpy's quality threshold, would you like a pull request?

alborzgeramifard commented on September 24, 2024

Yup.

Best,

Alborz Geramifard
Research Scientist | Amazon Echo
people.csail.mit.edu/agf

vladfi1 commented on September 24, 2024

@smcgregor Any updates on this? I'd like to do policy gradient in rlpy.

smcgregor commented on September 24, 2024

@vladfi1 I think it is unlikely that I will be implementing this anytime soon. I've been running experiments that use probabilistic policies on top of RLPy, but we don't currently need RLPy to optimize the policy parameters.
