homework2's People
Forkers
zenspam maolin23 cedl739 chang810249 andrewliao11 chingyaoc s104061622 s2244521 andyhahaha eborboihuc nitahhhh yenmincheng0708 shihmengli gina9726 tommy-liu jakc4103 nccheng brade31919 cning colin700 ph81323 hiram94 fuenwang oscar-lu williamd4112 vyraun babooppa6 jackingchenhomework2's Issues
When I ran cells of HW2_Policy_Graident, it appeared 3 errors. But I couldn't find a solution.
ImportError Traceback (most recent call last)
in ()
2 import tensorflow as tf
3 import numpy as np
----> 4 from policy_gradient import util
5 from policy_gradient.policy import CategoricalPolicy
6 from policy_gradient.baselines.linear_feature_baseline import LinearFeatureBaseline
/home/haiyang/homework2/policy_gradient/util.py in ()
1 from gym.spaces import Box, Discrete
2 import numpy as np
----> 3 from scipy.signal import lfilter
4
5 def flatten_space(space):
ImportError: No module named 'scipy'
NameError Traceback (most recent call last)
in ()
2
3 # Construct a neural network to represent policy which maps observed state to action.
----> 4 in_dim = util.flatten_space(env.observation_space)
5 out_dim = util.flatten_space(env.action_space)
6 hidden_dim = 8
NameError: name 'util' is not defined
NameError Traceback (most recent call last)
in ()
3 path_length = 200
4 discount_rate = 0.99
----> 5 baseline = LinearFeatureBaseline(env.spec)
6
7 po = PolicyOptimizer(env, policy, baseline, n_iter, n_episode, path_length,
NameError: name 'LinearFeatureBaseline' is not defined
Solve at 109 iteration, am I right?
I solve at 109 iters, am I correct? (a little bit more than 80 iters)
.py version of HW2
Hi all, for people who suffer from using iPython, you can also download and work on this assignment using HW2_Policy_Graident.py provided here.
Basically, this file contains the same code as HW2_Policy_Graident.ipynb.
Assignment Submission
When submitting assignment, just add the newly downloaded HW2_Policy_Graident.py
file into your homework2
repo and push it to your forked repository.
Then, open a Pull Request to this repository.
Math Formula
If you are writing assignment in this manner, you can find math formula provided in the description of problem 2 here:
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.