yangsenius / learning-to-learn-by-pytorch
"Learning to learn by gradient descent by gradient descent" by PyTorch -- a simple re-implementation.
License: MIT License
@yangsenius Thanks for your helpful implementation. There seems to be a minor bug at runtime.
The line for CUDA configuration is redundant to avoid potential GPU utilization.
During training, GPU memory leaks: usage keeps increasing until the process runs out of memory. Can you figure out this bug?
Hello,
It seems that in the learn function, the variable name should be optimizer instead of optimizee, as the LSTM is an optimizer and not an optimizee. Can you clarify this?
I have some confusion here.
Thanks.
Rahul
Why is preprocess=False in the code? If it is changed to True instead, NaN appears.
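For reference, the gradient preprocessing described in the original paper can be sketched for a single scalar gradient as below. This is a minimal illustration, not the repository's code: the function name `preprocess_gradient` is hypothetical, and `p = 10` is the value the paper reports using.

```python
import math
from typing import Tuple

P = 10.0  # assumed scale parameter; the paper reports p = 10


def preprocess_gradient(grad: float, p: float = P) -> Tuple[float, float]:
    """Map one gradient value to the two-component input from the paper.

    Large gradients become (log|g| / p, sign(g)); small ones become
    (-1, exp(p) * g). The name and scalar interface are hypothetical.
    """
    threshold = math.exp(-p)
    if abs(grad) >= threshold:
        # The logarithm is only evaluated here, so grad == 0 never reaches it.
        return math.log(abs(grad)) / p, math.copysign(1.0, grad)
    return -1.0, math.exp(p) * grad


large = preprocess_gradient(1.0)
zero = preprocess_gradient(0.0)
negative = preprocess_gradient(-1e-3)
```

Because of the explicit branch, a zero gradient never reaches `log(0)`. A vectorized version that evaluates both branches and then masks the result can produce infinities or NaN at zero gradients. Whether that is the cause of the NaN reported here is an assumption, not something confirmed by the repository.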
When I run:
RuntimeError: can't retain_grad on Tensor that has requires_grad=False
Having read the original paper and the repo from DeepMind, it seems to me that the LSTM optimizer should take only one variable as input and keep a separate LSTM state for each variable. In other words, given an arbitrary number of parameters, it updates them one after another with the same network. In this implementation, however, the dimension of the optimizer is fixed to the number of optimizee parameters.
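The coordinatewise design described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the repository's or DeepMind's code: `shared_rule` is a hypothetical stand-in for the shared LSTM cell (a simple momentum update with made-up constants), and `CoordinatewiseOptimizer` is a hypothetical name. The point is that one rule with shared weights is applied to each parameter independently, with a separate state per coordinate, so the same optimizer works for any number of parameters.

```python
from typing import Callable, List, Tuple

Rule = Callable[[float, float], Tuple[float, float]]


def shared_rule(grad: float, state: float) -> Tuple[float, float]:
    """Hypothetical stand-in for the shared LSTM: (grad, state) -> (update, state)."""
    new_state = 0.5 * state + grad  # assumed momentum constant
    update = -0.1 * new_state       # assumed step size
    return update, new_state


class CoordinatewiseOptimizer:
    """Applies one shared rule to every coordinate, with per-coordinate state."""

    def __init__(self, rule: Rule) -> None:
        self.rule = rule
        self.states: List[float] = []

    def step(self, params: List[float], grads: List[float]) -> List[float]:
        # Create one state per coordinate on demand, so any parameter count works.
        if len(self.states) != len(params):
            self.states = [0.0] * len(params)
        new_params = []
        for i, (p, g) in enumerate(zip(params, grads)):
            update, self.states[i] = self.rule(g, self.states[i])
            new_params.append(p + update)
        return new_params


def quadratic_grads(params: List[float]) -> List[float]:
    # Gradient of f(theta) = sum(theta_i ** 2)
    return [2.0 * p for p in params]


opt = CoordinatewiseOptimizer(shared_rule)
theta = [1.0, -2.0, 0.5, 3.0]
for _ in range(100):
    theta = opt.step(theta, quadratic_grads(theta))
```

In a learned optimizer, `shared_rule` would be replaced by a small LSTM whose weights are shared across coordinates while the hidden state is kept per coordinate.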