Comments (3)
Hey Alex, can you provide a little more background to this? I've checked the discussion you link to, but what's exactly the motivation for not storing these particular Theano functions? After all, we're still keeping the _output_layer
around, which looks like it's the bulkier part. In fact, I've tried to pickle a small net and the difference in pickle size was neglectable. Maybe you've had different results with your nets? Or else, is there any other reason not to pickle these?
from nolearn.
I might start by turning the question around: is there any reason TO pickle these values? The network is fully specified without needing these functions (ie they are purely derived values from other values that are already being stored). In saving these functions, you are basically creating larger pickle files and storing no additional information.
The downsides of pickling though are: the generated function may be specific to the settings of the computer used to fit (ie CPU vs GPU, optimization levels) and you actually do want to re-compile the function on unpickling. A example of this is a fit on a GPU cluster, save the fits and then reload them on a machine with CPU only.
The more pressing issue is that some of the Theano function simply will not pickle using the default pickling library. Sure there is dill
, etc, but this seems like a hack in order to save a derived value. At the very least I would argue that the default is to just save the bear minimum to reload the net. If the user wants to "cache" the functions and save them (perhaps to avoid a recompile), that should be an opt in.
from nolearn.
Very good points. Could you check and see if the pull request in #37 sufficiently solves this issue for you?
from nolearn.
Related Issues (20)
- RememberBestWeights does not honor the verbose parameter HOT 2
- A replayable fit() method - diff/patch attached HOT 1
- remove('trainable') Lasagne's command doesn't work in nolearn HOT 6
- flip_filters and pad parameter not used by NeuralNet's class HOT 5
- OSError: could not read bytes when trying to fetch mldata HOT 2
- CUDA error, possibly related to network size? HOT 2
- Trained on GPU, inference on CPU doesn't make sense
- Install nolearn with Lasagne dependance not working HOT 2
- Bug in calculating average scores
- nolearn is not installing
- Bug when using Lasagne `mask_input` parameter
- 'NeuralNet' object has no attribute 'layers_' HOT 1
- Weights sum up to zero
- Future issue with sklearn.cross_validation
- Dependency on both backends in requirements.txt switches off GPU support HOT 3
- Enable to reproduce the last value of trainning when predicting CNN
- enable to reproduce loss value of training when predicting CNN HOT 1
- python 3 support not working with Lasagne? HOT 12
- TypeError: Failed to instantiate <class 'lasagne.layers.pool.MaxPool2DLayer'> with args {'name': 'pool1', 'ds': (2, 2), 'incoming': <lasagne.layers.conv.Conv2DLayer object at 0x7ff765fa29e8>}. Maybe parameter names have changed?
- nolearn now on conda-forge HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nolearn.