Comments (6)
Deep Learning can sometimes take a bit of time.
That's why we have a parallelized Hadoop YARN version:
https://github.com/jpatanooga/Metronome
which I'm working on moving over into:
https://github.com/agibsonccc/java-deeplearning/tree/master/hadoop-yarn
On Thu, Aug 7, 2014 at 1:59 AM, Samiur rahman wrote:
Training a Word2Vec model on a large corpus (approximately 3 billion
English words) takes a significant amount of time. On an 8-core Core i7
machine, it took approximately 24 hours to train. Distributing this will
significantly reduce training time.
from deeplearning4j.
I will also be creating a quick and dirty Akka version of this. It will be pretty straightforward. I have another engineer looking at GPUs this week; getting that incorporated should help a little with benchmarks. Unfortunately, we're still in the stabilization phase right now. Once we have a stable API and something easy to build on top of, fine-tuning (even with JNI if need be) will be pretty straightforward.
I surfaced one bottleneck that in retrospect was obvious. Check out
https://github.com/agibsonccc/java-deeplearning/tree/api-overhaul
when you get the chance. It's not distributed (I still have other things to work on), but it should be a performance improvement.
It looks like Apache Spark is adding Word2Vec to MLlib in Spark 1.1. It might be worth making the generated models compatible.
Update on this: I've implemented a general-purpose distributed system that will make neural nets a plugin. The word2vec plugin is almost done. I've already added the baseline feedforward nets and simpler things like word count and TF-IDF. Please check the 0.0.3.3 branch.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.