
Comments (6)

jpatanooga commented on May 10, 2024

Deep Learning can sometimes take a bit of time.

That's why we have a parallelized Hadoop YARN version

https://github.com/jpatanooga/Metronome

which I'm working on moving over into:

https://github.com/agibsonccc/java-deeplearning/tree/master/hadoop-yarn

On Thu, Aug 7, 2014 at 1:59 AM, Samiur rahman wrote:

Training a Word2Vec model on a large corpus (approximately 3 billion
English words) takes a significant amount of time. On an 8-core Core i7
machine, it took approximately 24 hours to train. Distributing this will
significantly reduce training time.


Reply to this email directly or view it on GitHub
#44.

from deeplearning4j.

agibsonccc commented on May 10, 2024

I will also be creating a quick-and-dirty Akka version of this. It will be pretty straightforward. I have another engineer looking at GPUs this week. Getting that incorporated should help a little bit with benchmarks and all that. Unfortunately, we're still in the stabilization phase right now. Once we have a stable API and something easy to build on top of, fine-tuning (even with JNI if need be) will be pretty straightforward.


agibsonccc commented on May 10, 2024

I surfaced one bottleneck that in retrospect was obvious. Check out

https://github.com/agibsonccc/java-deeplearning/tree/api-overhaul

when you get the chance. It's not distributed (still have other things to work on) but should be a performance improvement.


theclaymethod commented on May 10, 2024

It looks like Apache Spark is adopting this in MLlib for Spark 1.1. It might be worth making the generated models compatible.


agibsonccc commented on May 10, 2024

Update on this: I've implemented a general-purpose distributed system that will make neural nets a plugin. The word2vec plugin is almost done. I've already added the baseline feedforward nets and simpler things like word count/TF-IDF. Please check the 0.0.3.3 branch.


lock commented on May 10, 2024

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

