Git Product home page Git Product logo

awesome-h2o's Introduction

Awesome H2O Awesome Powered by H2O.ai

Below is a curated list of all the awesome projects, applications, research, tutorials, courses and books that use H2O, an open source, distributed machine learning platform. H2O offers parallelized implementations of many supervised and unsupervised machine learning algorithms such as Generalized Linear Models, Gradient Boosting Machines (including XGBoost), Random Forests, Deep Neural Networks (Deep Learning), Stacked Ensembles, Naive Bayes, Cox Proportional Hazards, K-means, PCA, Word2Vec, as well as a fully automatic machine learning algorithm (AutoML).

H2O.ai produces many tutorials, blog posts, presentations and videos about H2O, but the list below is comprised of awesome content produced by the greater H2O user community.

We are just getting started with this list, so pull requests are very much appreciated! 🙏 Please review the contribution guidelines before making a pull request. If you're not a GitHub user and want to make a contribution, please send an email to [email protected].

If you think H2O is awesome too, please ⭐ the H2O GitHub repository.

Contents

Blog Posts & Tutorials

Books

Research Papers

Benchmarks

Presentations

Courses

Software

License

CC0

To the extent possible under law, H2O.ai has waived all copyright and related or neighboring rights to this work.

awesome-h2o's People

Contributors

cooleel avatar drradan avatar jphall663 avatar katrinleinweber avatar ledell avatar mmalohlava avatar navdeep-g avatar shaz13 avatar spekulatius avatar tomkraljevic avatar vindows avatar woobe avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-h2o's Issues

pretraining a supervised deeplearner gives an error

in Rstudio, if you use an autoencoder with the exact same config parameters as below and then use the autoencoder as a pretraining for a deep learning classification, it will through an error of mismatch between weights (the length of weights). Below are the code I used and the error that I keep receiving.

I first use the autoencoder with below configuration, which works fine.

AE_model_0 <- h2o.deeplearning(x = features_no_categorical_data,
training_frame = pre_training_hf,
model_id = "AE_model_0",
autoencoder = TRUE,
ignore_const_cols = TRUE,
seed = 42,
hidden = c(200, 100, 200),
epochs = 100,
activation = "Tanh")

then, I use a classification with an autoencoder as a pretrainer for weights.

training_hf = as.h2o(mds %>% filter(usage == 'training'))

training_hf[,430] <- as.factor(training_hf[,430]) # for classification the label column (430) must be a factor.

DNN_model0 <- h2o.deeplearning(y = "v430",
x = features_no_categorical_data,
training_frame = training_hf,
pretrained_autoencoder = "AE_model_0",
balance_classes = TRUE,
ignore_const_cols = TRUE,
seed = 42,
hidden = c(200, 100, 200),
epochs = 100,
activation = "Tanh")

The below is the error I receive (I dont have any categorical variables in my features. I also enabled ignored_const_cols and I still receive the same error):

java.lang.IllegalArgumentException: Mismatch between weights in pretrained model and this model: cols in layer 0: 5716 vs 7019. Enable ignored_const_cols for both models and/or check categorical levels for consistency.

java.lang.IllegalArgumentException: Mismatch between weights in pretrained model and this model: cols in layer 0: 5716 vs 7019. Enable ignored_const_cols for both models and/or check categorical levels for consistency.
at hex.deeplearning.DeepLearningModelInfo.initializeFromPretrainedModel(DeepLearningModelInfo.java:462)
at hex.deeplearning.DeepLearning$DeepLearningDriver.buildModel(DeepLearning.java:238)
at hex.deeplearning.DeepLearning$DeepLearningDriver.computeImpl(DeepLearning.java:216)
at hex.ModelBuilder$Driver.compute2(ModelBuilder.java:173)
at hex.deeplearning.DeepLearning$DeepLearningDriver.compute2(DeepLearning.java:209)
at water.H2O$H2OCountedCompleter.compute(H2O.java:1349)
at jsr166y.CountedCompleter.exec(CountedCompleter.java:468)
at jsr166y.ForkJoinTask.doExec(ForkJoinTask.java:263)
at jsr166y.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:974)
at jsr166y.ForkJoinPool.runWorker(ForkJoinPool.java:1477)
at jsr166y.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:104)

Error: java.lang.IllegalArgumentException: Mismatch between weights in pretrained model and this model: cols in layer 0: 5716 vs 7019. Enable ignored_const_cols for both models and/or check categorical levels for consistency.
In addition: Warning message:
In .h2o.startModelJob(algo, params, h2oRestApiVersion) :
Dropping bad and constant columns: [v389, v390, v388].

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.