Git Product home page Git Product logo

ai-workflow-ibm's Introduction

Business Problem [ibm-ai-workflow]

Classifying movie reviews: a binary classification example

Two-class classification, or binary classification, may be the most widely applied kind of machine-learning problem. In this example, you’ll learn to classify movie reviews as positive or negative, based on the text content of the reviews.

References

[1] The IDMB raw dataset : http://mng.bz/0tIo

[2] https://github.com/pytorch/examples/blob/master/mnist/main.py

[3] https://scikit-learn.org/stable/auto_examples/linear_model/plot_sparse_logistic_regression_mnist.html#sphx-glr-auto-examples-linear-model-plot-sparse-logistic-regression-mnist-py

[4] https://towardsdatascience.com/unit-testing-and-logging-for-data-science-d7fb8fd5d217

[5] https://github.com/CoreyMSchafer/code_snippets/blob/master/Decorators/decorators.py

[6] https://scikit-learn.org/stable/auto_examples/neural_networks/plot_mnist_filters.html?highlight=mnist

[7] https://nextjournal.com/gkoehler/pytorch-mnist

Data Setttings

IMDB dataset: a set of 50,000 highly polarized reviews from the Internet Movie Database. They’re split into 25,000 reviews for training and 25,000 reviews for testing, each set consisting of 50% negative and 50% positive reviews.

Approach

Baseline

Traditional Techniques: Logistic Regression, Multilayer Perceptron

In this section, we use unittest to verify the accuracy and confusion matrix of method "Logistic Regression".

Deep Neural Network:

  • Recurrent Neural Network: LSTM

Conclusion

In this project, as you see, we did:

  • How to explore extensions to a baseline model to improve training and predicting capacity.

  • How to use unit tests for the API and the model

  • How to use unit tests for logging

  • Can all of the unit tests be run with a single script and do all of the unit tests pass?

  • How to monitor performance

  • How to compare multiple models

  • How to use visualizations for the EDA investigation

ai-workflow-ibm's People

Contributors

ngoctanle avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.