Git Product home page Git Product logo

movie-analysis's Introduction

Welcome to movie-analysis repository

About

This is a Mini-Project for CZ1115 (Introduction to Data Science and Artificial Intelligence) which focuses on movies from The Movie Database APIs. For detailed walkthrough, please view the source code in order from:

  1. Data Extraction
  2. Data Visualization
  3. Data Resampling and Splitting
  4. Logistic Regression
  5. Neural Network

Contributors

  • @nicklimmm - Neural Networks, Data Resampling, Data Extraction
  • @TCaken - Logistic Regression
  • @coolcoolwhat - Data Visualization, Data Extraction

Problem Definition

  • Are we able to predict if a movie is good (rating above 7.2) based on its attributes?
  • Which model would be the best to predict it?

Models Used

  1. Logistic Regression
  2. Neural Networks

Conclusion

  • Popularity and budget have low linear correlation value with ratings (watch out for bandwagons ๐Ÿคฃ)
  • Popularity of the casts and crews have higher linear correlation value with ratings
  • Resampling imbalanced data improved model performance especially on the minority class
  • Logistic Regression did not perform well with non-linearly correlated variables
  • Neural Networks along with SMOTEENN resampling method consistently did well in predicting good movies after 100 training attempts (around 72% accuracy, 70% recall)
  • Yes, it is possible to predict if a movie is good with acceptable amount of accuracy and recall

What did we learn from this project?

  • Handling imbalanced datasets using resampling methods and imblearn package
  • Neural Networks, Keras and Tensorflow
  • Logistic Regression from sklearn
  • API Usage
  • Other packages such as tqdm, json, requests
  • Collaborating using GitHub
  • Concepts about Precision, Recall, and F1 Score

References

movie-analysis's People

Contributors

nicklimmm avatar tcaken avatar coolcoolwhat avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.