Git Product home page Git Product logo

ml-life-cycle-with-lqbtq-friendly-tweet-classifier's Introduction

Machine Learning Life Cycle with LGBTQ-Friendly Tweet Classifier

This project walks through a machine learning life cycle from data exploration to model development and deployment by building a LGBTQ Friendly Tweet Classifier. This process is no mean linear but iterative.

graph LR
A((Data Exploration))-->B((Model Development))
B --> C((Deployment))
C --> D((Monitoring))
D --> A
Loading

Disclaimer: The accuracy of the deployment model needs a lot more improvement. I hope you don't use the result of the deployment model to harm the community but to iterate on it to make better models and projects.

Authors

@xingvoong

Data

The dataset and details about it can be found here.

Features

The final product is an LGBTQ tweet classifier that takes in a tweet to classify whether it is LGBTQ-friendly. The thought process for developing the model can be found in Notebooks.

Flow 1 enter image description here

Flow 2 enter image description here

Notebooks

There are 4 notebooks for this project. They explain the decision-making process through each phrase of an ML life cycle. At the end of each notebook, there is a summary section that concludes the takeaways of each notebook or each phrase.

Limitation and room for improvement

While the project succeeded in showcasing an ML life cycle, it has a lot of limitations and room for improvement. Some of which I can think of:

  • Same users are more likely to use the same languages for their tweets, therefor splitting data by users could decrease biases.
  • Right now, the final product only classifies whether a tweet is friendly. It could do more by making a guideline or suggestion on how to write friendly tweets.
  • Adding a feature to let users rate whether a prediction is good or bad, then taking that new data to train better models.

Run Locally

Clone the project

git clone https://github.com/xingvoong/ml-life-cycle-with-lqbtq-friendly-tweet-classifier/tree/main/notebooks

Go to the project directory

cd  ml-life-cycle-with-lqbtq-friendly-tweet-classifier

Creating and activating virtual environment

python3 -m venv env
source env/bin/activate

Installing needed packages

pip3 install -r requirements.txt

To run the prototype Flask app

At the project root directory, run:

python3 app.py

This will start the Flask development server, and you can access your application by navigating to http://127.0.0.1:5000/ in your web browser.

Acknowledgement

This project is made possible via the resources I am using and inspired by:

ml-life-cycle-with-lqbtq-friendly-tweet-classifier's People

Contributors

xingvoong avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.