Git Product home page Git Product logo

ml-troll-filter's Introduction

ML-troll-filter

A ML project for filtering spam and troll comments/emails.

DESCRIPTION

Are you tired from people flooding your website/platform with spam and deceitful comments? Well, you are not the only one.

If you were wondering what big platforms like Youtube, Tripavisor, and others have done to solve (or at list limit)
this issue, Well, two words: Machine Learning. This project will help you train your machine to recognize potential spam messages by feeding it 16541 snippets of spam/ham messages.

Scikit-learn is required (check dependencies below) to train our Naive Bayes classifier.

Dataset: We are using the first 3 parts of the Enron spam dataset (minimize processing time).

If you would like to add even more accuracy to the model, you can add more parts to the data dir.

DEPENDENCIES

  1. Python 3.7.+
  2. numpy
  3. $ pip install numpy
    
  4. scikit-learn
  5. $ pip install scikit-learn
    

INSTALLATION

  1. Clone this project
  2. $ git clone https://github.com/tahaHichri/ML-troll-filter.git
    
  3. Launch message keyword dictionary/features generations
  4. $ python classif.py 
    
  5. Check whether a message is a spam by passing it as an argument to check.py. The output should be "SPAM" or "NOT SPAM"
  6. $ python check.py "Hey, this is an example comment"
    

ml-troll-filter's People

Contributors

tahahichri avatar

Stargazers

 avatar  avatar

ml-troll-filter's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.