Git Product home page Git Product logo

svm-dc's Introduction

Build Status

Brief

Implementation of One-class SVM (Support Vector Machine) that uses
binary, frequency, tf-idf and hadamard representations for document classification.
Based strictly on a research paper by Larry M. Manevitz and Malik Yousef,
'One-Class SVMs for Document Classification'.

It includes a graphical user interface with choice of representation,
kernel, data cache, outlier detection and SVM view control.

In addition, it also displays measures (F1-measure, recall, precision). 
Input data is whole books that are split into chunks of 1000 lines each,
and cleaned of stopwords and special symbols automatically.

The idea of One-Class SVM was first published by Bernhard Schölkopf (1999),
who extended the SVM methodology to handle training using only positive information.

One-Class Classification is a special case of supervised classification,
where negative samples are absent during training, but may appear during testing.

All the data having the same label in the target class is equivalent to having no label.
Therefore, it can be considered unsupervised learning, and used as an outlier detection algorithm.

DISCLAIMER:
This project is non-profit and is intended to serve for educational purposes only.
It is not meant to infringe copyright rights by any means.
In case that any of the documents used are copyrighted,
please notify the repository owner and they will be removed.

Research Papers

Installing and Running

  • Clone the Project
git clone https://github.com/RazMalka/SVM-DC.git
cd SVM-DC
  • Install Dependencies in Anaconda CLI
conda install -c anaconda nltk
conda install -c anaconda scikit-learn
  • Execute from Anaconda CLI
cd ..
conda init
conda activate base
python main.py

Prerequisites and Libraries

  • VSCode (IDE)
  • Anaconda (Python3 Distribution)
  • Tkinter (GUI)
  • MatplotLib (Plotting)
  • Sklearn (SVM, Classification)
  • NLTK (NLP)
  • Numpy, SciPy, Pandas (Scientific Calculations)

Appendix Data

appendix1 appendix2

svm-dc's People

Contributors

razmalka avatar shohamyamin avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.