Git Product home page Git Product logo

machine-learning's Introduction

Machine-Learning Projects

This repository contains a collection of projects which focus on solving rudimentary Multi class classification, Natural language processing and Vision problems. The projects explore the application of classical Machine Learning algorithms to solve the problems. The implementations primarily use only the Numpy and Pandas libraries. The implementation of the framework from scratch provides a great environment to experiment and learn from the code. The following algorithms have been used in the projects:

  1. Multi Nomial Naive Bayes Classifier
  2. Logistic Regression
  3. K Nearest Neighbour
  4. Decision Trees
  5. Multi Layer Perceptron

The repository uses publicly available datesets which have been extensively studied in literature. The datasets used are given below:

  1. Wisconsin Breast Cancer dataset (https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic))
  2. Hepatitis (http://archive.ics.uci.edu/ml/datasets/Hepatitis)
  3. Twenty News Group Dataset
  4. IMDB Reviews Dataset(http://ai.stanford.edu/ ̃amaas/data/sentiment/)
  5. MNIST Dataset

The projects explore the effect of changing the hyperparameters and the architecture of the models on the performance. The repository is intended to be a playground for students who would like to change different aspects the models to understand the finer nuances. The author has tried to write the code in a clear, lucid and modular manner to enable effficient implementation of the code. However, there is a scope to further streamline and improve the code. The authors have run a limited set of experiments to to understand the effect of changing the hyperparameters on the model performance. Further tuning the hyperparameters is expected to improve the peformance of the models.

All the codes can be run on the CPU and provide satisfactory performance. The Multi Layer perceptron code provides the option of runnning the code on GPU. The code uses CuPY which is the GPU version of Numpy. Also, the natural language processing code requires a large amount of memory to store the word tokens. The primary focus of the projects is to improve the understanding of the models and thus, the code does not provide a comparison with literature.

machine-learning's People

Contributors

abhiroopbhattacharya avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.