Git Product home page Git Product logo

machinelearning-cs584's Introduction

MachineLearning-CS584

Coursework from CS584, Machine Learning @ IIT

1. Programming Assignment 1 (HW1)

Topics: Decision Trees, Accuracy, Scikit-Learn.

Tasks: Listed in the code file. Create a test/train split for each dataset using part of student ID as the random state. For depths d ranging from 1 to 15, create decision trees of max depth d, fit the classifier to the training data, and compute accuracy on test/train splits. Record results and discuss observations.

2. Programming Assignment 2 (HW2)

Topics: Naive Bayes, Parameter Estimation, Joint Log Likelihood, Scikit-Learn.

Tasks: Load the datasets and create a test/train split for each using part of student ID as the random state. For 15 different values of alpha ranging from 10^-6 to 10^7: (1) create a Bernoulli Naive Bayes model with that alpha, (2) fit the model to the training set, (3) Compute JLL for training set and store it, (4) Compute JLL for testing set and store it. Store results, pickle results, discuss.

3. Programming Assignment 3 (HW3)

Topics: Logistic regression, L1 vs L2 Regularization, Scikit-Learn

Tasks: Load the datasets from HW2 and create test/train split for each same as before. For 15 different C values from 10^-7 to 10^7: (1) Create L2 regularized logistic regression classifier, (2) fit the model to the training set, (3) compute model complexity and store it, (4) compute number of 0 weights, store it, (5) compute CLL for train set, store it, (6) compute CLL for test set, store it, (7) Create L1 regularized logistic regression model, (8) fit to training data, (9) compute number of 0 weights, store it.

4. Final Project (Final_Project)

Topics: K-Nearest Neighbors, Support Vector Machines, Principal Components Analysis, Confusion Matrices, F1, Neural Networks, Convolutional Neural Networks, Transfer Learning, Keras, Tensorflow.

Tasks: Detailed in the project_writeup.pdf.

Files:

File Description
ProjectFile.ipynb Main python code notebook for the project
project_writeup.pdf Information about the project requirements, phases, and results
bush_test_66.model Phase 3 Bush CNN
williams_test_94.model Phase 3 Williams CNN
bush.model Phase 4 Bush CNN
williams.model Phase 4 Williams CNN
yaledata_cleaned_pickled Phase 4 Transfer Learning Data

Notes:

  • The primary data files were too large to upload, happy to provide a link.
  • A lot of trial and error went into the testing of various models. The code here only shows the final results.
  • The code in the ProjectFile.ipynb file was written over the course of several weeks on multiple machines. It may not work perfectly out the box depending on the Python, Keras, and TensorFlow versions.

machinelearning-cs584's People

Contributors

philiphossu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.