Git Product home page Git Product logo

TDI-Project

Overview

This project is the first part of a larger project that aims to predict sentiment from Twitch chatrooms. Our goal here is to provide a simple example of how this can be done. We proceed by cleaning our data and using a two different models: a bidirection LSTM neural network and a branched neural network. The bidirectional LSTM NN is a rough implementation of what is described in this article and the branched model is used in this article. In the future, we will further explore model selection, optimize parameters, and develop a user interface to make the analysis accessible and digestible.

Data

The data used and its documentation can be found here.

Tokenizer

We use the CMU Noah's Ark tokenizer, called twokenizer. It was initially developed for twitter, and it is suggested in Barbieri's article (linked above) that a modified version can be useful for Twitch. The twokenize.py file was downloaded from here.

Cleaning data

We include a few data cleaning functions in preprocessing.py and emotes.py. We use an API in order to get a list of emotes from Twitch for each streamer. Since the amount of data is very large, we can only access the list of emotes from one channel at a time.

Evaluation of models

There is a short analysis at the end of each notebook evaluating the accuracy of the corresponding model.

Daniel Levine's Projects

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.