
twit-miner

twit-miner is an attempt to build a movie recommendation application. It also keeps a global score for each movie, which is incremented by 1 when a user likes the movie and decremented when a user hates it.

How does this work?

It works by searching Twitter for a list of recent movies. Tweets containing movie titles are classified as like/hate using simple word matching (no magic is going on here). When a matching tweet is found, the user's other tweets are searched for further movie titles to build a like/hate profile of that user for the list of movies.
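The word matching described above could look roughly like the following sketch. The word lists and the function name classify_tweet are hypothetical and chosen for illustration only; the project's real lists and code are not shown in this README.

```python
import re

# Hypothetical word lists; the real project may use different ones.
LIKE_WORDS = {"love", "loved", "great", "awesome", "like", "liked"}
HATE_WORDS = {"hate", "hated", "awful", "terrible", "boring", "worst"}


def classify_tweet(text, movie_titles):
    """Return {title: +1 or -1} for each known movie title found in the tweet.

    An empty dict means no title matched or the sentiment was unclear.
    """
    lowered = text.lower()
    words = set(re.findall(r"[a-z']+", lowered))
    likes = len(words & LIKE_WORDS)
    hates = len(words & HATE_WORDS)
    if likes == hates:
        return {}  # no clear like/hate signal
    vote = 1 if likes > hates else -1
    # Plain substring matching on the title, nothing smarter.
    return {title: vote for title in movie_titles if title.lower() in lowered}
```

Running the same function over all other tweets of a user and merging the results would give that user's like/hate profile.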

If a user has mentioned only a single movie in their Twitter history, the user isn't imported, since a single-movie profile doesn't help us give recommendations. The total global "score" of the movie is still updated, though.
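A minimal sketch of this import rule, assuming each user's votes are already available as a dict of +1 (like) and -1 (hate) per movie title. The function name import_profiles and the data layout are assumptions, not the project's actual code.

```python
from collections import Counter


def import_profiles(user_votes):
    """Split raw votes into imported profiles and global movie scores.

    user_votes maps a user name to {movie title: +1 or -1}.
    """
    global_scores = Counter()
    profiles = {}
    for user, votes in user_votes.items():
        # Every vote counts toward the global score, imported or not.
        for title, vote in votes.items():
            global_scores[title] += vote
        # Users who mention only one movie are not imported.
        if len(votes) > 1:
            profiles[user] = dict(votes)
    return profiles, global_scores
```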

The actual classification is done by a math magic trick: Latent Semantic Indexing.
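To illustrate the idea, here is a small sketch of Latent Semantic Indexing on a users x movies matrix using NumPy's SVD. It assumes +1 for like, -1 for hate and 0 for unknown; the function names and the choice of rank k are hypothetical, and the project's own implementation may differ.

```python
import numpy as np


def predict_scores(ratings, k=2):
    """Return the rank-k approximation of the users x movies matrix."""
    ratings = np.asarray(ratings, dtype=float)
    U, s, Vt = np.linalg.svd(ratings, full_matrices=False)
    k = min(k, len(s))
    # Keep only the k strongest latent "taste" dimensions.
    return (U[:, :k] * s[:k]) @ Vt[:k, :]


def recommend(ratings, user_index, k=2):
    """Return indices of unrated movies for one user, best guess first."""
    ratings = np.asarray(ratings, dtype=float)
    approx = predict_scores(ratings, k)
    unrated = np.flatnonzero(ratings[user_index] == 0)
    return sorted(unrated.tolist(),
                  key=lambda j: approx[user_index, j],
                  reverse=True)
```

The low-rank approximation fills the unknown (zero) entries with values derived from users who have similar like/hate patterns, which is what makes it usable for recommendations.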

The SVD matrix creation and the Twitter import are handled by the launch.py script in the project root directory. You must set the DJANGO_SETTINGS_MODULE environment variable to "settings" before launching the script. The script tries to prune "unnecessary" user data in order to keep the SVD matrix small. If all data were stored, the computation could fail with a MemoryError on a smaller VPS.
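One way to start the script with the required environment variable from Python is sketched below. The helper names launch_env and launch_command are hypothetical; only launch.py and the DJANGO_SETTINGS_MODULE value come from this README.

```python
import os
import sys


def launch_env(base_env=None):
    """Return a copy of the environment with DJANGO_SETTINGS_MODULE set."""
    env = dict(os.environ if base_env is None else base_env)
    env["DJANGO_SETTINGS_MODULE"] = "settings"
    return env


def launch_command(script="launch.py"):
    """Return the command that runs the script with the current interpreter."""
    return [sys.executable, script]


# To actually run it from the project root directory:
#   import subprocess
#   subprocess.run(launch_command(), env=launch_env(), check=True)
```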

How can I run the tests?

The tests can be run with modipyd.

What are the dependencies?

There are a lot of dependencies for running twit-miner:
