Git Product home page Git Product logo

movie-review-preference-analysis's Introduction

cs5780_kaggle_competition

https://www.kaggle.com/competitions/movie-review-preference-analysis

Together with @Suphakrit Lertkitcharoenvong.

Sentiment analysis is a fascinating field in natural language processing (NLP) that deals with understanding the emotions conveyed in written text. As a natural extension to this established problem, in this competition, a model is built to automatically detects the more positive review in any pairs of movie reviews.

Specifically, for every pair of reviews (r0, r1), we are given the feature vectors (E(r0), E(r1)) encoded by a language model. The goal is to build a model that can predict y, the binary label representing the "preference" between these two reviews. We predict 1 if r0 is more positive than r1 and 0 otherwise. We use the Accuracy of the test data as the evaluation metric.

The best strategy: We used sklearn.svm.SVC with a rbf kernel. For data preparation we combined emb1 and emb2 and the corresponding labels. We used cross validation set to 5 instead of splitting into training/validation sets. Performed several GridSearchCV to narrow down the best parameters and settled on 'C'=2.5 and 'gamma'=2.25. During the training we also enabled predict_proba, this allowed us to resolve conflicts when emb1 and emb2 had the same prediction - we would go with whichever movie review had the higher confidence.

movie-review-preference-analysis's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.