Git Product home page Git Product logo

naturallanguageprocessing_intro's Introduction

NaturalLanguageProcessing_Intro

This is course of Northeastern University Lecture assignment.

  • Assignment 1: Sentiment Analysis on Movie Reviews
  • Assignment 2: Text Classification Project with TF-IDF Vectorization
  • Assignment 3(On working): Implementing Skipgram and CBOW Algorithms

As1 Sentiment Analysis on Movie Reviews

๐Ÿ“ข Introduction: In this project, I've tackled the exciting challenge of sentiment analysis, specifically focusing on movie reviews. Movies have always been a passion of mine, but diving deep into what people think about them? That's an adventure in itself!

๐ŸŽฅ About the Dataset: The dataset I've worked with comprises both positive and negative movie reviews. The contrast between these reviews, the emotions, the passion, and sometimes the sarcasm, provided a rich ground for analysis.

๐Ÿ” Sentiment Analysis: Sentiment analysis is more than just finding positive or negative words in a text. It's about understanding context, tone, and underlying feelings. My goal was to assess if a given review praised the movie or was critical of it.

๐Ÿ”ง Tools & Methodology: For this analysis, I leveraged the Naive Bayes algorithm. Naive Bayes, with its probabilistic approach, proved to be quite effective in discerning the sentiments embedded within the reviews.

๐ŸŽฌ Outcome: Tapping into the world of movies and extracting sentiments from reviews was both challenging and rewarding. Through this project, I got a glimpse into the world of critics and movie enthusiasts alike, understanding what makes a movie resonate or fall flat for them.

๐Ÿ™Œ Gratitude: A big thank you to everyone who has contributed to the dataset and to those who are checking out this project. It was a roller coaster of emotions (pun intended), and I'm thrilled to share this journey with all of you!

As2 Text Classification Project with TF-IDF Vectorization

๐Ÿ“ข Introduction: In this project, I've dived deep into the realms of text classification leveraging the power of Multinomial Naive Bayes and Neural Networks. Unlike traditional Bag of Words methods, I chose to employ TF-IDF Vectorization, ensuring a more nuanced representation of text data.

๐Ÿ“Š Visualization: Besides just classification, I've also dedicated efforts to visualize these vectors. It's truly intriguing to see how different text data points relate to one another in a multidimensional space.

๐Ÿ“ Datasets: I've worked with some large datasets in this endeavor. These aren't your typical few-hundred rows of data - they are massive! And that's what made this project even more interesting (and a tad bit challenging).

โฐ Execution Time: Given the size of the datasets and the computations involved, certain parts of the project can take a considerable amount of time to run. Hence, patience is key. If you're planning on running the code or testing it out, maybe go grab a coffee in the meantime.

๐Ÿš€ Recommendation: I recommend starting with the exploration and preprocessing of these datasets a tad bit early if you're planning to replicate or extend this work. This isn't just due to the execution time but also because understanding the nuances of such vast data takes time.

๐Ÿ”— Tools & Libraries Used:

TF-IDF Vectorization Multinomial Naive Bayes Neural Networks And a few more... ๐Ÿ™ Acknowledgements: I want to give a shout-out to all the open-source tools and libraries that made this project possible. And of course, to all of you who are checking out this project or finding it useful in any capacity!

naturallanguageprocessing_intro's People

Contributors

andyfcui avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.