Git Product home page Git Product logo

restaurant-recommender-system-aws-hadoop-mapreduce's Introduction

Restaurant Recommendation Enginer - Content Based and Personalized (Yelp Dataset)

Description

  • Developed a content based recommender that recommends restaurants to the users.
  • Extracted, pre-processed, and cleaned the data related to restaurants from Yelp academic dataset.
  • Implemented mapreduce design patterns like filtering, summarization, data organization, and join patterns to perform analysis such as top restaurants by country and state, total restaurants by country and state, moving average rating of restaurants, top restaurants by positive reviews, minimum and maximum review count of each restaurants, etc.
  • Performed sentiment analysis of the reviews about the restaurants given by Yelp users.
  • Calculated the pearson correlation, jaccard correlation and cosine correlation between restaurants to recommend to users.
  • Performed bining to split the data source on the basis of a preset value of a column and bloom filtering to filter the restaurants on basis of cities they are located in.
  • Deployed the project on AWS EC2 with 4 instances comprising of a namenode, a secondary namenode and two data nodes to achieve high scalability and performance.
  • Visualized the analysis in PowerBI.

Code

  1. Average rating and total restaurants by cuisine
  2. Content based recommendation
  3. Elite users based on useful votes
  4. Minimun maximum total review count
  5. Restaurants by star
  6. Restaurant search using bloom filtering
  7. Sentiment analysis of user reviews
  8. Sentiment analysis of user reviews by restaurants
  9. Simple moving average rating of restaurants
  10. Tip at restaurants
  11. Top 10 restaurants by positive reviews
  12. Top restaurants by state
  13. Total and average rating of restaurants by country
  14. Total restaurants by state

Data Preprocessing and Cleansing

Data Visualization

  • Total Restaurants by Cuisine

  • Top 10 Restaurants

  • Average Rating of Restaurants by Cuisine

  • Positive and Negative Review Count of Restaurants

  • Moving Average of a Restaurant

Programming Language

Java, R

Technologies

Hadoop, HDFS, MapReduce, AWS EC2, Ubuntu

Tools/IDE

Eclipse, RStudio, WinSCP, Putty, PuttyGen, PowerBI

restaurant-recommender-system-aws-hadoop-mapreduce's People

Contributors

agrawal-priyank avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.