Amazon_Vine_Analysis

Overview

Big Market helps businesses optimize their marketing efforts. A client is preparing to release a large catalogue of products and wants to know whether enrolling in a program that gives free products to select reviewers is worth the cost. This exercise is an introduction to Big Data, its handling systems, and its processes, using some of the leading associated technologies: Hadoop, MapReduce, and PySpark. It also touches on Natural Language Processing (NLP) and on cloud services such as Amazon Web Services (AWS) and how they support Big Data work. The project uses AWS Simple Storage Service (S3) and relational databases for basic cloud storage to analyze a set of Amazon customer reviews.

Data environment:

  • AWS: S3, RDS
  • PostgreSQL
  • Google Colaboratory
  • PySpark

Results:

Out of 918,702 reviews:

  • There are 20,314 Vine reviews
    • 8,079 of the Vine reviews were 5-star
    • 39.8% of the Vine reviews were 5-star
  • There are 898,388 non-Vine reviews
    • 267,089 of the non-Vine reviews were 5-star
    • 29.7% of the non-Vine reviews were 5-star
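
The percentages above follow directly from the counts. As a minimal sketch in plain Python (not the project's PySpark code), the arithmetic can be reproduced as shown below; the helper name `five_star_share` is hypothetical and used only for illustration.

```python
def five_star_share(five_star_count: int, total_count: int) -> float:
    """Return the percentage of reviews that are 5-star, rounded to one decimal."""
    if total_count <= 0:
        raise ValueError("total_count must be positive")
    return round(100 * five_star_count / total_count, 1)


# Counts copied from the results list above.
vine_total, vine_five_star = 20_314, 8_079
non_vine_total, non_vine_five_star = 898_388, 267_089

vine_pct = five_star_share(vine_five_star, vine_total)
non_vine_pct = five_star_share(non_vine_five_star, non_vine_total)

# The two group totals should add up to the overall review count.
print(f"Total reviews: {vine_total + non_vine_total:,}")
print(f"Vine 5-star share: {vine_pct}%")
print(f"Non-Vine 5-star share: {non_vine_pct}%")
```

Checking that the two group totals sum to the overall review count is a cheap way to confirm that no reviews were dropped or double-counted when the data was split.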

Summary

Is there any positivity bias for reviews in the Vine program?

Yes. Although the non-Vine group is far larger (898,388 reviews, 267,089 of them 5-star), only about 30% of its reviews were 5-star. In the Vine group, about 40% of reviews were 5-star, roughly ten percentage points higher. On its face, this points to a positivity bias among participants in the paid reviews program. Further analysis found that non-Vine reviews were 43% positive (4-5 stars) and 46% negative (1-2 stars), whereas Vine reviews were 62% positive and 6% negative. Looking at the full range of ratings (1-5 stars) rather than only the 5-star ratings, the statistics still indicate a bias toward positive ratings among incentivized reviews.
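
One way to put a number on "statistically speaking" is a two-proportion z-test on the 5-star counts reported in the results. The original analysis does not include a significance test, so the sketch below is an illustrative addition under that assumption; the function name `two_proportion_z` is hypothetical.

```python
import math


def two_proportion_z(success_a: int, n_a: int, success_b: int, n_b: int) -> float:
    """z statistic for the null hypothesis that both groups share one proportion."""
    p_a = success_a / n_a
    p_b = success_b / n_b
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (success_a + success_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    return (p_a - p_b) / std_err


# 5-star counts and group sizes: Vine first, then non-Vine.
z = two_proportion_z(8_079, 20_314, 267_089, 898_388)

# One-sided p-value from the standard normal upper tail, P(Z > z).
p_value = 0.5 * math.erfc(z / math.sqrt(2))

print(f"z = {z:.1f}, one-sided p = {p_value:.3g}")
```

With samples this large, even small differences come out as significant, so the size of the gap in percentage points is the more informative figure. A significant difference also does not show that the program itself causes the higher ratings, since products enrolled in Vine may differ from those that are not.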
