abidor13 / amazon_vine_analysis Goto Github PK
View Code? Open in Web Editor NEWGiven access to approximately 50 datasets, each containing reviews of a specific product and written by members of the paid Amazon Vine Program. We used PySpark to perform the ETL process to extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into PgAdmin.