In this project scenario, students will use actual YELP and climate datasets in order to analyze the effects the weather has on customer reviews of restaurants. The data for temperature and precipitation observations are from the Global Historical Climatology Network-Daily (GHCN-D) database. Students will use a leading industry cloud-native data warehouse system called Snowflake for all aspects of the project.
Students will then apply the skills they have acquired in the preceding Designing Data Systems Course to architect and design a Data Warehouse DWH for the purpose of reporting and online analytical processing (OLAP).
YELP DATA Navigate to the Yelp Dataset [https://www.yelp.com/dataset/download] (opens in a new tab), then enter your details and click Download
On this page download 2 files “Download JSON” and “COVID-19 Data”
Note: The COVID-19 Data is currently not available for download on Yelp dataset page. You can download the COVID-19 dataset from this [https://www.kaggle.com/datasets/claudiadodge/yelp-academic-data-set-covid-features].
If you get an error code, go back to the page where you entered your details, click on download again try again
Note: Use single word filenames when you save. This will make it easier when loading into the database in later steps.
These are compressed files that will need to be uncompressed with the tool of your choice