Git Product home page Git Product logo

course-dprep / airbnb-effect-of-price-on-reviews Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 1.0 1.38 MB

This repository was made by Denise, Max, Rick and Sjuul. The project focusses on ´Airbnb Analysis of Private Rooms and the Effects Of Reviews´. The project is supervised by Hannes Datta, professor at Tilburg University as part of the course ´Data Preperation & Workflow Management'.

R 87.95% Makefile 12.05%

airbnb-effect-of-price-on-reviews's People

Contributors

denisebal avatar github-classroom[bot] avatar maxbaaten avatar rickmassuger avatar sjuulvisschers avatar

Watchers

 avatar

Forkers

denisebal

airbnb-effect-of-price-on-reviews's Issues

1.1 Research Motivation

Maandag 11 september af

The research question is clearly articulated and important. The choice for the research method (e.g., regression analysis) is motivated well. The way of deployment (e.g., PDF report, dashboard, …) is useful and accessible to potential knowledge users, and clearly communicates the conclusions of the analysis. The automated and reproducible workflow is of potential use to other students and the larger scientific community.

Learn how to use make

Take a deep dive into the Make pipeline and learn how it works so we can apply this to our project in order to automatize the code

feedback on repos

  • all issues still relevant?
  • file names all in lower caps
  • makefiles incorrect: need file names as target, not a self-chosen name (unless it's your first rule)
  • dependency structure incorrect (i.e., prerequisites need to be added)
  • final rule (all) only needs to have the "final" data set to build, not all
  • make needs to realize it's "done".
  • clean legacy files - they are stored in your file version history anyways
  • explicitly choose which files to keep (e.g., R vs. Rmd)
  • output needs to be stored in gen, not in a subfolder of src.
  • remove legacy files

Preliminary research question

To what extent do the top 25% most expensive single rooms get more reviews than the bottom 25% of single rooms compared across (5?) European capitals on a quarterly time base? An analysis of multiple cities and time-frames.

Merge this dataset with the reviews dataset AIRBNB

[src_download_data.Rmd], this dataset is done in a way that it contains all the data of the 100 most cheap and 100 most expensive private rooms in 5 capital cities. we need to merge this with the reviews (amount of reviews) in R. please write code for this

Complete Readme

type of analysis
conclusion analysis
how to run the project

1.2 Repository structure and documentation (10%)

Maandag 11 september af

1.2 Repository structure and documentation (10%)
The end-to-end workflow, kickstarted with one of the workflow templates available at Tilburg Science Hub, is made publicly available on GitHub. The repository contains a readme.md (in markdown format, so that it renders well on GitHub), which clearly explains the project’s goal, and provides instructions to potential contributors/replicators on how to run the project. The project has a concise and accurate name, enticing the potential user to explore the workflow. An appropriate short name for the repository’s location is chosen (e.g., github.com/yourusername/investigating-airbnb). Additional metadata on GitHub is provided (e.g., a short project description), so that the repository feels and looks professional and complete.

Tip: Structure your README.md

The readme is not a copy of your way of deployment (e.g., your report), but links to it. Therefore, only summarize the main results in your readme (e.g., in bullet point format), or include screenshots (e.g., to your app, to your “best” figure).

More information on how a proper readme should look like can be found at Tilburg Science Hub.

fix_charts

save the barcharts somewhere as output of research

merge review data and listings data

in folder "src/data-preperation/src_download_data.rmd". this is the file that contains the code to download all the data for the listings. also make such a code for the download of review data and merge these two datasets.

that file contains data for the top 100 and bottom 100 private rooms of all the cities!

improve gitlog

add clear descriptions to your commits when solving issues

update README

the README has to be updated, with more details than how it previous was.

Divide the code into chunks

divide the code into chunks according to the ITO methods, so we can run these code chunks seperately through make.

use dplyer or tidyverse

in our advanced code, we need to make use of data packages in rmarkdown to show our capabilities

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.