
kennethcandersen / citibike-strategic-analysis-tableau


Strategic data analysis and visualization of Citibike's evolution and future growth in Tableau.

Home Page: https://public.tableau.com/views/CitibikeDashboard_16283828004620/TheStory?:language=en-US&publish=yes&:display_count=n&:origin=viz_share_link

License: MIT License

Jupyter Notebook 100.00%


Citibike Strategic Growth Analysis Dashboard in Tableau

Project summary

Project Goal

Create a dashboard in Tableau for New York City's Citibike stakeholders that demonstrates the evolution of the system and strategic trends for future development.

How to View the Visualization

Options:

  1. Download the "Citibike Dashboard.twbx" file and open it in Tableau.
  2. View the visualization online at Tableau's website.

Executive Summary

Conclusions

  1. Overall ridership grew until the pandemic, although the year-over-year (YoY) growth rate has been decreasing since 2016.
  2. The average number of rides per station remained relatively stable, despite the aggressive expansion in stations.
  3. Women remain underrepresented among riders, but there is progress: in 2020 women represented 28.4% of riders, versus 20% in 2013. In peak months their share exceeds 30%.
  4. The percentage of Short-Term Customers has been increasing over the last few years to over 25% of the overall user base. Short-Term Customer use is highly seasonal, likely reflecting summertime tourism.
  5. The average rider age is now 41 years old, versus 38 in 2013. The two age groups whose share of riders increased are ages 50-59 and 60-69. This could be good news (the program is attracting a diversity of ages) or potentially bad news (the user base is aging and the program is failing to attract younger riders). More evaluation is needed.
  6. Rider intensity is greatest in Central to Southern Manhattan. All top-20 stations are located there. Although the system has grown substantially over the years, Manhattan remains the main hub of ridership.
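
The first conclusion distinguishes total ridership from its YoY growth rate, and the two can move in opposite directions. A minimal Python sketch of the arithmetic follows. The function name `yoy_growth` and the yearly totals are placeholders chosen for illustration; they are not taken from the notebook or from the Citibike data.

```python
def yoy_growth(rides_by_year):
    """Return {year: fractional growth versus the previous year}."""
    growth = {}
    for year in sorted(rides_by_year):
        prev = rides_by_year.get(year - 1)
        if prev:  # skip years with a missing or zero prior-year total
            growth[year] = (rides_by_year[year] - prev) / prev
    return growth


# Placeholder totals, NOT actual Citibike figures.
example = {2016: 100, 2017: 120, 2018: 132}
print(yoy_growth(example))
```

With these placeholder numbers the total rises every year while the growth rate falls from 20% to 10%, which is the pattern the conclusion describes.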

ETL process

Extract

Transform

  • All of the monthly CSV files were cleaned and concatenated (merged) using Python and Pandas in Jupyter Notebook to export one master data file. It had 122 million rows and was 22 GB in size.
  • Given the size of the dataset, I created a one-in-a-hundred subset, resulting in a 1% sample of 1.2 million data points and a 250 MB file.
  • The sample data file is still too large to store on GitHub. You can view a "one in thousand" dataset to see how the data was structured, although this file was not used for the visualization.
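
The concatenate-then-subsample steps above can be sketched roughly as follows. This is not the notebook's code: the project used Pandas, while this sketch swaps in the standard library `csv` module so that rows are streamed and a 22 GB file never has to fit in memory. It assumes every monthly file shares the same header row, and it keeps every n-th row; the README does not say whether the 1% subset was taken that way or at random. The function name `concat_and_sample` and all paths are hypothetical.

```python
import csv
from pathlib import Path


def concat_and_sample(csv_dir, master_path, sample_path, every_n=100):
    """Merge monthly CSVs into one master file and keep every n-th row as a sample.

    Assumes all monthly files share the same header (an assumption, not
    confirmed by the source). Returns (total_rows, sampled_rows).
    """
    outputs = {Path(master_path).resolve(), Path(sample_path).resolve()}
    total = sampled = 0
    header = None
    with open(master_path, "w", newline="") as m, open(sample_path, "w", newline="") as s:
        master, sample = csv.writer(m), csv.writer(s)
        for path in sorted(Path(csv_dir).glob("*.csv")):
            if path.resolve() in outputs:
                continue  # never read our own output files back in
            with open(path, newline="") as f:
                reader = csv.reader(f)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # skip empty files
                if header is None:
                    # Write the header once, taken from the first file.
                    header = file_header
                    master.writerow(header)
                    sample.writerow(header)
                for row in reader:
                    if total % every_n == 0:
                        sample.writerow(row)
                        sampled += 1
                    master.writerow(row)
                    total += 1
    return total, sampled
```

Calling it with `every_n=1000` would produce a "one in thousand" file in the same way.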

Load

  • Data was loaded into Tableau as one master CSV file and the dashboard was created.

Languages & Tools Used

  • Python, Pandas and Jupyter Notebook for the data extraction & cleanup
  • Tableau Public for the visualization
