This project visualizes work opportunities in the US based on the job openings posted on USF Handshake.
- Open a Command Prompt (cmd) in the folder where the Google Chrome application is installed, then run the following command:
chrome.exe --remote-debugging-port=8000 --user-data-dir="D:\Chromedata"
- In the new Google Chrome window, log in to USF Handshake and open Job Postings.
- Use handshake_scrapping.ipynb to scrape the website.
NOTE: A poor Internet connection can interrupt scraping, so it is recommended to run the scraper several times and concatenate the resulting files.
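As a rough illustration of the concatenation step, the sketch below merges the outputs of several scraping attempts and keeps each distinct row only once. It assumes each attempt was saved as a CSV file with the same header row. The function name `merge_scrapes` and the file names in the usage note are hypothetical and are not part of the notebooks. It uses only the Python standard library; the same idea can be expressed with pandas inside the notebook.

```python
import csv


def merge_scrapes(paths, out_path):
    """Concatenate CSV files that share a header, keeping each distinct row once.

    Returns the number of data rows written (header excluded).
    """
    seen = set()      # rows already written, used to drop exact duplicates
    header = None     # header taken from the first non-empty file
    merged = []

    for path in paths:
        with open(path, newline="", encoding="utf-8") as f:
            reader = csv.reader(f)
            file_header = next(reader, None)
            if file_header is None:
                continue  # empty file, e.g. from an interrupted attempt
            if header is None:
                header = file_header
            elif file_header != header:
                raise ValueError(f"{path} has a different header: {file_header}")
            for row in reader:
                key = tuple(row)
                if key not in seen:
                    seen.add(key)
                    merged.append(row)

    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        if header is not None:
            writer.writerow(header)
        writer.writerows(merged)

    return len(merged)
```

For example, `merge_scrapes(["attempt_1.csv", "attempt_2.csv"], "jobs_merged.csv")` would combine two attempts (hypothetical file names). Only exact duplicate rows are dropped, so rows that differ in any column are both kept.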
- Use data_cleaning.ipynb to explore and clean the data. Feel free to make any necessary adjustments to ensure that you have a clear understanding of your data.
- To get the locations and state abbreviations for each job title, run data_explode.ipynb.
NOTE: Some CSV files may contain formatting errors that affect the next step. Consider saving the file as .xlsx or another more robust format.
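To make the "explode" step concrete, here is a minimal sketch of turning one row per job into one row per location. It is not the code in data_explode.ipynb. It assumes that each job has an `ID`, that its locations are stored in a single `Locations` field separated by semicolons, and that each location looks like `City, ST`. The `Locations` column name, the separator, and the function name are all assumptions made for illustration.

```python
def explode_locations(jobs, sep=";"):
    """Return one row per (job, location) pair.

    `jobs` is a list of dicts with "ID" and "Locations" keys (assumed names).
    The state abbreviation is taken from the text after the last comma and is
    left empty when it is not a two-letter code (e.g. "Remote").
    """
    rows = []
    for job in jobs:
        for loc in job["Locations"].split(sep):
            loc = loc.strip()
            if not loc:
                continue  # skip empty fragments such as a trailing separator
            _city, comma, state = loc.rpartition(",")
            state = state.strip().upper() if comma else ""
            if not (len(state) == 2 and state.isalpha()):
                state = ""
            rows.append(
                {"ID": job["ID"], "Location": loc, "States abbreviation": state}
            )
    return rows
```

Keeping `ID` on every output row is what allows the location rows to be linked back to the job information later.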
- Install Tableau from https://www.tableau.com/.
- Create a new workbook using 3 files: the Job Information Data, the Location Data, and the Hexagon Map.
- Establish the relationships between the 3 files: link the Job Information Data to the Location Data by ID, then link the Location Data's "States abbreviation" field to the Hexagon Map's "Abbreviation" field.
- Feel free to use any charts to visualize your data. Here is an example dashboard: Data Visualization.twbx.
Tableau: https://public.tableau.com/app/profile/tran.ho4576/viz/DataVisualization_16803730460700/Dashboard1
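Before relating the three files in Tableau, it can help to check that the join keys actually line up, since unmatched keys silently drop out of the visualization. The sketch below is one possible check, assuming each file has been loaded as a list of dicts (for example with `csv.DictReader`). The helper name `unmatched_keys` is hypothetical. The field names `ID`, `States abbreviation`, and `Abbreviation` follow the relationship described above.

```python
def unmatched_keys(left_rows, left_key, right_rows, right_key):
    """Return the sorted values of left_key that have no match in right_key."""
    right_values = {row[right_key] for row in right_rows}
    return sorted({row[left_key] for row in left_rows} - right_values)


# Tiny made-up rows, only to show the shape of the check.
job_info = [{"ID": "1"}, {"ID": "2"}]
locations = [
    {"ID": "1", "States abbreviation": "FL"},
    {"ID": "2", "States abbreviation": "NY"},
    {"ID": "3", "States abbreviation": "XX"},
]
hex_map = [{"Abbreviation": "FL"}, {"Abbreviation": "NY"}]

# Location rows whose ID is missing from the job information.
missing_jobs = unmatched_keys(locations, "ID", job_info, "ID")
# State abbreviations that the hexagon map does not know about.
missing_states = unmatched_keys(
    locations, "States abbreviation", hex_map, "Abbreviation"
)
```

With the sample rows above, the location with ID `3` and the abbreviation `XX` are reported as unmatched. Empty results for both checks suggest the relationships will cover every row.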