Pandas project made in Ironhack Data Analytics Aug '20 cohort.
The objective of this project is to demonstrate a hypothesis made about shark attacks. A messy shark attack dataset is received and several data cleaning and data wrangling techniques are applied to this dataset in order to fulfill this objective.
The hypothesis chosen is In the last century, the top five countries where non-fatal shark attacks to people practising spearfishing in winter have occured are, in this order, Australia, South Africa, New Zealand, Fiji and USA
.
The structure and contents of the project are as follows:
notebooks/
: two jupyter notebooks, clean.ipynb for the cleaning proccess and hypothesis.ipynb for the hypothesis demonstration.output/
: a.csv
with the clean dataset, sharks_clean.csv.