
eda-marathon-using-python's Introduction

EDA of Ultra-Marathon Running Using Python

This repository contains an exploratory data analysis (EDA) of a comprehensive dataset on ultra-marathon running events. The dataset spans over two centuries and provides detailed information on various ultra-marathon races, athletes, and their performances.

Dataset

The dataset used in this analysis is sourced from Kaggle: The Big Dataset of Ultra-Marathon Running. It includes information on race events, distances, athlete demographics, and performance metrics.

Analysis Overview

The Jupyter notebook EDA_marathon_running.ipynb covers the following steps:

  1. Data Acquisition

    • Downloading the dataset using the Kaggle API.
    • Extracting the data from the downloaded zip file.
  2. Data Preprocessing

    • Loading the dataset into a pandas DataFrame.
    • Initial exploration of the dataset (shape, data types, missing values).
    • Filtering the dataset for specific criteria (e.g., events in 2020, races held in the USA).
    • Cleaning and transforming data columns for analysis.
  3. Exploratory Data Analysis (EDA)

    • Visualizing the distribution of race distances and athlete genders.
    • Analyzing the relationship between athlete age and performance.
    • Comparing average speeds across different event distances and genders.
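The acquisition and preprocessing steps above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the notebook's actual code: the dataset slug, the raw column names (`Year of event`, `Event name`, `Athlete average speed`), and the convention that USA events carry a `(USA)` suffix in their name are all assumptions about the Kaggle dataset, and the helper names are hypothetical.

```python
import zipfile
from pathlib import Path

import pandas as pd

# Assumed Kaggle slug for "The Big Dataset of Ultra-Marathon Running".
DATASET = "aiaiaidavid/the-big-dataset-of-ultra-marathon-running"


def download_dataset(dest: str = "data") -> None:
    """Download the dataset zip via the Kaggle API (needs API credentials)."""
    # Imported lazily: credentials are only required for this step.
    from kaggle.api.kaggle_api_extended import KaggleApi

    api = KaggleApi()
    api.authenticate()
    api.dataset_download_files(DATASET, path=dest, unzip=False)


def extract_zip(zip_path: str, dest: str = "data") -> list[str]:
    """Extract every file in the archive and return the member names."""
    Path(dest).mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return archive.namelist()


def summarize(df: pd.DataFrame) -> dict:
    """Initial exploration: shape, dtypes, and missing values per column."""
    return {
        "shape": df.shape,
        "dtypes": df.dtypes.astype(str).to_dict(),
        "missing": df.isna().sum().to_dict(),
    }


def filter_usa_2020(df: pd.DataFrame) -> pd.DataFrame:
    """Keep 2020 events whose name ends with the assumed '(USA)' suffix."""
    is_2020 = df["Year of event"] == 2020
    in_usa = df["Event name"].str.endswith("(USA)", na=False)
    return df[is_2020 & in_usa].copy()


def clean_speed(df: pd.DataFrame) -> pd.DataFrame:
    """Convert the speed column to numeric and drop rows that fail to parse."""
    out = df.copy()
    out["Athlete average speed"] = pd.to_numeric(
        out["Athlete average speed"], errors="coerce"
    )
    return out.dropna(subset=["Athlete average speed"])


# Usage on a tiny made-up frame (illustrative values, not real race data).
sample = pd.DataFrame(
    {
        "Year of event": [2020, 2020, 2019, 2020],
        "Event name": ["Race A (USA)", "Race B (FRA)", "Race C (USA)", "Race D (USA)"],
        "Athlete average speed": ["10.5", "9.0", "8.2", "n/a"],
    }
)
usa_2020 = clean_speed(filter_usa_2020(sample))
```

Keeping each step in its own small function makes it easy to run the filters on a sample frame before applying them to the full dataset, which is large enough that loading it may take a while.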

Visualizations

Several visualizations are generated to understand the data better:

  • Count plots for event distances and athlete genders.
  • Violin plots for athlete speeds across event distances.
  • Line plots showing the relationship between athlete age and average speed.
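The three plot types listed above could be produced along these lines. This is a sketch only: the column names `race_length`, `athlete_gender`, `athlete_age`, and `athlete_average_speed` are hypothetical names for a cleaned DataFrame, and the helper functions are illustrative rather than taken from the notebook.

```python
import pandas as pd


def mean_speed_by(df: pd.DataFrame, keys: list[str]) -> pd.DataFrame:
    """Average speed per group, e.g. per race length and gender."""
    return df.groupby(keys, as_index=False)["athlete_average_speed"].mean()


def plot_overview(df: pd.DataFrame):
    """Draw a count plot, a violin plot, and a line plot side by side."""
    # Plotting libraries are imported here so the aggregation helper
    # above stays usable without them.
    import matplotlib.pyplot as plt
    import seaborn as sns

    fig, axes = plt.subplots(1, 3, figsize=(18, 5))
    # Number of results per race length, split by gender.
    sns.countplot(data=df, x="race_length", hue="athlete_gender", ax=axes[0])
    # Distribution of speeds within each race length.
    sns.violinplot(data=df, x="race_length", y="athlete_average_speed", ax=axes[1])
    # Mean speed at each age (lineplot aggregates repeated x values).
    sns.lineplot(data=df, x="athlete_age", y="athlete_average_speed", ax=axes[2])
    fig.tight_layout()
    return fig


# Usage on a tiny made-up frame (illustrative values, not real race data).
sample = pd.DataFrame(
    {
        "race_length": ["50km", "50km", "50mi", "50mi"],
        "athlete_gender": ["F", "M", "F", "F"],
        "athlete_age": [30, 40, 35, 45],
        "athlete_average_speed": [8.0, 9.0, 7.0, 6.0],
    }
)
by_group = mean_speed_by(sample, ["race_length", "athlete_gender"])
```

Calling `plot_overview(sample)` returns a matplotlib figure that can be displayed in the notebook or saved with `fig.savefig(...)`.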

Dependencies

To run the notebook, you need the following Python libraries:

  • pandas
  • seaborn
  • matplotlib
  • kaggle
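One way to confirm that these libraries are available before opening the notebook is a small check like the one below. It is a convenience sketch, not part of the repository; the helper name `missing_packages` is hypothetical.

```python
from importlib.util import find_spec

REQUIRED = ["pandas", "seaborn", "matplotlib", "kaggle"]


def missing_packages(names: list[str]) -> list[str]:
    """Return the packages from `names` that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Install before running the notebook: pip install " + " ".join(missing))
```

`find_spec` only locates a package without importing it, so the check does not need Kaggle credentials to be configured.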

Conclusion

This analysis provides insights into ultra-marathon running events, highlighting trends and patterns in athlete performances based on gender, age, and event distances. Further analysis can be conducted to explore additional aspects of the dataset.

eda-marathon-using-python's People

Contributors

jey-krishna
