Git Product home page Git Product logo

cricket_analysis_k_means_clustering's Introduction

Cricket Player Performance Analysis

Objective

The project aims to analyze cricket player performance data using clustering techniques to identify player segments based on their batting statistics.

Data Preparation

The dataset includes batting statistics of various cricket players. Key attributes considered for analysis include Matches (Mat), Innings (Inns), Not Outs (NO), Runs (Runs), Batting Average (Ave), Balls Faced (BF), and Strike Rate (SR). Initially, the data is standardized using StandardScaler to ensure uniformity in scale across variables.

Hopkins Statistic

To assess the clustering tendency of the dataset, the Hopkins statistic is computed. This statistic measures the spatial randomness of the data points, with values closer to 1 indicating a high tendency to cluster.

K-Means Clustering

K-Means clustering is employed to partition the dataset into clusters based on player performance attributes. The number of clusters is arbitrarily set to 4. K-Means iteratively assigns data points to the nearest cluster centroid and updates the centroids until convergence.

Interpretation

Once clustering is performed, the players are assigned cluster IDs based on their similarity in batting statistics. The clusters can be analyzed to understand different player segments and their respective performance characteristics.

Visualization

A dendrogram is plotted using hierarchical clustering to visualize the distance between player clusters based on batting strike rate and average. This dendrogram helps identify natural groupings or clusters within the data.

Outcome

By clustering cricket player performance data, the project aims to uncover distinct player segments based on their batting statistics. This analysis can provide valuable insights for team management, talent identification, and strategic decision-making in cricket.

Libraries Used:

  • numpy
  • pandas
  • matplotlib.pyplot
  • seaborn
  • datetime
  • sklearn
  • scipy.cluster.hierarchy

cricket_analysis_k_means_clustering's People

Contributors

datasciencevishal avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.