Git Product home page Git Product logo

cs2803-final-project's Introduction

CS2803-final-project- NFL Analysis (Jacob Meyers and Lauren Kearley)

File Descriptions

  • nfl_data.csv: The original dataset from which we gathered all the data. Came from Kaggle, can be found here.
  • DataViz.ipynb: A Jupyter Notebook containing data visualizations of run/pass trends in the NFL since 2009, made with matplotlib. As suspected, even within this short period of time the NFL as a whole shifted ~3% toward passing.
  • nfl_classifiers.py: Python module which has functions to create scikit-learn classifiers and regressors on the NFL data. The classifiers were used to try to classify a play as either run or pass based on quarter, down, distance, time left in the game, and the point differential relative to the offensive team (i.e. if the offensive team is winning by 7, it would be 7.0. If they are losing by 7, it would be -7.0.) The accuracy scores typically range from 65% to 68%, with 68% being the absolute upper bound we got. The regressors were used to train models to determine the win probability of a team given the time left in the game and the point differential relative to that team. These worked much better, typically getting around 88% to 95% accuracy. The functions return a tuple of the trained model and its average accuracy based on test data. An example use of the function could be this: clf, score = build_sklearn_randforest_classifier('new_run_pass.csv'). This will give you an sklearn RandomForestClassifier trained on data from new_run_pass.csv.
  • new_run_pass.csv: This is the data used to train the classifiers to determine run or pass based on game situation. This csv was created from the original nfl_data.csv using Pandas DataFrame queries.
  • play_year.csv: This is the file the visualizations were created from. This csv was created from the original nfl_data.csv using Pandas DataFrame queries.
  • winprob.csv: This is the file the regressors were trained on to determine win probability of a team based on game situation. This csv was created from the original nfl_data.csv using Pandas DataFrame queries.

cs2803-final-project's People

Contributors

lkearley avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.