calc-analysis's Introduction

This repository contains analysis and science experiments regarding CALC data, mostly in the form of IPython/Jupyter notebooks.

Reading the notebooks

If you only want to read the notebooks, you can do so directly on GitHub: look for the files ending in .ipynb. Here are a few to get you started:

  • Anomaly detection can help us find invalid/incorrect data in CALC.

  • Fun with CALC and pre-trained GloVe word embeddings can help us perform semantic search in CALC, e.g. searching for "tutor" and getting results like "Principal Instruction Technologist".

  • Log analysis contains some open-ended exploration of CALC's API logs, to learn how people are using CALC (note that CALC's API is used by CALC's front page).

  • Contract analysis contains open-ended exploration of CALC's contract data.

  • SIN analysis attempts to parse and make sense of CALC's unstructured Special Item Number (SIN) data.
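To give a rough idea of how embedding-based semantic search like the one mentioned above can work, here is a minimal sketch. It is not the notebook's actual code: the three-dimensional vectors are invented stand-ins for pre-trained GloVe embeddings, and the helper names (`embed`, `cosine`, `search`) are hypothetical. Each title is represented by the average of its word vectors, and titles are ranked by cosine similarity to the query.

```python
import math

# Toy 3-dimensional vectors standing in for pre-trained GloVe embeddings.
# ASSUMPTION: these numbers are invented purely for illustration.
TOY_VECTORS = {
    "tutor": [0.9, 0.1, 0.0],
    "instruction": [0.8, 0.2, 0.1],
    "technologist": [0.5, 0.5, 0.2],
    "principal": [0.4, 0.3, 0.3],
    "senior": [0.2, 0.3, 0.4],
    "plumber": [0.0, 0.1, 0.9],
}


def embed(text):
    """Average the vectors of the known words in text; None if none are known."""
    vecs = [TOY_VECTORS[w] for w in text.lower().split() if w in TOY_VECTORS]
    if not vecs:
        return None
    return [sum(column) / len(vecs) for column in zip(*vecs)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query, titles):
    """Return the titles ordered from most to least similar to the query."""
    query_vec = embed(query)
    if query_vec is None:
        return []
    scored = []
    for title in titles:
        title_vec = embed(title)
        if title_vec is not None:
            scored.append((cosine(query_vec, title_vec), title))
    return [title for _, title in sorted(scored, reverse=True)]


titles = ["Senior Plumber", "Principal Instruction Technologist"]
results = search("tutor", titles)
print(results)
```

With real GloVe vectors, the dictionary would be loaded from one of the published embedding files instead of being written by hand; the ranking logic would stay the same.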

Interacting with the notebooks

If you want to interactively explore the notebooks or create new ones, you'll need to clone this repository locally.

This project's git repository uses Git Large File Storage to store some of our large data assets. Please install it before cloning. (If you've already cloned the repository, you can obtain the large files after installing Git LFS by running git lfs pull.)
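If a notebook fails to load a data file, one possible cause is that the file is still a Git LFS pointer rather than the real asset. The sketch below shows one way to check for that. It relies on the fact that LFS pointer files are small text files beginning with `version https://git-lfs.github.com/spec/v1`; the function names and the choice to scan from the repository root are assumptions made for illustration, not part of this project.

```python
from pathlib import Path

# Every Git LFS pointer file starts with this line.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file looks like an un-fetched Git LFS pointer."""
    with open(path, "rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def find_unfetched(root="."):
    """List files under root (skipping .git) that are still LFS pointers."""
    return sorted(
        p
        for p in Path(root).rglob("*")
        if p.is_file() and ".git" not in p.parts and is_lfs_pointer(p)
    )


if __name__ == "__main__":
    for path in find_unfetched("."):
        print(f"Still an LFS pointer (try running 'git lfs pull'): {path}")
```

Run from the root directory of the repository, this prints nothing once all large files have been fetched.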

If you're familiar with virtualenvs and pip, you can use them to install all dependencies:

virtualenv venv

# On Windows, replace the following line with 'venv\Scripts\activate'.
source venv/bin/activate

pip install -r requirements.txt

On the other hand, if you're not familiar with virtualenvs, you can probably just install Anaconda, which ships with most or all of the dependencies you'll need.

Once you've done either of those, just run jupyter notebook in the root directory of the repository and use its web interface to explore any of the notebooks.

calc-analysis's People

Contributors

toolness, ericschles
