Git Product home page Git Product logo

taxdata's Introduction

About taxdata Repository

This repository prepares data used in the Tax-Calculator repository.

The data produced here, all of which have CSV format, provide two different sets of data files for Tax-Calculator:

  • A set based on a recent IRS-SOI Public Use File (PUF)

  • A set based on recent Census Current Population Survey (CPS) data

Because the PUF data are restricted in their use, the IRS-SOI-supplied PUF file and the puf.csv data file produced here are not part of the taxdata or the Tax-Calculator repository.

Each of these two sets of data files contains four files:

  1. a sample data file containing variables for each tax filing unit;

  2. a factors file containing annual variable extrapolation factors;

  3. a weights file containing annual weights for each filing unit;

  4. a ratios file containing annual adjustment ratios for some variables.

Note that the factors file is the same in both sets of data files because the variable extrapolation factors are independent of the sample data being used. But the weights and ratios files do depend on the data file, so they are different in the two sets of data files.

Data-Preparation Documentation

IRS-SOI Public Use File (PUF) documentation:

  1. PUF-based sample data;

  2. grow factors

  3. PUF-based sample weights;

  4. PUF-based adjustment ratios.

Census Current Population Survey (CPS) documentation is available here:

  1. CPS-based sample data;

  2. grow factors

  3. CPS-based sample weights;

  4. CPS-based adjustment ratios.

Work-Flow Documentation

The sequence of operations required to make the two sets of data files is contained in the csvmake bash script, which also automates the preparation work-flow (except on Windows).

The sequence of operations required to install the two sets of data files in the Tax-Calculator repository is contained in the csvcopy bash script, which also automates the installation work-flow (except on Windows).

Contributors

  • John O'Hare
  • Amy Xu
  • Anderson Frailey
  • Martin Holmer

taxdata's People

Contributors

martinholmer avatar andersonfrailey avatar amy-xu avatar matthjensen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.