Git Product home page Git Product logo

project-alpha's Introduction

Project Alpha

Statistics 159/259: Reproducible and Collaborative Statistical Data Science

UC Berkeley | Fall 2015

[Build Status] (https://travis-ci.org/berkeley-stat159/project-alpha?branch=master) [![Coverage Status](https://coveralls.io/repos/berkeley-stat159/project-alpha/badge.svg?bran ch=master)] (https://coveralls.io/r/berkeley-stat159/project-alpha?branch=master)

This repository stores the documentation of our analysis of the balloon- analogue risk task (BART) data included as part of the OpenfMRI ds009 data set [The generality of self-control] (https://openfmri.org/dataset/ds000009/). The original analysis was conducted in 2009 by Jessica Cohen for her doctoral thesis at UCLA, and was largely focused on comparing different notions of self-control across several fMRI studies. Our original aim was to reproduce the original analysis. However, we were unable to do so for many of the methods used, either due to the utilization of unavailable pre-packaged software or because we could not justify the underlying theory for some approaches. So, we chose to develop our own pre-processing pipeline and pursue new directions for identifying activation regions for the BART study. Our ultimate goal is to compare our results with those of Cohen's paper.

Many thanks to Jarrod Millman, Matthew Brett, J-B Poline, and Ross Barnowski for their advice and encouragement.

Navigating the Repository

The Makefile contains recipes to perform all final analysis of the data, generate the project report, and test all user-defined functions.

Please note that to perform the analysis and/or run the function test files, you will need to download the data for analysis and the testing data. Please see the project-alpha/data directory for more details.

  • make all: Downloads and validates the data and then performs all final analysis. Generates the final report, stored in the paper subdirectory. Please be aware that the raw data is 6 GB and that we also generate 14 GB of data in our analysis, so plan accordingly for download space and time. All images needed for the report are reproduced in the images directory.

  • make data: Downloads and decompresses the .tgz archive of the data for our analysis from OpenfMRI.org. We are using the BART data from ds009, The generality of self-control. All downloaded files will be located in the data/ds009 directory. Note that the .tgz archive is about 6 GB, so users should plan accordingly for space and download time.

  • make validate: Validates the ds009 data after downloading it via make data. Checks if all files are present and have the correct hash, based on the hashes stored in the data/ds009_hashes.json file.

  • make analysis: Generate final figures and results. Please note that in order to generate our results, 14 GB of data will be produced.

  • make report: Generates the PDF of our report, including appendices.

  • make eda: Generate figures from exploratory analysis and sends console output to eda.txt in the current directory. Please be aware that the output from this script could perhaps be better described as all of our supplementary data analysis that was not directly referenced in our paper.

  • make clean: Removes all extra files generated when compiling code. Does this recursively for all subdirectories

  • make test: Tests the functions located in the data and code directories, to be used to validate and analyze the data, respectively.

  • make verbose: Performs the same actions as make test, but uses the verbose nosetests option.

  • make coverage: Generates a coverage report for the functions located in the data and code directories.

Commands and documentation for generating additional and supplementary work can be found in various subdirectories. Please view their respective READMEs for more information.

  • code: Code files for all user-defined functions used for analysis, the tests files for the functions, and scripts to run intermediate analysis.
  • data: Data used for our analysis and tests can be downloaded and validated here.
  • final: Scripts to run the final analysis and generate all figures required for the report and slides. The make all recipe in the present directory essentially runs all the scripts in final.
  • images: Stores all output figures generated by the analysis scripts. Figures required for the report and slides are cached for convenience, but may also be reproduced.
  • paper: Files and instructions to generate the report.
  • proposal: Files and instructions to generate the proposal.
  • slides: Files and instructions to generate the progress report and final slides.

Contributors

project-alpha's People

Contributors

benjaminleroy avatar hiroto-udagawa avatar janewliang avatar jarrodmillman avatar kentschen avatar reychil avatar rossbar avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.