analysis-flow's Introduction

Data Analysis Workflows & Reproducibility Learning Resources

This repository aims to collect resources relating to workflow and tooling choices that promote reproducibility and best practice in data analysis and data science projects.

The resources have been organised as:

  • R Packages
  • Books
  • Papers
  • Blog Posts
  • Talks and Videos

If you would like to contribute a resource, I would be glad to include it. Please file an issue, submit a PR, or email me at [email protected]


R Packages

| Package | About | Available on |
|---|---|---|
| drake | An R-focused pipeline toolkit for reproducibility and high-performance computing | CRAN |
| ProjectTemplate | A system for automating the thoughtless parts of a data analysis project | CRAN |
| workflowr | A framework for reproducible and collaborative data science | CRAN |
| rrtools | Tools for writing reproducible research in R | GitHub |
| orderly | Lightweight reproducible reporting for R | CRAN |
| fnmate | A function definition generator | GitHub |
| dflow | Automatically set up a drake project | GitHub |
| represtools | Basic utility functions to support reproducible research | CRAN |
| starters | R package for initializing projects for various R activities | GitHub |
| targets | Function-oriented Make-like declarative workflows for R | GitHub |

Books

| Title | Authors | Year |
|---|---|---|
| Agile Data Science with R - A workflow | Edwin Thoen | 2020 |
| What They Forgot to Teach You About R | Jennifer Bryan, Jim Hester | 2020 |
| The Turing Way: A Handbook for Reproducible Data Science | Becky Arnold, Louise Bowler, Sarah Gibson, Patricia Herterich, Rosie Higman, Kirstie Whitaker | 2019 |

Papers

| Title | Citation |
|---|---|
| Packaging Data Analytical Work Reproducibly Using R (and Friends) | Ben Marwick, Carl Boettiger & Lincoln Mullen (2018). Packaging Data Analytical Work Reproducibly Using R (and Friends). The American Statistician, 72:1, 80-88. DOI: 10.1080/00031305.2017.1375986 |
| Opinionated analysis development | Parker H. (2017). Opinionated analysis development. PeerJ Preprints 5:e3210v1. https://doi.org/10.7287/peerj.preprints.3210v1 |

Blog Posts


Talks

analysis-flow's People

Contributors

deanmarchiori, deanmarchiori-ee, hplieninger, jl5000, milesmcbain, mitchstares

analysis-flow's Issues

Meet ups to discuss workflows

There is some interest in holding a meetup to discuss workflows, project setup, etc.

There was some Twitter banter about where to hold it, but given COVID-19, online/virtual meetups would perhaps be best.
