A short NCRM workshop led by Professor Vernon Gayle, University of Edinburgh (UK)
Material developed by Professor Vernon Gayle, University of Edinburgh.
Concern about the lack of the reproducibility of research persists across a range of academic disciplines. There is a general appeal for extra materials to be routinely provided alongside research publications and reports which include sufficient information for a third party to reproduce results without any additional information from the authors. Jupyter notebooks are an open source web-based application that enable researchers to author documents that include live code (e.g. Stata or R code), alongside data analysis outputs (e.g. modelling results, plots etc.), and documentation (e.g. narrative text describing and detailing the workflow. Jupyter notebooks have been used in Nobel Prizewinning high-profile big science applications but are rarely used social science research.
File Title | File Type | Description |
---|---|---|
audreys_thomas_workshop_slides_20200304_vg_v1.pdf | Workshop slides | |
jupyter_install_june_2019_20190618_vg_v3.pdf | updated pdf on how to install Jupyter and the Stata and R kernels |
nbviewer - A simple way to share Jupyter notebooks https://nbviewer.jupyter.org/
Link to the git repo for Connelly and Gayle (2019) https://github.com/RoxanneConnelly/Social-Class-Inequalities-in-General-Cognitive-Ability-in-Two-British-Birth-Cohorts
View Connelly and Gayle (2019) Jupyter Notebook in nbviewer https://nbviewer.jupyter.org/github/RoxanneConnelly/Social-Class-Inequalities-in-General-Cognitive-Ability-in-Two-British-Birth-Cohorts/blob/master/JupterNotebook_20171122.ipynb
The Project Jupyter Website (http://jupyter.org/)
Install Jupyter http://jupyter.org/install.html
The Quickstart Guide to https://jupyter.readthedocs.io/en/latest/content-quickstart.html
The Workflow of Data Analysis Using Stata http://www.indiana.edu/~jslsoc/web_workflow/wf_home.htm
Paper on the workflow in social science data analysis
Gayle, V.J. and Lambert, P.S. (2017) The Workflow: A Practical Guide to Producing Accurate, Efficient, Transparent and Reproducible Social Survey Data Analysis. NCRM Working Paper http://eprints.ncrm.ac.uk/4000/
Long, J.S. and Long, J.S., 2009. The workflow of data analysis using Stata. College Station, TX: Stata Press.
https://www.amazon.co.uk/Workflow-Data-Analysis-Using-Stata/dp/1597180475/ref=sr_1_1?s=books&ie=UTF8&qid=1530726163&sr=1-1&keywords=stata+workflow
A fantastic book. This is the 'bible' of good data analysis workflow practices.
A blog post from Fernando Perez's blog http://blog.fperez.org/2013/04/literate-computing-and-computational.html
Lorena Barba's web page http://lorenabarba.com/
A gallery of interesting notebooks https://github.com/jupyter/jupyter/wiki/A-gallery-of-interesting-Jupyter-Notebooks
https://guides.github.com/activities/hello-world/
https://zeroturnaround.com/rebellabs/git-commands-and-best-practices-cheat-sheet/
https://services.github.com/on-demand/downloads/github-git-cheat-sheet.pdf
Here is a summary of Markdown codes https://en.wikipedia.org/wiki/Markdown#Example
If you have half an hour you can easily learn Markdown from here
http://www.markdowntutorial.com/
(if this doesn't work try a different browser)
Gayle, V. and Lambert, P. (2018) What is Quantitative Longitudinal Data Analysis? Bloomsbury Publishing.
Available at Amazon https://tinyurl.com/y8cbf7c2 or directly from Bloomsbury https://tinyurl.com/y9x5fltk