Git Product home page Git Product logo

fungenomics / g34-gliomas Goto Github PK

View Code? Open in Web Editor NEW
0.0 3.0 0.0 27.99 MB

Cell-of-origin analysis for "Histone H3.3G34-mutant interneuron progenitors co-opt PDGFRA for gliomagenesis" (Chen*, Deshmukh*, Jessa*, Hadjadj*, et al, Cell, 2020)

Home Page: https://fungenomics.github.io/G34-gliomas/

HTML 99.60% CSS 0.01% JavaScript 0.01% R 0.36% Shell 0.02%
reproducible-research cancer-genomics reproducible-paper rna-seq

g34-gliomas's Introduction

DOI

G34-gliomas

This repository contains the code & data for the bulk analysis included the G34R/V HGG manuscript (Chen, Deshmukh, Jessa, Hadjadj, et al, Cell, 2020), for the analysis that was performed by our lab.

This repository is meant to enhance the STAR Methods section by providing code for the custom analyses in the manuscript and the exact R dependencies, in order to improve reproducibility for the main results. However, it is not a fully executable workflow. The code is permanently archived on Zenodo at the doi 10.5281/zenodo.7086446.

NOTE: if viewing on GitHub, only code is visible, as inputs, data, and outputs are generally not tracked in git.

Contents:

Directory organization and links to code

Map from figures to code

This section contains a pointer from each figure in the paper to the section (§) where it's generated in the code. For each figure panel, I provide a partial path to the RMD/MD/HTML files within this repository/directory, and then the section in the rendered HTML which specifically produces that panel. As described below in the section on reproducibility, the source data for the figure is typically saved alongside the figure itself.

Figure 2

  • Figure 2A: bulk_transcriptome_epigenome/02-GSEA..., § 6.1.1 Forebrain reference
  • Figure 2C: bulk_transcriptome_epigenome/02-GSEA..., § 6.1.2 Striatal SVZ
  • Figure 2D: bulk_transcriptome_epigenome/01-bulk_RNAseq_pipeline... § 4.3.2 Lineage specific TFs
  • Figure 2E: bulk_transcriptome_epigenome/03-ChIPseq..., § 4.2.1 DGE
  • Figure 2F: singlecell_normal/analysis/01-interneuron_pseudotime...,
    • top panel § 4.1 Cell type density along normal interneuron differentiation trajectory
    • bottom panel § 4.2 Plot genes of interest along pseudotime

Figure S2

  • Figure S2A: bulk_transcriptome_epigenome/02-GSEA..., § 6.2 GSEA enrichment plots
  • Figure S2B: bulk_transcriptome_epigenome/02-GSEA..., § 6.4 Confirmation of signal by direct expression of gene programs
  • Figure S2D: bulk_transcriptome_epigenome/02-GSEA..., § 6.1.3 Adult V-SVZ
  • Figure S2E: bulk_transcriptome_epigenome/01-bulk_RNAseq_pipeline... § 4.3.2 Lineage specific TFs

Figure 3

  • Figure 3A: bulk_transcriptome_epigenome/analysis/04-isogenic_cell_lines...,
    • top panel § 4.4.3 Targeted DGE, for stem condition - GBM002
    • bottom panel § 4.4.2 Targeted DGE, for serum condition - GBM002
  • Figure 3C: bulk_transcriptome_epigenome/analysis/04-isogenic_cell_lines... § 4.5.4 Visualize results
  • Figure 3D: singlecell_normal/analysis/02-gene_bubbleplots...,
    • left § 4.1 Mouse developing forebrain
    • right § 4.3 Striatal SVZ

Figure S5

  • Figure S5A: bulk_transcriptome_epigenome/01-bulk_RNAseq_pipeline... § 4.4.1 G34 mutants
  • Figure S5B: singlecell_normal/analysis/02-gene_bubbleplots..., § 4.2 Human

Figure 5

  • Figure 5D: singlecell_normal/analysis/03-astrocyte_interneuron_coexpression..., § 5.3 Developing mouse forebrain and 5.4 H3G34R/V tumors

Figure S6

  • Figure S6C: singlecell_normal/analysis/03-astrocyte_interneuron_coexpression..., § 5.2 Human fetal telencephalon

Notes for reproducibility

rr template & helpers

This repository uses the rr template, which contains a set of R markdown templates to help ensure reproducibility. This template also provides a set of helper functions (located in rr_helpers.R and prefixed by rr_ in the function name) to help encourage documentation.

R and R package versions

The R library for this project is managed with the package renv, which:

  1. maintains an isolated project-specific library in the renv folder,
  2. stores packages according to version
  3. records the R, Bioconductor, and package versions in the file renv.lock, which can be used to reproduce the R package environment elsewhere

The R version used is 3.5.1. The analysis also makes use of our in-house package for scRNAseq visualization, cytobox.

R Markdown

Each markdown/HTML file has a "Reproducibility report" at the bottom, indicating when the document was last rendered, the most recent git commit when it was rendered, the seed, and the R session info.

Figures and source data

For most figures, the source data underlying the plot is saved along side the figure in the respective figures directory. If so, a message is displayed in the markdown/HTML files underneath the chunk which produces the plot, giving the path for the figures/source data within this project directory.

e.g. [figure/source data @ G34-gliomas/bulk_transcriptome_epigenome/figures/01/gsx2_pdgfra_correlation…]

Analysis outputs

For most text file & R object outputs, there is a text file saved next to the object with the extension .desc, with a very brief one-line description of what's contained in the file.

e.g. for the output file bulk_transcriptome_epigenome/output/02/fgsea_df.tsv, there is an associated description file bulk_transcriptome_epigenome/output/02/fgsea_df.desc

GitHub / version control

This directory is tracked with git and has an associated GitHub repository in the Kleinman lab account at https://github.com/fungenomics/G34-gliomas.

The following are tracked / available on GitHub:

  • .Rmd files, containing the code, and .md files, containing code and outputs
  • Figures in png format
  • Certain output files (tsv/Rda/Rds), if they're small
  • The brief desc files for otputs
  • The lockfile produced by the renv package

The following are not tracked / available on GitHub:

  • Figures in pdf format, and figure source data
  • Raw data and large analysis output / processed data files
  • The actual packages in the R library

Contact

Selin Jessa (selin.jessa at mail.mcgill.ca)

g34-gliomas's People

Contributors

sjessa avatar

Watchers

 avatar Steven Hébert avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.