
scrna_smartseq2's Introduction

scRNA_SMARTSEQ2_transcriptomeMapping

This pipeline processes paired-end single-cell RNA sequencing data. It takes the individual FASTQ files for each cell as input, then trims, maps, filters, and counts the reads. From the resulting counts table it builds a Seurat object. Standard outputs include counts tables, metadata files, QC plots, and the Seurat object.

Trimming

The pipeline uses Trim Galore to trim the reads and to run FastQC on them. It outputs gzipped trimmed FASTQs, a trimming report, and FastQC reports.
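As a rough illustration of the trimming step, the sketch below builds a Trim Galore command line for one cell. It is not taken from the pipeline's rules: the function name, file names, and output directory are hypothetical, and the flags are chosen only to match the outputs described above (paired-end input, FastQC reports, gzipped output).

```python
import shlex


def trim_galore_command(read1, read2, out_dir):
    """Build a Trim Galore command for one paired-end cell (illustrative only)."""
    return [
        "trim_galore",
        "--paired",              # treat read1/read2 as mates
        "--fastqc",              # run FastQC once trimming has finished
        "--gzip",                # write gzipped trimmed FASTQs
        "--output_dir", out_dir,
        read1,
        read2,
    ]


# Hypothetical file names for a single cell.
cmd = trim_galore_command("cellA_R1.fastq.gz", "cellA_R2.fastq.gz", "trimmed")
print(shlex.join(cmd))
```

Building the command as a list keeps each argument separate, which avoids quoting problems if it is later passed to subprocess.run.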

FastQ Screen

The pipeline runs FastQ Screen on the trimmed reads and produces additional QC reports.

Mapping

This pipeline uses HISAT2 to map the reads to a genome of choice. The locations of the HISAT2 executable, the genome index, and the GTF file need to be specified in the config file. The rule is currently set up to run HISAT2 with 12 threads; this can be changed in the align.smk rule. HISAT2 outputs SAM files by default, so an extra step converts the SAM files to BAM files.
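The two mapping steps described above could look roughly like the following. This is a sketch under assumptions, not the contents of align.smk: the function names, paths, and index prefix are hypothetical, and samtools is assumed for the SAM-to-BAM conversion because the README does not name the tool used.

```python
def hisat2_command(index_prefix, read1, read2, sam_out, threads=12, hisat2="hisat2"):
    """Build a HISAT2 paired-end alignment command (illustrative only)."""
    return [
        hisat2,                  # path to the HISAT2 executable, e.g. from the config
        "-p", str(threads),      # number of alignment threads
        "-x", index_prefix,      # genome index prefix
        "-1", read1,
        "-2", read2,
        "-S", sam_out,           # HISAT2 writes SAM output
    ]


def sam_to_bam_command(sam_in, bam_out):
    """Convert SAM to BAM, assuming samtools is the converter."""
    return ["samtools", "view", "-b", "-o", bam_out, sam_in]


# Hypothetical paths for a single cell.
align = hisat2_command("index/genome", "cellA_R1_val_1.fq.gz",
                       "cellA_R2_val_2.fq.gz", "cellA.sam")
convert = sam_to_bam_command("cellA.sam", "cellA.bam")
print(" ".join(align))
print(" ".join(convert))
```

The default of 12 threads mirrors the value mentioned above; passing a different threads value corresponds to editing the rule.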

Feature Count

The pipeline uses featureCounts to count the mapped reads. The location of the GTF file needs to be specified in the config file.
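For orientation, a featureCounts call for this step might be assembled as below. The function name, file names, and thread count are hypothetical, and the flags the pipeline actually passes are not stated in this README. Depending on the Subread version, paired-end counting may need options beyond -p, so check the documentation for the installed release.

```python
def feature_counts_command(gtf, counts_out, bams, threads=1, paired=True):
    """Build a featureCounts command over one or more BAM files (illustrative only)."""
    cmd = [
        "featureCounts",
        "-a", gtf,               # annotation file (GTF), as set in the config
        "-o", counts_out,        # output counts table
        "-T", str(threads),      # number of threads
    ]
    if paired:
        cmd.append("-p")         # input BAMs contain paired-end reads
    cmd.extend(bams)
    return cmd


# Hypothetical inputs: one BAM per cell.
count_cmd = feature_counts_command("genes.gtf", "counts.txt",
                                   ["cellA.bam", "cellB.bam"], threads=4)
print(" ".join(count_cmd))
```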

Filter Counts

A custom R script filters the counts. Options to filter out mitochondrial genes and to filter by biotype need to be specified in the config file. The script also outputs a sample metadata file linking the sample name, output BAM, and cell name, and a gene metadata file containing the chromosome location, Ensembl ID, and gene name for every gene in the dataset.
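The filtering logic can be pictured with a small Python sketch. It is not the pipeline's R script: the data layout, function name, option names, and gene IDs are hypothetical, and it assumes mitochondrial genes are identified by a chromosome named "MT" or "chrM".

```python
def filter_counts(counts, gene_meta, drop_mito=True, keep_biotypes=None,
                  mito_chroms=("MT", "chrM")):
    """Return only the genes that pass the mitochondrial and biotype filters.

    counts:    {gene_id: {cell_name: count}}
    gene_meta: {gene_id: {"chrom": str, "biotype": str}}
    """
    kept = {}
    for gene_id, per_cell in counts.items():
        meta = gene_meta[gene_id]
        if drop_mito and meta["chrom"] in mito_chroms:
            continue  # drop mitochondrial genes when requested
        if keep_biotypes is not None and meta["biotype"] not in keep_biotypes:
            continue  # drop genes outside the requested biotypes
        kept[gene_id] = per_cell
    return kept


# Toy data with placeholder gene IDs.
counts = {
    "gene1": {"cellA": 5, "cellB": 0},
    "gene2": {"cellA": 40, "cellB": 12},
    "gene3": {"cellA": 1, "cellB": 3},
}
gene_meta = {
    "gene1": {"chrom": "1", "biotype": "protein_coding"},
    "gene2": {"chrom": "MT", "biotype": "protein_coding"},
    "gene3": {"chrom": "2", "biotype": "lncRNA"},
}
filtered = filter_counts(counts, gene_meta, keep_biotypes={"protein_coding"})
print(sorted(filtered))
```

In this toy input, gene2 is dropped as mitochondrial and gene3 is dropped by the biotype filter, so only gene1 remains.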

Seurat

This step takes the filtered counts table and creates a Seurat object from it. It produces QC plots and filters the Seurat object based on parameters that need to be set in the config file. There is also an option to integrate different samples together in the Seurat object. The step then clusters the dataset, produces UMAPs, and outputs differential expression heatmaps. It also produces a second Seurat object with the effects of cell cycle removed and runs the same clustering and differential expression analysis on this second object.

scrna_smartseq2's People

Contributors

ashrosewood, adoe21
