Git Product home page Git Product logo

rmats-nf's Introduction

rmats-nf:

A pipeline to retrieve RNA sequencing data from SRA, perform alignment with hisat2 and characterize splicing events with rMATS 3.2.5.

rmats-nf: Documentation

An overview of how the pipeline works, how to run it and a description of all of the different command-line flags.

Usage

The typical command for running the pipeline is as follows:

nextflow run lifebit-ai/rmats-nf --accessionList 'accession_list.txt' -profile docker

Mandatory arguments:

--accessionList

Path to input file with accession list to fetch from SRA

--gencodeFile

Path to input gencode GTF file

--keyFile

Path to a keyfile used to fetch restricted access datasets with SRAtools

-profile

Configuration profile to use. Can use multiple (comma separated) Available: short-test, key-test

Other arguments:

--skipTrimming

If used, trimming will be ommited.

--readType

Specifies what type of input reads are to be processed. Accepted values: single end, paired end

--readLength

Specifies the read length

rmats-nf's People

Contributors

pprieto avatar cgpu avatar odiogosilva avatar

Stargazers

wangyang avatar jkl123 avatar  avatar

Watchers

Konrad Zych avatar Ricardo Lopes avatar  avatar James Cloos avatar dmoleiro avatar Pedro Luís avatar Alberto avatar  avatar  avatar Bruno Gonçalves avatar Tiago Jesus avatar gmmrx avatar  avatar Panos Matzavinos avatar David Silva avatar Pedro Alagoa avatar Santiago de la Peña González avatar Sofwan Lawal avatar Sangram K Sahu avatar

rmats-nf's Issues

Create tarballs for directories in publishDir with high count of files to decrease total files

Why?

To decrease number of total files in the results/ directory of the job.

How?

With a bash ugly-liner, something like this

for dir in `find . -maxdepth 1 -type d  | grep -v "^\.$" `; do tar -cvzf ${dir}.tar.gz ${dir} && rm -R ${dir}; done

Which reads like this:

Search for folders in the working directory (excluding itself), create a .tar.gz of every folder and remove the initial folder.

Try it with this reprex:

mkdir folder1 folder2
cd folder1 && touch file1 file2 && cd -
cd folder2 && touch file1 file2 && cd -

image

As Anne, I would like to add the functionality to be able to automatically download a requested version of gencode

this should be part of the rmats.nf pipeline to download a requested version of annotation or to have the user supply their own (with long read RNA-sequencing) this is the case. That is, that the user will make a sample specific transcriptome, in the form of a GTF file, this is the outcome of the https://github.com/TheJacksonLaboratory/LyRICS pipeline.
ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_33/gencode.v33.annotation.gtf.gz

This is a transferred issue initially posted here: TheJacksonLaboratory/sbas#53

needs documentation :)

This public repository would benefit from a pre-amble, documentation, continuous integration and development tests, etc....

rMATS output files needed for sbas

When rMATS (any version) is run, the fromGTF files define the junction definitions for each of the splicing types (A3SS, A5SS, RI, SE, MXE) and junction counting types (JC, JCEC) and counting types (IJC, SJC, INC (calculated), INCLEN, SKIPLEN) the saveSampleCounts.sh has some good documentation which should go into a README.md for matrix formation.

Quoting Anne above.

  • Do we have the script she mentions named saveSampleCounts.sh
  • Do we have as ouputs the fromGTF.{<splicing-event-type>}.txt files to be able to use?

The latter are input files for the sbas repo scripts.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.