Git Product home page Git Product logo

mry's Introduction

MrY -- ๐Ÿ“š Management of references for yourself

Why this mini-manager?

When human, mouse, zebrafish, roundworm (C. elegans), yeast (S. cerevisiae), and bacteria (salmonella and M. tuberculosis) show up together in one zoo, a manager is suggested.

What does it do for you?

  • Download genome sequence (.fasta). It downloads the primary_assembly version, if not available, toplevel version is downloaded.
  • Download annotation (both .gff3 and .gtf).
  • Build aligner index.
  • Maintain references in a folder structure which is making sense.

How to install MrY?

git clone [email protected]:Puriney/MrY.git
cd MrY
pip install ./

How to use MrY?

There are 3 common operations:

  1. install
  2. list
  3. delete

Install references

yun install  --root-dir /path/to/your/zoo \
--species zebrafish \
--assembly GRCz10 \
--org Ensembl \
--release 90 \
--target all

MrY will generate a folder named zebrafish under /path/to/your/zoo to create references for zebrafish of which the assembly name is GRCz10 in release 90 of Ensembl.

zebrafish/
โ”œโ”€โ”€ aligner_index
โ”‚ย ย  โ”œโ”€โ”€ bowtie2
โ”‚ย ย  โ”‚ย ย  โ””โ”€โ”€ Ensembl
โ”‚ย ย  โ”‚ย ย      โ””โ”€โ”€ GRCz10
โ”‚ย ย  โ”‚ย ย          โ””โ”€โ”€ release_90
โ”‚ย ย  โ”‚ย ย              โ”œโ”€โ”€ GRCz10.dna.1.bt2
โ”‚ย ย  โ”‚ย ย              โ”œโ”€โ”€ GRCz10.dna.2.bt2
โ”‚ย ย  โ”‚ย ย              โ”œโ”€โ”€ GRCz10.dna.3.bt2
โ”‚ย ย  โ”‚ย ย              โ”œโ”€โ”€ GRCz10.dna.4.bt2
โ”‚ย ย  โ”‚ย ย              โ”œโ”€โ”€ GRCz10.dna.rev.1.bt2
โ”‚ย ย  โ”‚ย ย              โ””โ”€โ”€ GRCz10.dna.rev.2.bt2
โ”‚ย ย  โ””โ”€โ”€ star
โ”‚ย ย      โ””โ”€โ”€ Ensembl
โ”‚ย ย          โ””โ”€โ”€ GRCz10
โ”‚ย ย              โ””โ”€โ”€ release_90
โ”‚ย ย                  โ”œโ”€โ”€ chrLength.txt
โ”‚ย ย                  โ”œโ”€โ”€ chrNameLength.txt
โ”‚ย ย                  โ”œโ”€โ”€ chrName.txt
โ”‚ย ย                  โ”œโ”€โ”€ chrStart.txt
โ”‚ย ย                  โ”œโ”€โ”€ exonGeTrInfo.tab
โ”‚ย ย                  โ”œโ”€โ”€ exonInfo.tab
โ”‚ย ย                  โ”œโ”€โ”€ geneInfo.tab
โ”‚ย ย                  โ”œโ”€โ”€ Genome
โ”‚ย ย                  โ”œโ”€โ”€ genomeParameters.txt
โ”‚ย ย                  โ”œโ”€โ”€ Log.out
โ”‚ย ย                  โ”œโ”€โ”€ SA
โ”‚ย ย                  โ”œโ”€โ”€ SAindex
โ”‚ย ย                  โ”œโ”€โ”€ sjdbInfo.txt
โ”‚ย ย                  โ”œโ”€โ”€ sjdbList.fromGTF.out.tab
โ”‚ย ย                  โ”œโ”€โ”€ sjdbList.out.tab
โ”‚ย ย                  โ””โ”€โ”€ transcriptInfo.tab
โ”œโ”€โ”€ annotation
โ”‚ย ย  โ””โ”€โ”€ Ensembl
โ”‚ย ย      โ””โ”€โ”€ GRCz10
โ”‚ย ย          โ””โ”€โ”€ release_90
โ”‚ย ย              โ”œโ”€โ”€ ensembl.GRCz10.90.gff3
โ”‚ย ย              โ”œโ”€โ”€ ensembl.GRCz10.90.gff3.gz
โ”‚ย ย              โ”œโ”€โ”€ ensembl.GRCz10.90.gtf
โ”‚ย ย              โ””โ”€โ”€ ensembl.GRCz10.90.gtf.gz
โ””โ”€โ”€ genome
    โ””โ”€โ”€ Ensembl
        โ””โ”€โ”€ GRCz10
            โ””โ”€โ”€ release_90
                โ”œโ”€โ”€ Danio_rerio.GRCz10.dna.toplevel.fa.gz
                โ”œโ”€โ”€ GRCz10.dna.fa
                โ””โ”€โ”€ GRCz10.dna.fa.gz -> /path/to/your/zoo/zebrafish/genome/Ensembl/GRCz10/release_90/Danio_rerio.GRCz10.dna.toplevel.fa.gz

Alternatively, more than one species can be installed in the same time:

yun install --root-dir /ifs/data/yanailab/ref \
--species zebrafish roundworm brewer_yeast \
--assembly GRCz10 WBcel235 R64-1-1 \
--release  90 90 90 \
--org Ensembl \
--target all

List available references

List available references for specific zibrafish:

yun list  --root-dir /path/to/your/zoo \
--species zebrafish \
--assembly GRCz10 \
--org Ensembl \
--release 90
==========
zebrafish-GRCz10-Ensembl-90 was queried:
            Genome (.fa): Installed
       Annotation (.gtf): Installed
      Annotation (.gff3): Installed
                 Bowtie2: Installed
                    STAR: Installed

Alternatively, list all available references and a markdown table will be generated.

yun list  --root-dir /path/to/your/zoo
Species Assembly Org Release Genome (.fa) Annotation (.gtf) Annotation (.gff3) Bowtie2 STAR
zebrafish GRCz10 Ensembl 90 True True True True True
roundworm WBcel235 Ensembl 90 True True True False False
mouse GRCm38 GENCODE M15 True True True True True
mouse GRCm38 Ensembl 90 True True True True True
human GRCh38 GENCODE 27 True True True True True
human GRCh38 GENCODE 23 False True True False False
human GRCh38 Ensembl 90 True True True True True
brewer_yeast R64-1-1 Ensembl 90 True True True True True

Delete references

Check which GTF and GFF3 files are to be deleted before actually removing them.

yun delete  --root-dir /path/to/your/zoo \
--species zebrafish \
--assembly GRCz10 \
--org Ensembl \
--release 90 \
--target task_annotation \
-n

Run above command without -n and actually remove them. Change task_annotation to all to delete all references of specific zebrafish.

More Examples

here

FAQ

here

mry's People

Contributors

puriney avatar

Stargazers

 avatar

Watchers

 avatar  avatar

mry's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.