maharank-workflow

Snakemake workflow to generate the reference files for MahaRank

This workflow consists of the following nine sub-workflows:

  • workflow/identifier.smk: ID retrieval

  • workflow/download.smk: Data download

  • workflow/divide_plasmid_host.smk: Dividing plasmid/host amino acid sequences

  • workflow/stats.smk: Statistics such as length and GC content of amino acid sequences

  • workflow/rho.smk: k-mer

  • workflow/reference.smk: Reference files (reference.tar.gz, reference_slim.tar.gz)

  • workflow/distance.smk: Distance calculation

  • workflow/roc.smk: ROC/AUC analysis

  • workflow/plot.smk: Plot of ROC/AUC analysis

Requirements

  • Bash: GNU bash, version 4.2.46(1)-release (x86_64-redhat-linux-gnu)
  • Snakemake: 7.20.0
  • Singularity: 3.9.2

Summary

How to reproduce this workflow

On a local machine:

snakemake -s workflow/identifier.smk -j 4 --use-singularity
snakemake -s workflow/download.smk -j 4 --use-singularity
snakemake -s workflow/divide_plasmid_host.smk -j 4 --use-singularity
snakemake -s workflow/stats.smk -j 4 --use-singularity
snakemake -s workflow/rho.smk -j 4 --use-singularity
snakemake -s workflow/reference.smk -j 4 --use-singularity
snakemake -s workflow/distance.smk -j 4 --use-singularity
snakemake -s workflow/roc.smk -j 1 --use-singularity
snakemake -s workflow/plot.smk -j 1 --use-singularity
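
The nine local commands above differ only in the Snakefile name and the job count, so they can be chained from one small script. The following is a minimal sketch, not part of the workflow itself: the names `STEPS`, `jobs_for`, `run_step`, `run_all`, and the `DRY_RUN` variable are hypothetical helpers introduced here for illustration. By default the sketch only prints the commands; it assumes that each step should finish successfully before the next one starts.

```shell
#!/usr/bin/env bash
# Sketch: chain the nine sub-workflows in the order listed above.
# All helper names and DRY_RUN are hypothetical, not part of maharank-workflow.
set -eu

# Step names, in the same order as the local commands above.
STEPS="identifier download divide_plasmid_host stats rho reference distance roc plot"

# Print the job count for a step: roc and plot use -j 1 above, the rest -j 4.
jobs_for() {
  case "$1" in
    roc|plot) echo 1 ;;
    *) echo 4 ;;
  esac
}

# Print (DRY_RUN=1, the default in this sketch) or execute (DRY_RUN=0) one step.
run_step() {
  step="$1"
  set -- snakemake -s "workflow/${step}.smk" -j "$(jobs_for "$step")" --use-singularity
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Run every step in order; with `set -e`, a failing step stops the chain.
run_all() {
  for s in $STEPS; do
    run_step "$s"
  done
}

run_all
```

Saved under a name of your choosing, the script prints the nine commands when run as is; setting `DRY_RUN=0` in the environment makes it execute them instead.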

In a parallel environment (GridEngine):

snakemake -s workflow/identifier.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/download.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/divide_plasmid_host.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/stats.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/rho.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/reference.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/distance.smk -j 32 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity
snakemake -s workflow/roc.smk -j 1 --cluster "qsub -l nc=4 -p -50 -r yes -q large.q" --latency-wait 2000 --use-singularity
snakemake -s workflow/plot.smk -j 1 --cluster "qsub -l nc=4 -p -50 -r yes -q node.q" --cluster-cancel qdel --latency-wait 2000 --use-singularity

In a parallel environment (Slurm):

snakemake -s workflow/identifier.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/download.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/divide_plasmid_host.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/stats.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/rho.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/reference.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/distance.smk -j 32 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/roc.smk -j 1 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
snakemake -s workflow/plot.smk -j 1 --cluster sbatch --cluster-cancel scancel --latency-wait 2000 --use-singularity
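
The GridEngine and Slurm command lists above follow one pattern per scheduler, with two exceptions: `roc` and `plot` use `-j 1`, and the GridEngine `roc` step is submitted to `large.q` without `--cluster-cancel`. As a rough sketch, the pattern can be captured in a function that prints the command for a given step so it can be reviewed before being run. The function name `cluster_cmd` and the scheduler labels `sge` and `slurm` are hypothetical, chosen here for illustration only; the queue names and `qsub` options are copied from the commands above and are site-specific.

```shell
#!/usr/bin/env bash
# Sketch: print the cluster command for one step, mirroring the lists above.
# cluster_cmd and the labels "sge"/"slurm" are hypothetical names.
set -eu

cluster_cmd() {
  step="$1"
  scheduler="$2"
  # roc and plot run with a single job; all other steps use 32 jobs.
  case "$step" in
    roc|plot) jobs=1 ;;
    *) jobs=32 ;;
  esac
  base="snakemake -s workflow/${step}.smk -j ${jobs}"
  tail_flags="--latency-wait 2000 --use-singularity"
  case "$scheduler" in
    sge)
      if [ "$step" = "roc" ]; then
        # As listed above: large.q queue, no --cluster-cancel.
        echo "${base} --cluster \"qsub -l nc=4 -p -50 -r yes -q large.q\" ${tail_flags}"
      else
        echo "${base} --cluster \"qsub -l nc=4 -p -50 -r yes -q node.q\" --cluster-cancel qdel ${tail_flags}"
      fi
      ;;
    slurm)
      echo "${base} --cluster sbatch --cluster-cancel scancel ${tail_flags}"
      ;;
    *)
      echo "unknown scheduler: ${scheduler}" >&2
      return 1
      ;;
  esac
}

# Example: show the Slurm command for the stats step.
cluster_cmd stats slurm
```

Because the function only prints, adapting it to another site comes down to editing the queue names and `qsub` options in one place.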

License

Copyright (c) 2023 Koki Tsuyuzaki, Artistic License 2.0.

Authors

  • Koki Tsuyuzaki
