Git Product home page Git Product logo

ng-meta-profiler's Introduction

NGLESS-PROFILER

https://ngless.embl.de

This is a collection of predefined ngless pipelines for processing shotgun metagenomes.

  1. human-gut.ngl for human gut samples

  2. marine.ngl for marine samples

  3. mouse-gut.ngl for mouse gut samples

  4. dog-gut.ngl for dog gut samples

  5. pig-gut.ngl for pig gut samples

These are predefined, but users are encouraged to adapt them to their specific needs.

Citation

If you are using NG-meta-profiler (or any NGLess-based pipeline), please cite:

NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language by Luis Pedro Coelho, Renato Alves, Paulo Monteiro, Jaime Huerta-Cepas, Ana Teresa Freitas, Peer Bork, Microbiome (2019) https://doi.org/10.1186/s40168-019-0684-8

INSTALL

You need to

  1. install ngless. The recommended way is through bioconda:

    conda install -c bioconda ngless

Alternatively, see the ngless documentation on how to install it.

  1. copy the profiling scripts to next to where your data lives

USAGE

To use the profiler, select the appropriate script (e.g., human-gut.ngl for human gut samples), put all the FastQ files from the same sample in the same directory (INPUT-DIRECTORY) with the extension .fq.gz or fastq.gz and run:

ngless human-gut.ngl INPUT-DIRECTORY OUTPUT-DIRECTORY

You can use all the ngless command line options to set options. For example, to use 8 cores, use:

ngless --threads=8 human-gut.ngl INPUT-DIRECTORY OUTPUT-DIRECTORY

If you want very verbose output:

ngless --threads=8 --trace human-gut.ngl INPUT-DIRECTORY OUTPUT-DIRECTORY

USAGE WITH DOCKER

We now also provide docker containers containing all required databases on docker hub.

These images can be used directly or through the convenience script ng-meta-profiler.sh. To use ng-meta-profiler.sh simply download the script, decide which profiler to use (human-gut:1.0.0 in the example) and run:

ng-meta-profiler.sh human-gut:1.0.0 INPUT-DIRECTORY OUTPUT-DIRECTORY

which will download and analyse the specified data with the human-gut:1.0.0 profiler.

INPUT and OUTPUT should be absolute paths and INPUT should be a directory containing a metagenome as a set of files named according to the MOCAT format.

RESOURCE REQUIREMENTS

Profiler Memory Disk space
Human 14.7G 18G
Mouse 5.1G 6.3G
Dog 4.5G 3.5G
Pig 10.1G 17G

Note too that the first time you run a particular profiler, NGLess will download and index the respective databases. This is likely to take several hours (and this process does not benefit from using multiple threads). This is only necessary once. Afterwards, the index will be reused (this is what is responsible for disk space usage in the table above).

GETTING HELP

ng-meta-profiler's People

Contributors

luispedro avatar unode avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

ng-meta-profiler's Issues

temporary directory not working

Hi,
I am running the pipeline in an slurm enviroment and even though I specified the -t to write into scratch, the pipeline start to write on ~/.local/share/ngless (specifically /users/valensan/.local/share/ngless/data.tar.gz2776146-0.tmp). Which at some point runs out of disk space.

error log:

Exiting after fatal error:
An unhandled error occurred (this should not happen)!

	If you can reproduce this issue, please run your script
	with the --trace flag and report a bug (including the script and the trace) at
		https://github.com/ngless-toolkit/ngless/issues

The error message was: `/users/valensan/.local/share/ngless/data/Modules/igc.ngm/0.9/igc.fna: hPutBuf: permission denied (Quota exceeded)`

I tried also by exporting another temporary path but didn't work.

what could I do?
thanks in advance!

om-rgc module version mismatch

In

import "om-rgc" version "0.9"
we refer om-rgc 0.9 but we only have om-rgc 0.0.
Should we a) create the 0.9 module or instead b) downgrade the ngless script to 0.0.

Since we also have the ng-meta-profiler scripts on Zenodo, if we go for b) we will need to update those as well.

Profiler for Soil metagenomes

Hello @luispedro @unode

What's the recommendation when dealing with soil metagenomes since there is no particular predefined profiler for it or any other metagenome that has no predefined profiler for that matter?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.