Git Product home page Git Product logo

cnn-mgp's Introduction

Convolutional Neural Networks for Metagenomics Gene Prediction(CNN-MGP)

CNN-MGP scripts is a metagenomic ORF finding tool for the prediction of protein coding genes in short, environmental DNA sequences with unknown phylogenetic origin. CNN-MGP is based on 10 models trained on 10 pre-defined GC content ranges. The scripts contains two main parts ORF FINDER handling the extraction of all possible ORFs from raw data and the second partis one hot ecodind the orfs afterward feeds them to the right model. CNN-MGP analyses can be performed via the CNN-MGP website Coming soon, or alternatively you can run the script from the commande line. The instructions below discuss use of CNN-MGP at the command line, following a general overview of how CNN-MGP works.

Predicting genes

If metagenomic fragements are submitted, CNN-MGP first extracts all complete and incomplete open reading frames (ORFs) using our integrated orf finder (casting ORFS less than 80 bp), The user customize the desirable parametre for the script and the output is eather the predicted nucliotide CDS, protein sequences or both.

Table of Contents

License

Use or reproduction of these materials, in whole or in part, by any commercial organization whether or not for non-commercial (including research) or commercial purposes is allowed.

Citation

Al-Ajlan, A., El Allali, A. CNN-MGP: Convolutional Neural Networks for Metagenomics Gene Prediction. [CNN-MGP]

Support & Bug Reports

Please log an issue on github issue.

You can email the CARD curators or developers directly at [email protected].

Python version

-Install python 3.7 (64bit) or higher from the officiel website Python 3.7.

Requirements

  • tqdm
  • tensorflow==2.3.0

Install Dependencies

  • pip3 install -r requirements.txt
  • or
  • pip3 install tqdm tensorflow==2.3.0

Help Menu and Usage

The following command will bring up CNN-MGP's main help menu:

CNN-MGP --help
usage: CNN-MGP <command> [<args>]
      commands are:
         ---------------------------------------------------------------------------------------
         -i
         ---------------------------------------------------------------------------------------

         load the input file(fasta)

         ---------------------------------------------------------------------------------------
         -o
         ---------------------------------------------------------------------------------------

        Specify the output file name

         ---------------------------------------------------------------------------------------
         -min
         ---------------------------------------------------------------------------------------

         The minimun orf lenght, Default 80

         ---------------------------------------------------------------------------------------
         -u
         ---------------------------------------------------------------------------------------

         Type 1 for unresolved start codons(ie, ATG, CTG, GTG, TTG) recommanded
         Type 0 for start codon(ATG)

         ---------------------------------------------------------------------------------------
         -st
         ---------------------------------------------------------------------------------------

         Type nucl for the output file to be nucleotide CDS
         Type Prot for the output file to be protein translated gene
         Type Both for two output files

         ---------------------------------------------------------------------------------------

cnn-mgp's People

Contributors

rachidelfermi avatar rachidelfe avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.