Git Product home page Git Product logo

eutyper's Introduction

euTyper

Work In Progress

Chewie steps into the Eukarya Domain

euTyper is a tool to create gene-by-gene typing schemas and perform allele calling for eukaryotes.

Dependencies

Python >= 3.7
BioPython
Augustus
BLAST Parallel

Usage

-h, --help      show this help message and exit
-i INPUT_FILES  Path to a FASTA file with contig sequences for a single
                genome.
-o OUTPUT_PATH  Path to the directory that will store output files created
                by the process.
-s sample_metadata path to samplemetadata (TSV file containing fastafile name, species, codon number.
--bsr BSR       BLAST Score Ratio value used as threshold.
--t THREADS     Number of threads used by BLASTp to align translated genes.
--p probabbility probability change of a gene found by Augsustus, used as an cutoff


exstra functionality for flexibility and not needing to rerun euTyper witht the smalles changes

add loci

--add_loci path path to fasta file containing the CDS of each gene in it. This parameters enables to directly add genes to a folder containing all genes
--codon Required codon table number to calculate the AA sequence of the CDS
--force Forces the genes from the add_loci parameter to be directly added to the clustering dir (genes will have a forced prefix)
-o OUTPUT_PATH  Path to the directory that will store output files created
                by the process.

add genomes

--add_gene path to directory containing the fastafiles of genomes to be added to processed genomes
-s sample_metadata path to samplemetadata (TSV file containing fastafile name, species, codon number.
-o OUTPUT_PATH  Path to the directory that will store output files created
                by the process.
-recluster If recluster = True, after adding the gene, all known genes will be directly evaluated by Clustering and blasting to retrieve the 
--bsr BSR       BLAST Score Ratio value used as threshold.
--t THREADS     Number of threads used by BLASTp to align translated genes.
--p probabbility probability change of a gene found by Augsustus, used as an cutoff

example commant with test files

normal run

python3 euTyper.py -i /path/to/your/favourite/directory/containing/genomes -o /output/folder/path/of/directory/be/created/and/files/stored -s /path/to/sample_metadata --t 30 --p 0.9

incase euTyper already ran, and some adjustments are required

add loci directly to scheme seed

python3 euTyper.py -add_loci /path/to/your/favourite/directory/containing/genes -o --t 30 --codon 12 --forced True 

If genome to list of genes

python3 euTyper.py -add_gene /path/to/your/favourite/directory/containing/genomes -o /output/folder/path/of/directory/be/created/and/files/stored -s /path/to/sample_metadata --t 30 --p 0.7 -recluster True

generating the scheme seed again from already processed genes

python3 euTyper.py --reclyster -o  /output/folder/path/of/directory/be/created/and/files/stored --t 20

adding already processed augustus files

python3 euTyper.py -o /output/folder/path/of/directory/be/created/and/files/stored --t 30 -s /path/to/sample_metadata -i //path/to/your/favourite/directory/containing/genomes --p 0.5 --re_use_augustus /work/directory/of/already/processed/euTyper

eutyper's People

Contributors

dirowa avatar rfm-targa avatar

Watchers

James Cloos avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.