Git Product home page Git Product logo

ezaai's Introduction

EzAAI: High Throughput Prokaryotic AAI Calculator

Build License Bioconda

EzAAI is a suite of workflows for improved AAI calculation performance along with the novel module that provides hierarchical clustering analysis and dendrogram representation.

Quick start with conda

conda install -c bioconda -y ezaai
ezaai -h

Available modules

extract

  • Extract protein database from genome using Prodigal
ezaai extract -i <IN_SEQ> -o <OUT_DB>
Argument Description
-i Input prokaryotic genome sequence
-o Output database

calculate

  • Calculate AAI value from protein databases using MMseqs2
ezaai calculate -i <INPUT_1> -j <INPUT_2> -o <OUTPUT>
Argument Description
-i First input DB / directory with DBs
-j Second input DB / directory with DBs
-o Output result file

convert

  • Convert CDS FASTA file into MMseqs2 database
ezaai convert -i <IN_CDS> -s <SEQ_TYPE> -o <OUT_DB>
Argument Description
-i Input CDS profile (FASTA format)
-s Sequence type of input file (nucl/prot)
-o Output database

cluster

  • Hierarchical clustering of taxa with AAI values
ezaai cluster -i <AAI_TABLE> -o <OUTPUT>
Argument Description
-i Input EzAAI result file containing all-by-all pairwise AAI values
-o Output result file

ezaai's People

Contributors

aaronmussig avatar endixk avatar lebsnu avatar psi103706 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

ezaai's Issues

Error running EzAAI

Hi EzAAI team,
I am trying to run EzAAI but I keep getting this error.
[MAY 03 19:29:36] EzAAI |: EzAAI - v1.0 [Mar. 2021]
getEngineName() called
java.io.FileNotFoundException: mm.label (No such file or directory)
at java.base/java.io.FileInputStream.open0(Native Method)
at java.base/java.io.FileInputStream.open(FileInputStream.java:212)
at java.base/java.io.FileInputStream.(FileInputStream.java:154)
at java.base/java.io.FileInputStream.(FileInputStream.java:109)
at java.base/java.io.FileReader.(FileReader.java:60)
at leb.main.EzAAI.runCalculate(EzAAI.java:284)
at leb.main.EzAAI.run(EzAAI.java:445)
at leb.main.EzAAI.main(EzAAI.java:480)
[MAY 03 19:29:39] ERROR |: Program terminated with error.

I have tried in two different linux systems and I get the same error. Could you help me?

AAI calculation error: tmp file issue?

Hi folks,

I'm trying to use a snakemake workflow to run a number of EzAAI jobs. However, I'm getting an error during extract that makes me a bit worried about whether or not the files will always be right.

For extract, I'm generating a temporary gunzipped file, since I have all references available as .fna.gz. Here's an example command.

gunzip -c /path/to/ref/files/GCA_009909065.1_genomic.fna.gz > GCA_009909065.1.tmp.fna

java -jar EzAAI_latest.jar extract -i GCA_009909065.1.tmp.fna -o GCA_009909065.1.db -l GCA_009909065.1 > GCA_009909065.1.extract.log

rm GCA_009909065.1.tmp.fna

The error I'm (sometimes!) getting is the following:

java.io.FileNotFoundException: /tmp/prodigal.faa (No such file or directory)
        at java.base/java.io.FileInputStream.open0(Native Method)
        at java.base/java.io.FileInputStream.open(FileInputStream.java:219)
        at java.base/java.io.FileInputStream.<init>(FileInputStream.java:157)
        at java.base/java.io.FileReader.<init>(FileReader.java:75)
        at leb.process.ProcCDSPredictionByProdigal.execute(ProcCDSPredictionByProdigal.java:147)
        at leb.main.EzAAI.runExtract(EzAAI.java:224)
        at leb.main.EzAAI.run(EzAAI.java:482)
        at leb.main.EzAAI.main(EzAAI.java:518)

If I'm running more than one EzAAI job on the same node, how would that affect the /tmp/prodigal.faa file for each extraction? Will the later job always fail without overwriting the first /tmp/prodigal.faa file? Running failed jobs a second time usually resolves the issue.

Note, I am running EzAAI in the following conda environment:

conda list -p  /home/ntpierce/miniconda3/7841dd127abab0c21fbc5a5b78f2aefd
# packages in environment at /home/ntpierce/miniconda3/7841dd127abab0c21fbc5a5b78f2aefd:
#
# Name                    Version                   Build  Channel
_libgcc_mutex             0.1                 conda_forge    conda-forge
_openmp_mutex             4.5                       1_gnu    conda-forge
bzip2                     1.0.8                h7f98852_4    conda-forge
ca-certificates           2021.10.8            ha878542_0    conda-forge
gawk                      5.1.0                h7f98852_0    conda-forge
gettext                   0.19.8.1          h73d1719_1008    conda-forge
libffi                    3.4.2                h7f98852_5    conda-forge
libgcc-ng                 11.2.0              h1d223b6_12    conda-forge
libgomp                   11.2.0              h1d223b6_12    conda-forge
libidn2                   2.3.2                h7f98852_0    conda-forge
libstdcxx-ng              11.2.0              he4da1e4_12    conda-forge
libunistring              0.9.10               h7f98852_0    conda-forge
libzlib                   1.2.11            h36c2ea0_1013    conda-forge
mmseqs2                   13.45111             h95f258a_1    bioconda
openssl                   3.0.0                h7f98852_2    conda-forge
prodigal                  2.6.3                h779adbc_3    bioconda
wget                      1.20.3               ha35d2d1_1    conda-forge
zlib                      1.2.11            h36c2ea0_1013    conda-forge

Calculating AAI fails with PGAP annotated genomes

I was trying to use EzAai with genomes that had already annotated using pgap (extracting the database from an existing ".faa" file).

Creating the database with convert from the .faa file runs without errors.

The "calculate" run fails with the following error.

JUL 12 15:30:41] EzAAI |: Calculating AAI... [Task 1/1] [JUL 12 15:30:41] EzAAI |: Preparing to run reciprocal MMSeqs2 search... [JUL 12 15:30:41] EzAAI |: Running MMSeqs2 search... (/tmp/5099f44642a78a35_faa/i0.faa vs. /tmp/5099f44642a78a35_faa/j0.faa) [JUL 12 15:30:45] EzAAI |: Running MMSeqs2 search... (/tmp/5099f44642a78a35_faa/j0.faa vs. /tmp/5099f44642a78a35_faa/i0.faa) java.lang.NullPointerException at leb.process.ProcCalcPairwiseAAI.calcIdentityWithDetails(ProcCalcPairwiseAAI.java:363) at leb.process.ProcCalcPairwiseAAI.pairwiseMmseqs(ProcCalcPairwiseAAI.java:568) at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:243) at leb.main.EzAAI.runCalculate(EzAAI.java:313) at leb.main.EzAAI.run(EzAAI.java:445) at leb.main.EzAAI.main(EzAAI.java:480)

Taking the genomes and using "extract" to run prodigal gene annotation to produce the .db file works as expected.

Maybe there is some characters in the pgap fasta headers that EzAai doesn't like?

Thanks,
Andrew

Coverage and Identity cut-off explanation

Hello everyone,

I've changed the identity percentage cutoff (-id) to 0, 0.2, 0.3, 0.4 while keeping the coverage fixed at 0.5, but the number of matched counts and AAI remains the same. Is it possible for the number of Matched counts to always stay the same?

I also tried using 0 as coverage and varying the identity percentage (0.2, 0.4), but also in this case, the value of matched counts remains identical.

In detail, for my experiments, I would like to see what difference there is between 0.2 and 0.4 as identity cutoffs, and I would expect at least a slight variation in the AAI.

Is there any explanation for this?

Thank you so much.

Laura

list of database profiles A agains profiles B

Dear EzAAI team,

I am wondering whehter it is possible to provide an options when computing many pairs, like in fastANI, a list of query database, and a list of reference database then compare each of the query with each of the reference. This can be extremely useful for large number of pairs and also testing benchmarking speed. Same suggestions for orthoANIu (usearch is open source!) but no idea where to request the features.

Thanks,
Jianshu

AAI calculation error

I am trying to calculate AAI using EZAAI. I have already created the DB file from genome but when executing command for AAI calculation, programme is terminated in between. Below is the screenshot for same.
$ java -jar EzAAI_latest.jar calculate -i db/ -j db/ -o out/SAOS_164_21_8_21.tsv
[AUG 21 14:09:44] EzAAI |: EzAAI - v1.0 [Mar. 2021]
[AUG 21 14:09:47] EzAAI |: Calculating AAI... [Task 1/1600]
[AUG 21 14:09:59] EzAAI |: Calculating AAI... [Task 2/1600]
[AUG 21 14:10:09] EzAAI |: Calculating AAI... [Task 3/1600]
[AUG 21 14:10:21] EzAAI |: Calculating AAI... [Task 4/1600]
[AUG 21 14:10:32] EzAAI |: Calculating AAI... [Task 5/1600]
[AUG 21 14:10:44] EzAAI |: Calculating AAI... [Task 6/1600]
[AUG 21 14:10:55] EzAAI |: Calculating AAI... [Task 7/1600]
[AUG 21 14:11:05] EzAAI |: Calculating AAI... [Task 8/1600]
[AUG 21 14:11:16] EzAAI |: Calculating AAI... [Task 9/1600]
java.lang.NullPointerException
at leb.process.ProcCalcPairwiseAAI.calcIdentityWithDetails(ProcCalcPairwiseAAI.java:364)
at leb.process.ProcCalcPairwiseAAI.pairwiseMmseqs(ProcCalcPairwiseAAI.java:568)
at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:243)
at leb.main.EzAAI.runCalculate(EzAAI.java:313)
at leb.main.EzAAI.run(EzAAI.java:445)
at leb.main.EzAAI.main(EzAAI.java:480)
[AUG 21 14:11:26] ERROR |: Program terminated with error.

calculation issue java.io.IOException: Cannot delete file: /tmp/ezaai/78e2b9bcc9dd8cf6_tmp/8191412700641163904

Hi developers, I ran the program but got this error.
[MAR 15 12:15:10] EzAAI |: EzAAI - v1.2.3 [Feb. 2024]
[MAR 15 12:15:20] EzAAI |: Calculating AAI... [Task 1/484]
org.apache.commons.io.IOExceptionList: /tmp/ezaai/78e2b9bcc9dd8cf6_tmp
at org.apache.commons.io.FileUtils.cleanDirectory(FileUtils.java:331)
at org.apache.commons.io.FileUtils.deleteDirectory(FileUtils.java:1192)
at leb.process.ProcCalcPairwiseAAI.pairwiseMmseqs(ProcCalcPairwiseAAI.java:399)
at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:67)
at leb.main.EzAAI.runCalculate(EzAAI.java:511)
at leb.main.EzAAI.run(EzAAI.java:689)
at leb.main.EzAAI.main(EzAAI.java:725)
Caused by: java.io.IOException: Cannot delete file: /tmp/ezaai/78e2b9bcc9dd8cf6_tmp/8191412700641163904
at org.apache.commons.io.FileUtils.forceDelete(FileUtils.java:1344)
at org.apache.commons.io.FileUtils.cleanDirectory(FileUtils.java:324)
... 6 more
Caused by: java.nio.file.AccessDeniedException: /tmp/ezaai/78e2b9bcc9dd8cf6_tmp/8191412700641163904/pref_0.dbtype
at java.base/sun.nio.fs.UnixException.translateToIOException(UnixException.java:90)
at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:106)
at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111)
at java.base/sun.nio.fs.UnixFileSystemProvider.implDelete(UnixFileSystemProvider.java:291)
at java.base/sun.nio.fs.AbstractFileSystemProvider.deleteIfExists(AbstractFileSystemProvider.java:109)
at java.base/java.nio.file.Files.deleteIfExists(Files.java:1191)
at org.apache.commons.io.file.DeletingPathVisitor.visitFile(DeletingPathVisitor.java:161)
at org.apache.commons.io.file.DeletingPathVisitor.visitFile(DeletingPathVisitor.java:37)
at java.base/java.nio.file.Files.walkFileTree(Files.java:2786)
at java.base/java.nio.file.Files.walkFileTree(Files.java:2857)
at org.apache.commons.io.file.PathUtils.visitFileTree(PathUtils.java:971)
at org.apache.commons.io.file.PathUtils.deleteDirectory(PathUtils.java:434)
at org.apache.commons.io.file.PathUtils.delete(PathUtils.java:391)
at org.apache.commons.io.FileUtils.forceDelete(FileUtils.java:1341)
... 7 more
[MAR 15 12:15:24] ERROR |: Program terminated with error.

Any suggestions?

ezaai calculate issue: temp directories being created read only, java errors out when temp files can't be deleted

I've been trying to run this software on a test dataset. I can convert my protein sequence files to ezaai databases, no trouble. But when I try to actually calculate the AAI :
java -jar ~/Downloads/EzAAI.jar calculate -t 64 -i EzAAI_dbs/CP040925.1_NCHU2014.db -j EzAAI_dbs/CP097207.1_PR08.db -o EzAAI_test_output

I get:
[MAR 07 14:42:18] EzAAI |: EzAAI - v1.2.3 [Feb. 2024] [MAR 07 14:42:18] EzAAI |: Calculating AAI... [Task 1/1] org.apache.commons.io.IOExceptionList: /tmp/ezaai/b72f86d85f7e7f28_tmp at org.apache.commons.io.FileUtils.cleanDirectory(FileUtils.java:331) at org.apache.commons.io.FileUtils.deleteDirectory(FileUtils.java:1192) at leb.process.ProcCalcPairwiseAAI.pairwiseMmseqs(ProcCalcPairwiseAAI.java:399) at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:67) at leb.main.EzAAI.runCalculate(EzAAI.java:511) at leb.main.EzAAI.run(EzAAI.java:689) at leb.main.EzAAI.main(EzAAI.java:725) Caused by: java.io.IOException: Cannot delete file: /tmp/ezaai/b72f86d85f7e7f28_tmp/9614918276263326714 at org.apache.commons.io.FileUtils.forceDelete(FileUtils.java:1344) at org.apache.commons.io.FileUtils.cleanDirectory(FileUtils.java:324) ... 6 more Caused by: java.nio.file.AccessDeniedException: /tmp/ezaai/b72f86d85f7e7f28_tmp/9614918276263326714/pref_0.dbtype at java.base/sun.nio.fs.UnixException.translateToIOException(UnixException.java:90) at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111) at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:116) at java.base/sun.nio.fs.UnixFileSystemProvider.implDelete(UnixFileSystemProvider.java:249) at java.base/sun.nio.fs.AbstractFileSystemProvider.deleteIfExists(AbstractFileSystemProvider.java:110) at java.base/java.nio.file.Files.deleteIfExists(Files.java:1181) at org.apache.commons.io.file.DeletingPathVisitor.visitFile(DeletingPathVisitor.java:161) at org.apache.commons.io.file.DeletingPathVisitor.visitFile(DeletingPathVisitor.java:37) at java.base/java.nio.file.Files.walkFileTree(Files.java:2725) at java.base/java.nio.file.Files.walkFileTree(Files.java:2797) at org.apache.commons.io.file.PathUtils.visitFileTree(PathUtils.java:971) at org.apache.commons.io.file.PathUtils.deleteDirectory(PathUtils.java:434) at org.apache.commons.io.file.PathUtils.delete(PathUtils.java:391) at org.apache.commons.io.FileUtils.forceDelete(FileUtils.java:1341)

Checking the /tmp directory, the directory created (/tmp/ezaai/b72f86d85f7e7f28_tmp/9614918276263326714 in this case) has permissions:
dr-xr-xr-x. 2 myusername myusername 260 Mar 7 14:42 9614918276263326714

So readable and executable, but not writeable, which extends to the files it contains. The folder containing this folder has write permissions just fine, as does the other folder active when the software errors out (the _faa folder). Any ideas?

OS is RockyLinux 9.3

Sequence matched CDS

Hi all,
I was wondering how I could obtain the sequence of the matched CDS between two genomes.
Thank you in advance,
Arancha

Is there any way to name db with a variable?

Hello! Firstly, thanks for creating and maintaining this great tool :) I've been using it on a large collection of genomes, and I've run into a small issue. Basically, I am looping over a list of directories, and I'd like to name the output db with the variable name for later comparison. While I can easily go back and rename them later, I was wondering if there was a way to do this in the script more easily.

Here is a portion of my script:

cd $i
fna=*.fna
mkdir EZAAIdb
echo "finding_prot" $fna
ezaai extract -i $fna -o EZAAIdb/$fna.db

While everything else works well, the final name of the db for all my files is "*.fna.db" - do you have any tips for fixing this? Thanks!

Ezaai calculate issue

Hello developers,

I'm trying to test the program for AAI calculation and I stumbled upon an issue when I tried to execute the calculate module. The program was installed with Conda and I can create a database file fine. I ran the following command:
ezaai calculate -i db/ -j db/ -o aai.tsv

I got this:
[APR 08 06:16:16] EzAAI |: EzAAI - v1.2.3 [Feb. 2024]
java.io.FileNotFoundException: mm.label (No such file or directory)
at java.io.FileInputStream.open0(Native Method)
at java.io.FileInputStream.open(FileInputStream.java:195)
at java.io.FileInputStream.(FileInputStream.java:138)
at java.io.FileInputStream.(FileInputStream.java:93)
at java.io.FileReader.(FileReader.java:58)
at leb.main.EzAAI.runCalculate(EzAAI.java:456)
at leb.main.EzAAI.run(EzAAI.java:689)
at leb.main.EzAAI.main(EzAAI.java:725)

I also tried giving the -tmp argument to the command and the same error appeared.
Any ideas on how to solve this?

/Aleks

error for automatic detection of .db files from the directory

Hallo,
I want to calculate AAI values and use as input just the directory with the profile DBs (db/) as shown in the tutorial

java -jar ~/bin/EzAAI.jar  calculate -i db/ -j db/ -o aai_out/test.tsv

However, I get the following error:

EzAAI - v1.2 [Mar. 2022]
java.io.FileNotFoundException: mm.label (No such file or directory)
at java.io.FileInputStream.open0(Native Method)
at java.io.FileInputStream.open(FileInputStream.java:195)
at java.io.FileInputStream.(FileInputStream.java:138)
at java.io.FileInputStream.(FileInputStream.java:93)
at java.io.FileReader.(FileReader.java:58)
at leb.main.EzAAI.runCalculate(EzAAI.java:307)
at leb.main.EzAAI.run(EzAAI.java:482)
at leb.main.EzAAI.main(EzAAI.java:517)

I also tried it with "db/*.db" but than just the selfcomparison of the first db file is calculated.
MMseqs2 Version: 13.45111

Thanks

Error creating database

Hello,

I am unable to see a .db file created using either extract or convert. I set up the same sample folder as the tutorial, with 10 of my genomes in the fasta folder.

java -jar EzAAI_latest.jar extract -i fasta/MAG_48_Clean_2N.fa -o db/MAG_48_Clean_2N.db -l "Methanolobus sp. SBSPR1"
EzAAI |: EzAAI - v1.0 [Mar. 2021]
EzAAI |: Running prodigal on genome fasta/MAG_48_Clean_2N.fa...
getEngineName() called
getEngineName() called
EzAAI |: Converting given CDS file into profile database... (fasta/MAG_48_Clean_2N.fa.faa -> db/MAG_48_Clean_2N.db)
EzAAI |: Task finished.

It all looks like it worked well. But then when I check the db/ folder, there is no MAG_48_Clean_2N.db file. Same thing happened when I ran prodigal manually outside of EzAAI to make a protein CDS file and tried to convert it to the .db

It seems like the issue is the getEngineName() called, as I do not see that in the tutorial.

If it helps here is my java info.
java version "15.0.1" 2020-10-20
Java(TM) SE Runtime Environment (build 15.0.1+9-18)
Java HotSpot(TM) 64-Bit Server VM (build 15.0.1+9-18, mixed mode, sharing)

Please let me know if you have any suggestions. Thanks!

`calculate` error when using `-p blastp`

Thanks for your help with the previous issue - I'm back with a different one!

I'd like to use blastp for some AAI calculations. I used conda to install blastp in the same environment I was previously using for EzAAI calculations (info below). I'm running a single instance at a time.

As I understand it, I use the same extract command to prepare databases, regardless of mapping type - is that incorrect?

Using the sample data from your tutorial:

java -jar EzAAI_v1.11.jar calculate -i db/Ci.db -j db/Lc.db \
      -t 10 -p blastp -o test.blastp.tsv -v

I get the following error:

[FEB 24 08:51:15] EzAAI   |:  EzAAI - v1.11 [Feb. 2022]
getEngineName() called
[FEB 24 08:51:16] EzAAI   |:  Calculating AAI... [Task 1/1]
[FEB 24 08:51:16] EzAAI   |:  Preparing to run reciprocal BLASTp+...
[FEB 24 08:51:16] EzAAI   |:  Running BLASTp+... (/tmp/bdd397448b571442_faa/i0.faa vs. /tmp/bdd397448b571442_faa/j0.faa)
java.io.FileNotFoundException: /tmp/bdd397448b571442_vice.out (No such file or directory)
        at java.base/java.io.FileInputStream.open0(Native Method)
        at java.base/java.io.FileInputStream.open(FileInputStream.java:219)
        at java.base/java.io.FileInputStream.<init>(FileInputStream.java:157)
        at java.base/java.io.FileReader.<init>(FileReader.java:75)
        at leb.process.ProcFuncAnnoByBlastPlus.parseOutFile(ProcFuncAnnoByBlastPlus.java:283)
        at leb.process.ProcFuncAnnoByBlastPlus.execute(ProcFuncAnnoByBlastPlus.java:273)
        at leb.process.ProcCalcPairwiseAAI.pairwiseBlastp(ProcCalcPairwiseAAI.java:499)
        at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:248)
        at leb.main.EzAAI.runCalculate(EzAAI.java:361)
        at leb.main.EzAAI.run(EzAAI.java:493)
        at leb.main.EzAAI.main(EzAAI.java:528)
[FEB 24 08:51:16] ERROR   |:  Program terminated with error.

Please let me know if you have ideas on how to proceed. Thanks!

Conda environment:

conda list -p /home/ntpierce/miniconda3/066c58eb7ac25d2ab6a302135cb3ad40
# packages in environment at /home/ntpierce/miniconda3/066c58eb7ac25d2ab6a302135cb3ad40:
#
# Name                    Version                   Build  Channel
_libgcc_mutex             0.1                 conda_forge    conda-forge
_openmp_mutex             4.5                       1_gnu    conda-forge
blast                     2.12.0               h3289130_3    bioconda
bzip2                     1.0.8                h7f98852_4    conda-forge
c-ares                    1.18.1               h7f98852_0    conda-forge
ca-certificates           2021.10.8            ha878542_0    conda-forge
curl                      7.81.0               h494985f_0    conda-forge
entrez-direct             16.2                 he881be0_0    bioconda
gawk                      5.1.0                h7f98852_0    conda-forge
gettext                   0.19.8.1          h73d1719_1008    conda-forge
krb5                      1.19.2               h48eae69_3    conda-forge
libcurl                   7.81.0               h494985f_0    conda-forge
libedit                   3.1.20191231         he28a2e2_2    conda-forge
libev                     4.33                 h516909a_1    conda-forge
libffi                    3.4.2                h7f98852_5    conda-forge
libgcc-ng                 11.2.0              h1d223b6_12    conda-forge
libgomp                   11.2.0              h1d223b6_12    conda-forge
libidn2                   2.3.2                h7f98852_0    conda-forge
libnghttp2                1.46.0               ha19adfc_0    conda-forge
libssh2                   1.10.0               ha35d2d1_2    conda-forge
libstdcxx-ng              11.2.0              he4da1e4_12    conda-forge
libunistring              0.9.10               h7f98852_0    conda-forge
libzlib                   1.2.11            h36c2ea0_1013    conda-forge
mmseqs2                   13.45111             h95f258a_1    bioconda
ncurses                   6.3                  h9c3ff4c_0    conda-forge
openssl                   3.0.0                h7f98852_2    conda-forge
pcre                      8.45                 h9c3ff4c_0    conda-forge
perl                      5.32.1          1_h7f98852_perl5    conda-forge
perl-archive-tar          2.40            pl5321hdfd78af_0    bioconda
perl-carp                 1.50            pl5321hd8ed1ab_0    conda-forge
perl-common-sense         3.75            pl5321hdfd78af_0    bioconda
perl-compress-raw-bzip2   2.101           pl5321h1b792b2_0    bioconda
perl-compress-raw-zlib    2.101           pl5321h1b792b2_1    bioconda
perl-encode               3.16            pl5321h779adbc_0    bioconda
perl-exporter             5.74            pl5321hd8ed1ab_0    conda-forge
perl-exporter-tiny        1.002002        pl5321hdfd78af_0    bioconda
perl-extutils-makemaker   7.64            pl5321hd8ed1ab_0    conda-forge
perl-io-compress          2.102           pl5321h1b792b2_0    bioconda
perl-io-zlib              1.11            pl5321hdfd78af_0    bioconda
perl-json                 4.05            pl5321hdfd78af_0    bioconda
perl-json-xs              2.34            pl5321h7d875b9_4    bioconda
perl-list-moreutils       0.430           pl5321hdfd78af_0    bioconda
perl-list-moreutils-xs    0.430           pl5321h779adbc_0    bioconda
perl-parent               0.238           pl5321hd8ed1ab_0    conda-forge
perl-pathtools            3.75            pl5321h779adbc_2    bioconda
perl-scalar-list-utils    1.60            pl5321h779adbc_0    bioconda
perl-types-serialiser     1.01            pl5321hdfd78af_0    bioconda
prodigal                  2.6.3                h779adbc_3    bioconda
tk                        8.6.11               h27826a3_1    conda-forge
wget                      1.20.3               ha35d2d1_1    conda-forge
zlib                      1.2.11            h36c2ea0_1013    conda-forge

How about eukaryotic proteins

Dear developer,

I'm curious if ezaai is also suitable for AAI calculations between eukaryotic proteins, please note that proteins and not genomes are entered here.

Looking forward your reply. Thanks a lot.

ERROR: invalid input file given

Hi,
I installed Ezaai using conda. However, there is an error when running the command as shown below:

Command: ezaai extract -i fasta/Ci.fasta -o db/seq_db -dev
[MAR 13 17:12:16] EzAAI |: EzAAI - v1.2.3 [Feb. 2024]
[MAR 13 17:12:16] EzAAI |: Checking dependencies...
[MAR 13 17:12:16] DEV |: exec: prodigal -h
[MAR 13 17:12:17] DEV |: exec: mmseqs -h
[MAR 13 17:12:17] DEV |: EzAAI - extract module
[MAR 13 17:12:17] EzAAI |: Running prodigal on genome fasta/Ci.fasta...
[MAR 13 17:12:17] DEV |: exec: prodigal -i fasta/Ci.fasta -f gff -o /tmp/ezaai/aefac6c266baa84a.gff -a /sample/fasta/Ci.fasta.faa -d /tmp/ezaai/aefac6c266baa84a.ffn -q -p meta -g 11
[MAR 13 17:12:17] EzAAI |: Creating a submodule for converting .faa into .db...
[MAR 13 17:12:17] ERROR |: Invalid input file given.
[MAR 13 17:12:17] ERROR |: Program terminated with error.

How to solve this issue?
Thanks.

calculation issue

Hello,
I tried using EzAAI for a calculation. The first time, it worked without any issues, but I encountered an error on my second attempt. Can you explain what the 'mm.label' file is? Here's what my terminal showed:
This is my code: java -jar EzAAI.jar calculate -i db/ -j mmseqs/ -o out/aai.tsv

java.io.FileNotFoundException: mm.label (No such file or directory)
at java.base/java.io.FileInputStream.open0(Native Method)
at java.base/java.io.FileInputStream.open(FileInputStream.java:213)
at java.base/java.io.FileInputStream.(FileInputStream.java:152)
at java.base/java.io.FileInputStream.(FileInputStream.java:106)
at java.base/java.io.FileReader.(FileReader.java:60)
at leb.main.EzAAI.runCalculate(EzAAI.java:456)
at leb.main.EzAAI.run(EzAAI.java:689)
at leb.main.EzAAI.main(EzAAI.java:725)
[MAR 03 01:26:15] ERROR |: Program terminated with error.

I would appreciate your assistance in resolving this issue.
Thanks.

AAI calculation error

Hello. I am totally appreciative with all the efforts poured in creating this AAI tool. Well done.

I have just some issue with the second step, the calculation of AAI. I have successfully extracted the DB from my genome data files. However, whenever I run the command for the AAI calculation, I got this error prompt:
MAR 07 16:12:03] EzAAI |: EzAAI - v1.2 [Mar. 2022]
[MAR 07 16:12:42] EzAAI |: Calculating AAI... [Task 1/5776]
java.io.FileNotFoundException: /tmp/31f9c2de403cfb4b_faa/i0.faa (No such file or directory)
at java.io.FileInputStream.open0(Native Method)
at java.io.FileInputStream.open(FileInputStream.java:195)
at java.io.FileInputStream.(FileInputStream.java:138)
at java.io.FileInputStream.(FileInputStream.java:93)
at java.io.FileReader.(FileReader.java:58)
at leb.process.ProcCalcPairwiseAAI.pairwiseMmseqs(ProcCalcPairwiseAAI.java:308)
at leb.process.ProcCalcPairwiseAAI.calculateProteomePairWithDetails(ProcCalcPairwiseAAI.java:58)
at leb.main.EzAAI.runCalculate(EzAAI.java:350)
at leb.main.EzAAI.run(EzAAI.java:482)
at leb.main.EzAAI.main(EzAAI.java:517)
[MAR 07 16:12:42] ERROR |: Program terminated with error.


I just don't know if it was just my fault, because I am totally new to this "coding" world.
I hope you could attend to this query despite of your hectic schedules.

Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.