Git Product home page Git Product logo

Comments (5)

amarinderthind avatar amarinderthind commented on July 3, 2024

Hi, it should be path of the directory....

if

fasta path DB/HUMAN_RNA/rRNA.fasta
idx path DB/HUMAN_RNA/rRNA.idx

You should mention this in the following way (for configuration file)

RIBO_DB=DB/HUMAN_RNA
RIBO_NAME=rRNA

Let me know, if it doesn't solve your issue.

from decontaminer.

loubean4073 avatar loubean4073 commented on July 3, 2024

Thank you for your reply! Actually, I strictly follow the User Guide.pdf for setting up the config file.
Here is my running log in terminal showed below, and I notice that except the error I mentioned yesterday, the last sentence shows that:

Wrong filepath or input file not existent: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_UNALIGNED.fasta.

And I check the RIBO folder, It's really empty there. It means that there is anything to be written in this folder.
Looking forward to your reply.


bash ./decontaMiner.sh -i /home/kc/workspace/fasta/data_test/website_examples -o /home/kc/workspace/output -c /home/kc/decontaMiner_1.4/config_files/configure.txt

-- WELCOME TO THE DECONTAMINER PIPELINE --

INFO: input file format (-f) not specified. BAM assumed
INFO: paired/single end flag -s not specified. PAIRED END assumed (default option).
INFO: quality filtering flag -Q not specified. Quality filtering is ACTIVE (default option).
INFO: ribosomal filtering flag -R not specified. Human ribosomal filtering is ACTIVE (default option).

  • PROCESSING DETAILS
    INFO: processing input in BAM format
    INFO: processing PAIRED END data.
    INFO: quality encoding (-e ) not specified. Default: 33
    INFO: percentage threshold (-p) not specified. Default: 100
    INFO: quality value threshold (-q) not specified. Default: 20
    INFO: no option was specified for contaminating organisms. Assuming all (BACTERIA/FUNGI/VIRUSES, -bfv)

  • LOADING AND CHECKING THE CONFIGURATION FILE :
    INFO: configuration file is: /home/kc/decontaMiner_1.4/config_files/configure.txt

EXTERNAL SOFTWARE
INFO: samtools executable is located at /usr/bin/samtools
INFO: fastx executable is located at /usr/bin/fastq_quality_filter
INFO: sortmerna executable is located at /home/kc/miniconda3/bin/sortmerna
INFO: blastn executable is located at /home/kc/ncbi-blast-2.12.0+/bin/blastn

HUMAN RIBOSOMAL/MITOCHONDRIAL DNA
INFO: the ribosomal database rRNA (fasta + indexes) is located at /home/kc/workspace/fasta/Human_rna

CONTAMINATING ORGANISMS DATABASES
INFO: the bacteria database named Bacteria is located at /home/kc/workspace/fasta/BACTERIA_db
INFO: the fungi database named Fungi is located at /home/kc/workspace/fasta/FUNGI_db
INFO: the viruses database named Virus is located at /home/kc/workspace/fasta/VIRUSES_db

  • PROCESSING STARTED
    INFO: executable files will be stored in the temporary directory: /home/kc/decontaMiner_1.4/shell_scripts/TEMP
    INFO: results will be stored in the output directory: /home/kc/workspace/output/RESULTS
    INFO: intermediate files results will be stored in the output directory: /home/kc/workspace/output/INTERMEDIATE_FILES
    INFO: fasta files will be stored in the output directory: /home/kc/workspace/output/INTERMEDIATE_FILES/FASTA_FILES
    INFO: fastq files will be stored in the output directory: /home/kc/workspace/output/INTERMEDIATE_FILES/FASTQ_FILES
    INFO: fastq files will be stored in the output directory: /home/kc/workspace/output/INTERMEDIATE_FILES/FASTQ_FILES
    INFO: sorting the bam file. Please wait, this might take some time...
    done. Output file written in dir /home/kc/workspace/output/INTERMEDIATE_FILES/SORTED/unmapped_1.qsort
    INFO: converting to fastq ...
    [M::bam2fq_mainloop] discarded 0 singletons
    [M::bam2fq_mainloop] processed 1896735 reads
    done. Output file written in dir /home/kc/workspace/output/INTERMEDIATE_FILES/FASTQ_FILES/unmapped_1.fastq
    Skipped reads (if present) written in dir /home/kc/workspace/output/INTERMEDIATE_FILES/SORTED/unmapped_1_SKIPPED.fastq
    INFO: processing fastaQ /home/kc/workspace/output/INTERMEDIATE_FILES/FASTQ_FILES/unmapped_1.fastq
    INFO: quality filtering the fastq files with nucleotide quality: 20, percentage: 100, encoding: 33 .
    done. Output files written in dir /home/kc/workspace/output/INTERMEDIATE_FILES/FASTQ_FILES/unmapped_1_Q.fastq
    INFO: converting to fasta format...
    ...done. Output file written in dir
    INFO: extracting ribosomal RNA...
    [process:1376] === Options processing starts ... ===
    Found value: /home/kc/miniconda3/bin/sortmerna
    Found flag: --ref
    Found value: /home/kc/workspace/fasta/Human_rna/rRNA.fasta,/home/kc/workspace/fasta/Human_rna/rRNA.idx of previous flag: --ref
    Found flag: --reads
    Found value: /home/kc/workspace/output/INTERMEDIATE_FILES/FASTA_FILES/unmapped_1.fasta of previous flag: --reads
    Found flag: --blast
    Found value: 1 cigar qcov of previous flag: --blast
    Found flag: --aligned
    Found value: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_ALIGNED of previous flag: --aligned
    Found flag: --fastx
    Previous flag: --fastx is Boolean. Setting to True
    Found flag: --other
    Found value: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_UNALIGNED of previous flag: --other
    Found value: fasta
    Found flag: --paired_out
    Previous flag: --paired_out is Boolean. Setting to True
    Found flag: -e
    Found value: 0.0000000000000000000000000000001 of previous flag: -e
    [process:1460] Processing option: aligned with value: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_ALIGNED
    [process:1460] Processing option: blast with value: 1 cigar qcov
    [process:1460] Processing option: e with value: 0.0000000000000000000000000000001
    [process:1460] Processing option: fastx with value:
    [process:1460] Processing option: other with value: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_UNALIGNED
    [process:1460] Processing option: paired_out with value:
    [process:1460] Processing option: reads with value: /home/kc/workspace/output/INTERMEDIATE_FILES/FASTA_FILES/unmapped_1.fasta
    [opt_reads:97] Processing reads file [1] out of total [1] files
    [process:1460] Processing option: ref with value: /home/kc/workspace/fasta/Human_rna/rRNA.fasta,/home/kc/workspace/fasta/Human_rna/rRNA.idx
    [opt_ref:157] Processing reference [1] out of total [1] references

[opt_ref:194] ERROR: The file /home/kc/workspace/fasta/Human_rna/rRNA.fasta,/home/kc/workspace/fasta/Human_rna/rRNA.idx is not an existing/valid absolute or relative path
Reference file (FASTA) absolute or relative path.

   Use mutliple times, once per a reference file

done. Output written in /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_UNALIGNED
Wrong filepath or input file not existent: /home/kc/workspace/output/INTERMEDIATE_FILES/RIBO/unmapped_1_RIBO_UNALIGNED.fasta

from decontaminer.

amarinderthind avatar amarinderthind commented on July 3, 2024

Are you using 'test_data.tar.gz' as an input? Here I am sharing the configuration file that I have tested recently ..

; Decontaminer configuration file. 

; Section 1: ext_soft.  Paths to the executables of external softwares.

[ext_soft]

; SAMTOOLS INSTALLED EXECUTABLE PATH (for samtool sort, bam2fq)
SAMTOOLS_EXEC= /home/software/samtools-1.3.1/samtools

; FASTX fastq_quality_filter EXECUTABLE PATH (for fastq_quality_filter)
FASTX_EXEC=  /scratch/hm82/CSCC/longRNA/unmapped/decontaMiner_1.4/fastx/fastq_quality_filter

; BLASTN INSTALLED EXECUTABLE PATH(for megablast)
BLASTN_EXEC= /scratch/hm82/CSCC/longRNA/unmapped/decontaMiner_1.4/ncbi-blast-2.2.29+/bin/blastn

; SORTMERNA INSTALLED EXECUTABLE PATH 
SORTMERNA_EXEC = /scratch/hm82/CSCC/longRNA/unmapped/decontaMiner_1.4/sortmerna-2.1b/sortmerna

; Section 2: cont_db.  Paths to the databases of contaminating sequences

[cont_db]

; HUMAN RIBOSOMAL SEQUENCES, SORTEMERNA FORMAT
; PATH OF THE DIRECTORY CONTAINING THE FASTA FILE (.fasta) AND THE INDEXES (.idx*)
RIBO_DB=/scratch/unmapped/decontaMiner_1.4/DB/HUMAN_RNA
; NAME OF THE DB ([name].fasta AND INDEXES  ([name].idx*)
RIBO_NAME=rRNA

; DATABASE OF BACTERIA, BLAST FORMAT
; PATH OF THE DIRECTORY CONTAINING THE COMPRESSED FILES
BACTERIA_DB=/scratch/unmapped/decontaMiner_1.4/DB/BACTERIA
BACTERIA_NAME=Bacteria_newblast

; DATABASE OF FUNGI, BLAST FORMAT
; PATH OF THE DIRECTORY CONTAINING THE COMPRESSED FILES
FUNGI_DB=/scratch/unmapped/decontaMiner_1.4/DB/FUNGI

FUNGI_NAME=Fungi_new

; DATABASE OF VIRUSES, BLAST FORMAT
; PATH OF THE DIRECTORY CONTAINING THE COMPRESSED FILES
VIRUSES_DB=/scratch/unmapped/decontaMiner_1.4/DB/VIRUSES

VIRUSES_NAME=Virus_new

from decontaminer.

loubean4073 avatar loubean4073 commented on July 3, 2024

Hi, thank you for providing your configure.txt for reference. According to your configure.txt, I found that the version of the software tools I use is different from yours. So I reinstalled all the software tools to your version. It works now.
Thanks again!

from decontaminer.

amarinderthind avatar amarinderthind commented on July 3, 2024

That's great.

from decontaminer.

Related Issues (7)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.