Git Product home page Git Product logo

Comments (8)

jma1991 avatar jma1991 commented on July 4, 2024

The error you're encountering likely stems from using a gene identifier that isn't included in your annotation. You've chosen GRCh38 as your genome parameter, which directs the workflow to pull the necessary annotation files from the iGenomes repository. You can verify this by downloading the GTF file linked in the configuration and checking for your gene identifier—you'll find it's missing. In fact, the GRCh38 annotation from iGenomes doesn't include Ensembl identifiers, a known issue across all nf-core workflows that has been discussed before. If you need to use the GRCh38 genome, I recommend providing your own FASTA and GTF files for the annotation.

from rnasplice.

BulutHamali avatar BulutHamali commented on July 4, 2024

Thank you very much. I later on used this command "nextflow run nf-core/rnasplice
-profile singularity
--input /path/to/samplesheet.csv
--contrasts /path/to/contrastsheet.csv
--outdir /path/to/output_directory
--genome GRCh38
--aligner star_salmon
--save_reference
--dexseq_dtu
--min_samps_gene_expr 6
--min_gene_expr 10
--min_samps_feature_expr 3
--min_feature_expr 10
--min_samps_feature_prop 3
--min_feature_prop 0.1
--fasta /path/to/reference_genome.fa
--gtf /path/to/annotation_file.gtf.gz
-resume
" and I got this error then"The exit status of the task that caused the workflow execution to fail was: 105

Error executing process > 'NFCORE_RNASPLICE:RNASPLICE:ALIGN_STAR:STAR_ALIGN (LC6_S6)'

Caused by:
Process NFCORE_RNASPLICE:RNASPLICE:ALIGN_STAR:STAR_ALIGN (LC6_S6) terminated with an error exit status (105)

Command executed:

STAR
--genomeDir STARIndex
--readFilesIn input1/LC6_S6_trimmed.fq.gz
--runThreadN 12
--outFileNamePrefix LC6_S6.

--sjdbGTFfile Homo_sapiens.GRCh38.104.gtf
--outSAMattrRGline 'ID:LC6_S6' 'SM:LC6_S6'
--quantMode TranscriptomeSAM --twopassMode Basic --outSAMtype BAM Unsorted --readFilesCommand gunzip -c --runRNGseed 0 --outFilterMultimapNmax 20 --alignSJDBoverhangMin 1 --outSAMattributes NH HI AS NM MD --quantTranscriptomeBan Singleend

if [ -f LC6_S6.Unmapped.out.mate1 ]; then
mv LC6_S6.Unmapped.out.mate1 LC6_S6.unmapped_1.fastq
gzip LC6_S6.unmapped_1.fastq
fi
if [ -f LC6_S6.Unmapped.out.mate2 ]; then
mv LC6_S6.Unmapped.out.mate2 LC6_S6.unmapped_2.fastq
gzip LC6_S6.unmapped_2.fastq
fi

cat <<-END_VERSIONS > versions.yml
"NFCORE_RNASPLICE:RNASPLICE:ALIGN_STAR:STAR_ALIGN":
star: $(STAR --version | sed -e "s/STAR_//g")
samtools: $(echo $(samtools --version 2>&1) | sed 's/^.samtools //; s/Using.$//')
gawk: $(echo $(gawk --version 2>&1) | sed 's/^.GNU Awk //; s/, .$//')
END_VERSIONS

Command exit status:
105

Command output:
STAR --genomeDir STARIndex --readFilesIn input1/LC6_S6_trimmed.fq.gz --runThreadN 12 --outFileNamePrefix LC6_S6. --sjdbGTFfile Homo_sapiens.GRCh38.104.gtf --outSAMattrRGline ID:LC6_S6 SM:LC6_S6 --quantMode TranscriptomeSAM --twopassMode Basic --outSAMtype BAM Unsorted --readFilesCommand gunzip -c --runRNGseed 0 --outFilterMultimapNmax 20 --alignSJDBoverhangMin 1 --outSAMattributes NH HI AS NM MD --quantTranscriptomeBan Singleend
STAR version: 2.7.9a compiled: 2021-05-04T09:43:56-0400 vega:/home/dobin/data/STAR/STARcode/STAR.master/source
May 07 16:08:24 ..... started STAR run
May 07 16:08:24 ..... loading genome

Command error:
INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred
INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred
STAR --genomeDir STARIndex --readFilesIn input1/LC6_S6_trimmed.fq.gz --runThreadN 12 --outFileNamePrefix LC6_S6. --sjdbGTFfile Homo_sapiens.GRCh38.104.gtf --outSAMattrRGline ID:LC6_S6 SM:LC6_S6 --quantMode TranscriptomeSAM --twopassMode Basic --outSAMtype BAM Unsorted --readFilesCommand gunzip -c --runRNGseed 0 --outFilterMultimapNmax 20 --alignSJDBoverhangMin 1 --outSAMattributes NH HI AS NM MD --quantTranscriptomeBan Singleend
STAR version: 2.7.9a compiled: 2021-05-04T09:43:56-0400 vega:/home/dobin/data/STAR/STARcode/STAR.master/source
May 07 16:08:24 ..... started STAR run
May 07 16:08:24 ..... loading genome

EXITING because of FATAL ERROR: Genome version: 20201 is INCOMPATIBLE with running STAR version: 2.7.9a
SOLUTION: please re-generate genome from scratch with running version of STAR, or with version: 2.7.4a

May 07 16:08:24 ...... FATAL ERROR, exiting

Work dir:
/home/hamalibt/Splicesome_Project/RNA_SEQ_ARGLU1KO_VS_MCF7WT/work/2e/8d479b2a30ee2730d2b11db1c60837

Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named .command.sh"

from rnasplice.

jma1991 avatar jma1991 commented on July 4, 2024

This is another widespread issue. Basically, the index stored on iGenomes is incompatible with the version of the STAR aligner in the nf-core modules. See the somewhat short discussion here. The solution, as mentioned earlier is to provide your own FASTA and GTF file so the workflow generates an index which is compatible.

from rnasplice.

BulutHamali avatar BulutHamali commented on July 4, 2024

So I downloaded the human genome from a custom repository""https://github.com/ewels/AWS-iGenomes" and used the following Nextflow command with nf-core's rnasplice pipeline "nextflow run nf-core/rnasplice -profile singularity --input <input_path>/samplesheet.csv --contrasts <contrasts_path>/contrastsheet.csv --outdir <output_path>/Master_Directory/MCF7/05062024 --genome GRCh37 --aligner star_salmon --save_reference --dexseq_dtu --min_samps_gene_expr 6 --min_gene_expr 10 --min_samps_feature_expr 3 --min_feature_expr 10 --min_samps_feature_prop 3 --min_feature_prop 0.1 --fasta <fasta_path>/genome.fa --gtf <gtf_path>/genes.gtf
" I even configured nextflow.config for MISO gene extensions, but encountered this error:"Workflow execution completed unsuccessfully!
The exit status of the task that caused the workflow execution to fail was: 1.

The full error message was:

Error executing process > 'NFCORE_RNASPLICE:RNASPLICE:VISUALISE_MISO:MISO_SASHIMI (2)'

Caused by:
Process NFCORE_RNASPLICE:RNASPLICE:VISUALISE_MISO:MISO_SASHIMI (2) terminated with an error exit status (1)

Command executed:

sashimi_plot --plot-event ENSG00000005302.19 index miso_settings.txt --output-dir sashimi

cat <<-END_VERSIONS > versions.yml
"NFCORE_RNASPLICE:RNASPLICE:VISUALISE_MISO:MISO_SASHIMI":
python: $(python --version | sed "s/Python //g")
misopy: $(python -c "import pkg_resources; print(pkg_resources.get_distribution('misopy').version)")
END_VERSIONS

Command exit status:
1

Command output:
(empty)

Command error:
INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred
INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred
/usr/local/lib/python2.7/site-packages/matplotlib/cbook/deprecation.py:107: MatplotlibDeprecationWarning: The mpl_toolkits.axes_grid module was deprecated in version 2.1. Use mpl_toolkits.axes_grid1 and mpl_toolkits.axisartist provies the same functionality instead.
warnings.warn(message, mplDeprecation, stacklevel=1)
Traceback (most recent call last):
File "/usr/local/bin/sashimi_plot", line 11, in
sys.exit(main())
File "/usr/local/lib/python2.7/site-packages/misopy/sashimi_plot/sashimi_plot.py", line 276, in main
plot_label=plot_label)
File "/usr/local/lib/python2.7/site-packages/misopy/sashimi_plot/sashimi_plot.py", line 142, in plot_event
%(event_name, pickle_dir)
Exception: Event ENSG00000005302.19 not found in pickled directory index. Are you sure this is the right directory for the event?

Work dir:
/home/hamalibt/my_refs/work/90/89430a1a23400ef603a22d59c6a7b6

Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named .command.sh"

from rnasplice.

jma1991 avatar jma1991 commented on July 4, 2024

Can you confirm the gene identifier you’re using is present in the annotation file? Just the output of grep would be sufficient.

from rnasplice.

BulutHamali avatar BulutHamali commented on July 4, 2024

I've tried various commands like 'grep', 'grep -m 1', 'grep -F', 'LC_ALL=C grep', './search', and 'awk' to search for 'ENSG00000005302.19' in 'genes.gtf', but after about half an hour, there's still no output. Is this normal?

from rnasplice.

jma1991 avatar jma1991 commented on July 4, 2024

Please try the following command:

grep "ENSG00000005302" genes.gtf

If there is no output, it means your gene identifier is not in the annotation file.

from rnasplice.

BulutHamali avatar BulutHamali commented on July 4, 2024

It worked out afterwards. It seems that the issue with the 'grep' command was likely related to an HPC problem. Thank you.

from rnasplice.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.