Comments (14)
Thanks for the comments everyone. I'm working on pushing a fix for this today. Hopefully report back with a PR!
from rnasplice.
Hi @paolo-kunderfranco and @expansion-bioemformatics , thanks for raising the issue. We will work on having this error resolved.
from rnasplice.
I am having the same issue running via Tower on AWS Batch.
from rnasplice.
I am also getting a similar issue. I am not sure, but I think the problem occurs because all contrasts are passed to suppa via --groups
, whereas the PSI files are somehow already specific for the contrast of interest. Therefore, there are more columns than samples.
from rnasplice.
To be clear, this happens whenever there are >2 contrasts specified in the samplesheet. I was able to "solve" the problem by running the pipeline multiple times, once for each desired contrast. However, there should be a contrastsheet with multiple contrasts specified in one of the test configs.
from rnasplice.
Hi @dkoppstein
What do you mean by whenever there are >2 contrasts specified in the samplesheet
?
Did you mean >2 contrasts in the contrast sheet
? Or >2 samples in the samplesheet
?
I'm also running into this issue, with the following samplesheet and contrast sheet:
sample,condition,strandedness,fastq_1,fastq_2
K562_TARDBP_TRT_REP1,K562_TARDBP_TRT,reverse,fastq-test/SRR3469414-R1.fastq.gz,fastq-test/SRR3469414-R2.fastq.gz
K562_TARDBP_TRT_REP2,K562_TARDBP_TRT,reverse,fastq-test/SRR3469415-R1.fastq.gz,fastq-test/SRR3469415-R2.fastq.gz
K562_TARDBP_CTL_REP1,K562_TARDBP_CTL,reverse,fastq-test/SRR3469418-R1.fastq.gz,fastq-test/SRR3469418-R2.fastq.gz
K562_TARDBP_CTL_REP2,K562_TARDBP_CTL,reverse,fastq-test/SRR3469419-R1.fastq.gz,fastq-test/SRR3469419-R2.fastq.gz
HepG2_TARDBP_TRT_REP1,HepG2_TARDBP_TRT,reverse,fastq-test/SRR4421909-R1.fastq.gz,fastq-test/SRR4421909-R2.fastq.gz
HepG2_TARDBP_TRT_REP2,HepG2_TARDBP_TRT,reverse,fastq-test/SRR4421908-R1.fastq.gz,fastq-test/SRR4421908-R2.fastq.gz
HepG2_TARDBP_CTL_REP1,HepG2_TARDBP_CTL,reverse,fastq-test/SRR4421557-R1.fastq.gz,fastq-test/SRR4421557-R2.fastq.gz
HepG2_TARDBP_CTL_REP2,HepG2_TARDBP_CTL,reverse,fastq-test/SRR4421558-R1.fastq.gz,fastq-test/SRR4421558-R2.fastq.gz
contrast,treatment,control
K562_TARDBP_TRT_CTL,K562_TARDBP_TRT,K562_TARDBP_CTL
HepG2_TARDBP_TRT_CTL,HepG2_TARDBP_TRT,HepG2_TARDBP_CTL
from rnasplice.
Actually, my error is very similar, but not identical to the one above:
ERROR ~ Error executing process > 'NFCORE_RNASPLICE:RNASPLICE:SUPPA_SALMON:CLUSTEREVENTS_IOI (HepG2_TARDBP_TRT-HepG2_TARDBP_CTL)'
Caused by:
Process `NFCORE_RNASPLICE:RNASPLICE:SUPPA_SALMON:CLUSTEREVENTS_IOI (HepG2_TARDBP_TRT-HepG2_TARDBP_CTL)` terminated with an error exit status (1)
Command executed:
suppa.py \
clusterEvents \
--dpsi HepG2_TARDBP_TRT-HepG2_TARDBP_CTL_transcript_diffsplice.dpsi \
--psivec HepG2_TARDBP_TRT-HepG2_TARDBP_CTL_transcript_diffsplice.psivec \
--dpsi-threshold 0.05 \
--eps 0.05 \
--metric euclidean \
--min-pts 20 \
--groups 1-2,3-4,5-6,7-8 \
--clustering DBSCAN \
-o HepG2_TARDBP_TRT-HepG2_TARDBP_CTL_transcript_cluster
cat <<-END_VERSIONS > versions.yml
"NFCORE_RNASPLICE:RNASPLICE:SUPPA_SALMON:CLUSTEREVENTS_IOI":
suppa: $(python -c "import pkg_resources; print(pkg_resources.get_distribution('suppa').version)")
END_VERSIONS
Command exit status:
1
Command output:
(empty)
Command error:
ERROR:lib.cluster_tools:Invalid index. Index 8 is larger than the number of columns in the file (4).
from rnasplice.
Sorry -- I did indeed mean >2 contrasts in the contrast sheet
. And it is clear that the error is similar, i.e. that all groups are specified on the command line, even though the dpsi and psivec files are already restricted to the contrast specified.
from rnasplice.
Good catch, I think you are right and the error seems to be caused by how the --groups
setting is being specified by the pipeline.
from rnasplice.
exactly, I did have more contrast specified and the pipelne failed in SUPPA.py
ERROR:lib.cluster_tools:Invalid index. Index 32 is larger than the number of columns in the file (16).
I am trying now to run again with only one contrast specified.
from rnasplice.
update, with only one contrast specified, CLUSTEREVENTS_IOE completed succesfully, however I am now facing and error
with NFCORE_RNASPLICE:RNASPLICE:DRIMSEQ_DEXSEQ_DTU_SALMON:STAGER
,
I do need to open a new issue?
Best
from rnasplice.
I'm not sure if this is helpful, but I've looked into this before and had a similar error as @paolo-kunderfranco ERROR:lib.cluster_tools:Invalid index. Index 32 is larger than the number of columns in the file (16).
I noticed this happened if the samplesheet wasn't sorted by condition, because it seems like suppa assumes that all the samples are sorted by condtion.
from rnasplice.
Thanks for your patience on this bug. I believe we have fixed the issue with #86. Please can you pull from dev and see if this works.
from rnasplice.
Closing because all tests are passing and fix has been merged into dev branch. Please re-open if you still encounter an error.
from rnasplice.
Related Issues (20)
- STAGER error: subscript out of bounds HOT 4
- link to example contrastsheet.csv is not working HOT 1
- Empty output for BAM+rMats HOT 2
- make contrasts file names consistent with those of the differentialabundance pipeline HOT 1
- Missing rMATS arguments HOT 2
- DEXSeq-DTU Stager pScreenAdjusted HOT 1
- sashimi_plot error HOT 2
- contrast file problem? HOT 7
- EXITING because of FATAL ERROR: Genome version: 20201 is INCOMPATIBLE with running STAR version: 2.7.9a HOT 2
- MISO error HOT 3
- The processes for splitting files are running very slowly with large numbers of input samples HOT 5
- Error: suppa_split_file.R Input_file must contain samplesheet samples. HOT 2
- ERROR ~ Error executing process > 'NFCORE_RNASPLICE:RNASPLICE:DRIMSEQ_DEXSEQ_DTU_SALMON:DRIMSEQ_FILTER (1)' HOT 11
- Error in DRIMSeq::dmDSdata(counts = counts, samples = samps) HOT 3
- Error single-end execution HOT 1
- Error in SUPPA: Clustergroups are assigned incorrectly HOT 4
- Miso error HOT 8
- `NFCORE_RNASPLICE:RNASPLICE:DRIMSEQ_DEXSEQ_DTU_SALMON:DEXSEQ_DTU (1)` HOT 7
- RNA splice output
- Merged genes output HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rnasplice.