Comments (12)
Hi,
Can you show me what your config file looks like? This error suggests to me that you may not have the correct number of comma-delimited fields in it.
Best,
Dana
from talon.
Hi.
Here is my config file looks like
AllRNA,HFB1,MinION,/Users/Ashok/Desktop/Hoffman/ashokpat/MinION/11042019/HFB.sam
from talon.
Ok, that looks reasonable. But is there by any chance a blank line underneath this entry? That has happened to my labmate before.
from talon.
Yes. There was a blank line underneath. And I delete that line. Its working perfect now. Thanks
from talon.
Glad to hear it! Let me know if you have more questions :)
from talon.
Hi @dewyman,
It runs fine but while generating the read_annot.tvs file I have the following error. Can you look into it?
[ 2019-11-15 04:25:53 ] All jobs complete. Starting database update.
[ 2019-11-15 04:29:02 ] Validating database........
[ 2019-11-15 04:29:03 ] Database update complete.
[ 2019-11-15 04:29:03 ] Creating read-wise annotation file.
Traceback (most recent call last):
File "/usr/local/bin/talon", line 8, in
sys.exit(main())
File "/usr/local/lib/python3.7/site-packages/talon/talon.py", line 2419, in main
outprefix, datasets = datasets)
File "/usr/local/lib/python3.7/site-packages/talon/post/get_read_annotations.py", line 372, in make_read_annot_file
curr_gene_novelty = gene_novelty[gene_ID]
KeyError: 46490
from talon.
Hi,
It looks like the script that generates the output file is having trouble finding the novelty status of the gene with ID '46490'. This is something I should be able to patch for you relatively quickly, but I'd like to understand why it's happening to solve the underlying issue. Would you mind sending me the TALON database that was the output of your run? My email is [email protected] if you prefer.
from talon.
Sure. I am sending it to you.
from talon.
Ok I was able to figure out what is going on. The gene with TALON id '46490' is ENSG00000272675.1, which has a 'NOVEL' gene_status designation in the GENCODE GTF you provided. This confused the TALON read_annot utility because it was expecting 3 only different types of genes: Known (by virtue of the annotation), antisense novel, or intergenic novel (these latter two cases are called by TALON).
One thing I can do is add a try/except clause to the novelty lookup in the read_annot script so that when this type of issue happens, the novelty assignment in the outfile will be 'Other' rather than crashing the whole thing. Would that make sense for you?
from talon.
Yah Sure. That should be fine. I am using Gencode v32
from talon.
Ok, I have implemented the patch now, so feel free to clone from the master branch and try it out. You do not need to redo your TALON run since all the jobs/database updates ran fine- all you need to do is run the talon_fetch_reads utility to get your output file:
talon_fetch_reads --h
Usage: talon_fetch_reads [-h] [--db FILE,] [--build STRING,]
[--datasets STRING,] [--o OUTPREFIX]
This utility queries a TALON database in order to get read-specific annotation
information.
optional arguments:
-h, --help show this help message and exit
--db FILE, TALON database
--build STRING, Genome build (i.e. hg38) to use. Must be in the
database.
--datasets STRING, Optional: Comma-delimited list of datasets to include.
Default behavior is to include all datasets in the
database.
--o OUTPREFIX Prefix for output files
Let me know how it goes!
from talon.
Thanks. The output file generated without any error.
from talon.
Related Issues (20)
- Key Error: "gene novelty" HOT 1
- Annotation name not found HOT 11
- Error when using non-human GTF HOT 2
- What database to use? HOT 2
- TALON on single-cell data HOT 10
- Multithread bam sort HOT 2
- Error message : ValueError: SAM transcript 463c55e7-91c0-45cf-9630-122afc9a2267 lacks an MD tag HOT 1
- I get 'Annotation name 'ADV5_annot' not found in this database' in step talon_abundance HOT 1
- numpy<1.24.0 should be added as a requirement
- Error: value too large to convert to int16_t HOT 1
- Single cell reads appear to be unlabelled when filtering transcripts HOT 1
- How to visualize custom gtf? HOT 1
- operand type error in talon_initialize_database
- Duplicate transcripts - forward and reverse. HOT 3
- talon_filter_transcript --minCounts with --maxFracA HOT 1
- Rules for considering splicing variants as identical in TALON HOT 2
- fail to extract CB field HOT 2
- What should I do if I have reads that are reverse complement to the original RNA sequences? HOT 2
- Isoforms defined by reads with high fraction A (>0.5) HOT 2
- Abundance and fraction_as not showing properly
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from talon.