Git Product home page Git Product logo

Comments (12)

dewyman avatar dewyman commented on June 1, 2024

Hi,
Can you show me what your config file looks like? This error suggests to me that you may not have the correct number of comma-delimited fields in it.
Best,
Dana

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Hi.

Here is my config file looks like

AllRNA,HFB1,MinION,/Users/Ashok/Desktop/Hoffman/ashokpat/MinION/11042019/HFB.sam

from talon.

dewyman avatar dewyman commented on June 1, 2024

Ok, that looks reasonable. But is there by any chance a blank line underneath this entry? That has happened to my labmate before.

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Yes. There was a blank line underneath. And I delete that line. Its working perfect now. Thanks

from talon.

dewyman avatar dewyman commented on June 1, 2024

Glad to hear it! Let me know if you have more questions :)

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Hi @dewyman,

It runs fine but while generating the read_annot.tvs file I have the following error. Can you look into it?

[ 2019-11-15 04:25:53 ] All jobs complete. Starting database update.
[ 2019-11-15 04:29:02 ] Validating database........
[ 2019-11-15 04:29:03 ] Database update complete.
[ 2019-11-15 04:29:03 ] Creating read-wise annotation file.
Traceback (most recent call last):
File "/usr/local/bin/talon", line 8, in
sys.exit(main())
File "/usr/local/lib/python3.7/site-packages/talon/talon.py", line 2419, in main
outprefix, datasets = datasets)
File "/usr/local/lib/python3.7/site-packages/talon/post/get_read_annotations.py", line 372, in make_read_annot_file
curr_gene_novelty = gene_novelty[gene_ID]
KeyError: 46490

from talon.

dewyman avatar dewyman commented on June 1, 2024

Hi,
It looks like the script that generates the output file is having trouble finding the novelty status of the gene with ID '46490'. This is something I should be able to patch for you relatively quickly, but I'd like to understand why it's happening to solve the underlying issue. Would you mind sending me the TALON database that was the output of your run? My email is [email protected] if you prefer.

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Sure. I am sending it to you.

from talon.

dewyman avatar dewyman commented on June 1, 2024

Ok I was able to figure out what is going on. The gene with TALON id '46490' is ENSG00000272675.1, which has a 'NOVEL' gene_status designation in the GENCODE GTF you provided. This confused the TALON read_annot utility because it was expecting 3 only different types of genes: Known (by virtue of the annotation), antisense novel, or intergenic novel (these latter two cases are called by TALON).

One thing I can do is add a try/except clause to the novelty lookup in the read_annot script so that when this type of issue happens, the novelty assignment in the outfile will be 'Other' rather than crashing the whole thing. Would that make sense for you?

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Yah Sure. That should be fine. I am using Gencode v32

from talon.

dewyman avatar dewyman commented on June 1, 2024

Ok, I have implemented the patch now, so feel free to clone from the master branch and try it out. You do not need to redo your TALON run since all the jobs/database updates ran fine- all you need to do is run the talon_fetch_reads utility to get your output file:

talon_fetch_reads --h

Usage: talon_fetch_reads [-h] [--db FILE,] [--build STRING,]
                         [--datasets STRING,] [--o OUTPREFIX]

This utility queries a TALON database in order to get read-specific annotation
information.

optional arguments:
  -h, --help          show this help message and exit
  --db FILE,          TALON database
  --build STRING,     Genome build (i.e. hg38) to use. Must be in the
                      database.
  --datasets STRING,  Optional: Comma-delimited list of datasets to include.
                      Default behavior is to include all datasets in the
                      database.
  --o OUTPREFIX       Prefix for output files

Let me know how it goes!

from talon.

ashokpatowary avatar ashokpatowary commented on June 1, 2024

Thanks. The output file generated without any error.

from talon.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.