Git Product home page Git Product logo

Comments (4)

taltman avatar taltman commented on September 28, 2024

This is what is desired:

Qualifier       /ribosomal_slippage
Definition      during protein translation, certain sequences can program
                ribosomes to change to an alternative reading frame by a 
                mechanism known as ribosomal slippage 
Value format    none 
Example         /ribosomal_slippage 
Comment         a join operator,e.g.: [join(486..1784,1787..4810)] should be used 
                in the CDS spans to indicate the location of ribosomal_slippage 

from vadr.

nawrockie avatar nawrockie commented on September 28, 2024

That's actually a different format than the .tbl file that vadr creates, despite them both being (confusingly) called feature tables. The format of vadr output 'feature tables' is described here: https://www.ncbi.nlm.nih.gov/genbank/feature_table/

The vadr format is a useful file format for submissions to GenBank. There may be ways to convert it to a format that ENA accepts for submissions, but I'm not sure what those formats are. The vadr .ftr output files may also be relatively easy to parse and reformat into an accepted ENA format.

from vadr.

taltman avatar taltman commented on September 28, 2024

Hi @nawrockie ,

I've so far used VADR to submit nine CoV genomes to ENA with remote homology to SARS-CoV-2. The latest version of DARTH will have the scripts to turn VADR output into a format that can be submitted to ENA using their Webin-CLI tool.

I looked at the GenBank feature_table page, but it doesn't talk about ribosome slippage, and how to encode it correctly (the page seems to be more concerned about syntax). INSDC is a collaboration between GenBank, DDBJ, and EMBL, so why wouldn't the semantics defined for INSDC apply to the GenBank?

from vadr.

nawrockie avatar nawrockie commented on September 28, 2024

VADR outputs the .tbl file format because that was preferred by the GenBank indexers for viral submissions at the time of development. In the GenBank submission pipeline, the vadr .tbl format file is then converted to 'asn' format using tbl2asn which is used to input the data into the GenBank database.

The latest version of DARTH will have the scripts to turn VADR output into a format that can be submitted to ENA using their Webin-CLI tool.

That sounds good. Have you finished developing the format conversion tool/script, or do you still have a problem with ribosomal_slippage?

from vadr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.