Comments (3)
Stan - Our code assumes input GFF3 following both the physical format and gene graph modeling described in the GFF3 specification.
The important part here is that CDS/exon features should have a type of RNA as their parent, where you have them attached to a gene feature. In that specification page search for this string "Here is how this gene should be described using GFF3".
The one thing not shown in that representation is where to attach annotation. We attach annotation features to the polypeptide for coding genes and the RNA feature for non-coding. Example:
gi|169887498|gb|CP000948.1| . gene 3734 5020 . + . ID=1_4
gi|169887498|gb|CP000948.1| . mRNA 3734 5020 . + . ID=1_4_mRNA;Parent=1_4
gi|169887498|gb|CP000948.1| . CDS 3734 5020 . + 0 ID=1_4_cds;Parent=1_4_mRNA
gi|169887498|gb|CP000948.1| . exon 3734 5020 . + . ID=1_4_exon;Parent=1_4_mRNA
gi|169887498|gb|CP000948.1| . polypeptide 3734 5020 . + . ID=1_4_polypeptide;Parent=1_4_mRNA;product_name=threonine synthase;Dbxref=EC:4.2.3.1;Ontology_term=GO:0004795,GO:0009088
from biocode.
from biocode.
You should also note that there are many GFF3-related utility scripts within biocode's 'gff' directory which could help here. Some scripts convert common tool 'gff' output to actual gff3 while others can be used to manipulate conventions within a gff file more directly. Hopefully this helps.
from biocode.
Related Issues (20)
- convert_gff3_to_ncbi_tbl HOT 5
- Syntax error on gff.py HOT 4
- Exclude mRNA features in bacterial TBL exports
- Attribute error for update_selected_column9_values.py HOT 1
- Biocode.gff module error HOT 2
- [convert_genbank_to_gff3.py] key_error: locus_tag HOT 5
- AttributeError: type object 'str' has no attribute 'maketrans' HOT 2
- AttributeError: 'Gene' object has no attribute 'add_CDS' HOT 4
- Insert EC numbers into chado database issue HOT 5
- convert_augustus_to_gff3.py error HOT 6
- Conda based install HOT 2
- convert_gff3_to_ncbi_tbl.py HOT 2
- Incorrect parent features from convert_tRNAScanSE_to_gff3.pl HOT 2
- Formatting Issue? HOT 2
- biocode error HOT 16
- product info not printout in tbl HOT 2
- fasta/fasta_simple_stats.py fails on any file with only one sequence
- [convert_genbank_to_gff3.py] No Locus_tag present in my genbank file HOT 2
- Python upgrade/conversion? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from biocode.