Comments (1)
Unfortunately, that class code is currently reported for any overlap of 1 bp or more on the opposite strand.
As a side note, the gffcompare documentation was recently updated (see http://ccb.jhu.edu/software/stringtie/gffcompare.shtml) with a diagram documenting these and other, newer codes that were added to gffcompare. For example, the new code 's' is much more specific, as it also involves an intron overlap, though obviously this is only helpful for multi-exon transcripts.
The C++ code actually retrieves the overlap length but the value is never exposed in the output files produced by gffcompare.
I was recently playing with faster transcript classification code, temporarily called trmap (full code is at https://github.com/gpertea/trmap), which outputs the full exon structures (along with the class code) of both the query transcript and the overlapped references. I could easily modify that tool to also output the overlap length, or one can use a post-processing script to filter those 'x' overlaps as needed, based on the exon coordinates.
Eventually this trmap code will likely become part of gffcompare, but until then it is functional on its own for such quick transcript classification.
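The post-processing filter mentioned above could look something like the following. This is only a minimal sketch, not code from gffcompare or trmap: the function names `exon_overlap_length` and `keep_x_overlap` and the `min_overlap` threshold are hypothetical, and it assumes the exon coordinates of the query and the opposite-strand reference have already been parsed into lists of `(start, end)` pairs (1-based, inclusive, as in GFF/GTF), with no overlapping exons within a single transcript.

```python
def exon_overlap_length(query_exons, ref_exons):
    """Return the number of bases shared by two exon lists.

    Each list holds (start, end) tuples, 1-based and inclusive.
    Exons within one transcript are assumed not to overlap each other.
    """
    q = sorted(query_exons)
    r = sorted(ref_exons)
    i = j = total = 0
    while i < len(q) and j < len(r):
        start = max(q[i][0], r[j][0])
        end = min(q[i][1], r[j][1])
        if start <= end:
            total += end - start + 1
        # Advance whichever exon ends first.
        if q[i][1] < r[j][1]:
            i += 1
        else:
            j += 1
    return total


def keep_x_overlap(query_exons, ref_exons, min_overlap=50):
    """Hypothetical filter: keep an 'x' hit only if the exonic overlap
    reaches min_overlap bases (the default of 50 is an arbitrary choice)."""
    return exon_overlap_length(query_exons, ref_exons) >= min_overlap
```

For example, a query with exons `[(100, 200), (300, 400)]` against a reference exon `[(150, 350)]` shares 102 bases, while a query exon `(100, 200)` that merely touches a reference exon `(200, 300)` shares a single base and would be dropped by the default threshold.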