Git Product home page Git Product logo

Comments (3)

gpertea avatar gpertea commented on June 26, 2024 1

Really? Are you really going to argue that you just don't see a duplicate ID gene29892 in those first two lines of your ref.gff quote you're showing here ? :)

While the error message might not be very accurate (there is no actual overlap there, for those two duplicated gene IDs), the problem still stands: the ID gene29892 has obvious transcript properties (parenting CDS and exon features) while it is being declared twice in the file (1st at 74631-74744, 2nd time at 146276-147073). This is not a valid GFF3 format, as my understanding is that the feature IDs should be unique.. (though indeed my GFF parser is a bit more lenient about this, accepting non-unique IDs if the features are on separate reference sequences.. which is not the case here).

Then things went downhill ("downstream") from there.. according to the merciless GIGO principle.. I hope you understand, it's hard to somehow automagically fix invalid input reference annotation data, or guess what the authors of those data really meant there -- e.g. that it was somehow a 2-part gene there (?!), so it's OK if they broke the ID uniqueness rule on a whim..

from gffcompare.

gpertea avatar gpertea commented on June 26, 2024 1

Leaving aside the cheeky fun I had with my reply above, it turns out that in fact I was in the wrong there -- not about the duplicate IDs, but about the validity of that annotation, as this is a special case of trans-splicing where actually the current GFF3 specification does allow for the same ID in the case of discontinuous features like trans-splicing and fusions.
So please accept my belated apologies for the incorrect/incomplete answer -- and thanks for this trans-splicing example! I ran into this old closed issue while looking for trans-splicing examples so I can add trans-splicing support to my GFF parser (and thus to gffread, gffcompare etc.).

from gffcompare.

lakhujanivijay avatar lakhujanivijay commented on June 26, 2024

Hello Geo!

Glad to hear that. It's good that we are coming across such scenarios and I am happy that the project is on constant development. Taking it on a positive note.

Your hard work is much appreciated!

Regards
Vijay

from gffcompare.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.