Git Product home page Git Product logo

Comments (6)

huangyh09 avatar huangyh09 commented on July 17, 2024

Hi,

Thanks for using BRIE, and reporting the issue.

It seems the reason for the errors is that the GTF files didn't match the BRIE's requirement. BRIE requires the that features of a gene are listed in a particular order, as follows.

gene
transcript
exon
exon (optional)
exon (optional)
transcript (optional)
exon
exon (optional)
exon (optional)

Could you check if your GTF files have features in such order?

Cheers,
Yuanhua

from brie.

aarzalluz avatar aarzalluz commented on July 17, 2024

In the case of the custom annotation file, the features are not in that order, in fact, my .gtf file only contains the annotation info for the exons. As for the GENCODE file, the feature order seems to be the one you indicate, but other features such as CDS, start_codon, stop_codon and UTR are included as well.

Is there any way around this?

from brie.

huangyh09 avatar huangyh09 commented on July 17, 2024

For your custermised alternative exons, I guess you also have the upstream and downstream exons, right? If so, then you could create a gtf file with pseudo gene and pseudo transcripts (one including the exon, the other excuding). brie-event also gives the same format.

For the GENCODE file, the CDS, start_codon, stop_codon and UTR will be ignored, so won't affect. I haven't figured out why the above error occurs for this file. I have just downloaded M10, and brie-event works fine. BTW, which version of BRIE do you use? Is it Python 2 or 3? So I can test the specific one.

from brie.

aarzalluz avatar aarzalluz commented on July 17, 2024

It is the python3 version. I could always try to download the M10 file again and try, see if the file got corrupted somehow.

Concerning the custom annotation, I see your point. However, that would ultimately yield an expression estimate per event (exclusion and inclusion, for every spliced exon). Is there any way to use BRIE to obtain a single expression value per transcript? That is, not an estimate per exon inclusion/exclusion event, but a value for every true isoform/transcript of the gene. That is my ultimate goal, and the main reason why I'd rather not building a pseudo transcript annotation file.

from brie.

huangyh09 avatar huangyh09 commented on July 17, 2024

Thanks Ángeles for the updates. I figured out it is the incompatibility of brie-event with Python3. I will fix it and let you know.

Unfortunately, BRIE doesn't provide a full transcript quantification (actually it is an estimate as well, assigning reads to multiple competing transcripts from the same gene). There are quite a few good methods developed for bulk data. As you can find from the BRIE paper, the benefits of BRIE is that the sequence features bring good prediction to aid the splicing estimate in single cells. However, such features are hard to obtain at the transcript level, otherwise BRIE will be upgraded with the option for transcript level quantification.

from brie.

aarzalluz avatar aarzalluz commented on July 17, 2024

As you pointed out, I usually use RSEM to estimate expression at the transcript level. However, after reading the paper I was curious to try out BRIE and see whether it could provide the best of both worlds... But, as you're saying, it is not a trivial problem, considering the sparsity of single-cell data. It will be great to hear about that upgrade, if a time comes when it is possible.

Do let me know about the fix! Thanks for your help.

from brie.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.