Comments (6)
Hi,
Thanks for using BRIE, and reporting the issue.
It seems the reason for the errors is that the GTF files didn't match the BRIE's requirement. BRIE requires the that features of a gene are listed in a particular order, as follows.
gene
transcript
exon
exon (optional)
exon (optional)
transcript (optional)
exon
exon (optional)
exon (optional)
Could you check if your GTF files have features in such order?
Cheers,
Yuanhua
from brie.
In the case of the custom annotation file, the features are not in that order, in fact, my .gtf file only contains the annotation info for the exons. As for the GENCODE file, the feature order seems to be the one you indicate, but other features such as CDS, start_codon, stop_codon and UTR are included as well.
Is there any way around this?
from brie.
For your custermised alternative exons, I guess you also have the upstream and downstream exons, right? If so, then you could create a gtf file with pseudo gene and pseudo transcripts (one including the exon, the other excuding). brie-event
also gives the same format.
For the GENCODE file, the CDS, start_codon, stop_codon and UTR will be ignored, so won't affect. I haven't figured out why the above error occurs for this file. I have just downloaded M10, and brie-event
works fine. BTW, which version of BRIE do you use? Is it Python 2 or 3? So I can test the specific one.
from brie.
It is the python3 version. I could always try to download the M10 file again and try, see if the file got corrupted somehow.
Concerning the custom annotation, I see your point. However, that would ultimately yield an expression estimate per event (exclusion and inclusion, for every spliced exon). Is there any way to use BRIE to obtain a single expression value per transcript? That is, not an estimate per exon inclusion/exclusion event, but a value for every true isoform/transcript of the gene. That is my ultimate goal, and the main reason why I'd rather not building a pseudo transcript annotation file.
from brie.
Thanks Ángeles for the updates. I figured out it is the incompatibility of brie-event
with Python3. I will fix it and let you know.
Unfortunately, BRIE doesn't provide a full transcript quantification (actually it is an estimate as well, assigning reads to multiple competing transcripts from the same gene). There are quite a few good methods developed for bulk data. As you can find from the BRIE paper, the benefits of BRIE is that the sequence features bring good prediction to aid the splicing estimate in single cells. However, such features are hard to obtain at the transcript level, otherwise BRIE will be upgraded with the option for transcript level quantification.
from brie.
As you pointed out, I usually use RSEM to estimate expression at the transcript level. However, after reading the paper I was curious to try out BRIE and see whether it could provide the best of both worlds... But, as you're saying, it is not a trivial problem, considering the sparsity of single-cell data. It will be great to hear about that upgrade, if a time comes when it is possible.
Do let me know about the fix! Thanks for your help.
from brie.
Related Issues (20)
- Brie-quant input HOT 2
- Brie-Count and Brie-Quant Errors HOT 7
- Other AS types counting, no output HOT 6
- the output of brie-count has not .h5ad file HOT 1
- the output of brie-count don't contain the .h5ad file HOT 3
- the errors about the brie-count HOT 2
- brie-quant uses all available resources HOT 1
- visualize sashimi plot with brie2 HOT 3
- multiple 10x samples integration to run brie-count HOT 1
- Brie-count Multiprocessing using only 1 core HOT 2
- some questions about psi distribution
- Brie-count error with paired end smart-seq2 data HOT 1
- confused about Psi_95CI HOT 1
- p values outputfile
- MXE chromosomecoordinates to gene_ids
- How to get the specific sites where differential splicing events occurs
- ELBO_gain and effect size HOT 1
- split cells into groups and run brie2 HOT 3
- Error when running brie-count: IndexError: list index out of range HOT 1
- UnicodeDecodeError HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from brie.