Git Product home page Git Product logo

Comments (5)

LuyiTian avatar LuyiTian commented on June 19, 2024

It depends on which benchmark dataset you used. These cell lines have distinct gene expression patterns, so the single cell data can be easily clustered. This is why we have the mixture dataset, where the difference between clusters are more subtle. Also keep in mind that these dataset provide a baseline for method evaluation, which means having good performance on these dataset does not guarantee the similar performance on other more complicated data. If you want to challege your clustering methods, you can look at scRNAseq data from cells in differenciation, such as hematopietic stem cells (https://www.nature.com/articles/s41467-019-10291-0) or iPSCs (https://doi.org/10.1016/j.cell.2019.01.006). The different between different cell type/state will be more subtle, but the annotation will not be as good as our controlled experiment, due to the limited knowledge of the biological systems.

from sc_mixology.

qianchenxi1109 avatar qianchenxi1109 commented on June 19, 2024

Hi, I am experimenting with your test data as well. (Nice paper by the way, very comprehensive comparisons). However, I am having difficulties finding out the "true label" for cellmix data.

I understand that for CellLine scRNAseq data you used "cell_line_demuxlet" as your true label, and for RNAmix data you used "mix" column to store the true label info, but for cellmix data there is no such group info in "colData".

It will be great if you can help me with this issue. A bunch of thanks in advance.

from sc_mixology.

LuyiTian avatar LuyiTian commented on June 19, 2024

Hi, I am experimenting with your test data as well. (Nice paper by the way, very comprehensive comparisons). However, I am having difficulties finding out the "true label" for cellmix data.

I understand that for CellLine scRNAseq data you used "cell_line_demuxlet" as your true label, and for RNAmix data you used "mix" column to store the true label info, but for cellmix data there is no such group info in "colData".

It will be great if you can help me with this issue. A bunch of thanks in advance.

Yes I agree it is not very clear. The true label are the combinations of cell numbers from three cell lines, so is the RNA mixtures. but in RNA mixtures it is the combinations of RNA proportions not cell numbers. I will update the documentation.

from sc_mixology.

sgmccalla avatar sgmccalla commented on June 19, 2024

Hi @LuyiTian I am curious about the true labels as well. Which file(s) contains the true cluster labels? Thanks in advance.

from sc_mixology.

LuyiTian avatar LuyiTian commented on June 19, 2024

@sgmccalla I have updated the document. please check recent commit bba1c35.

from sc_mixology.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.