Git Product home page Git Product logo

Comments (7)

mschatz avatar mschatz commented on August 13, 2024

from genomescope.

ptranvan avatar ptranvan commented on August 13, 2024

Hello, we don't know many things about the genomic architecture of this species but it should be diploid. I did change "Average k-mer coverage "for polyploid genome" to 114 but got the same plot:

GenomeScope version 2.0
input file = user_uploads/va4PdkFoOZtVXygmISbA
output directory = user_data/va4PdkFoOZtVXygmISbA
p = 2
k = 21
initial kmercov estimate = 114

property min max
Homozygous (aa) 0% 100%
Heterozygous (ab) 0% 100%
Genome Haploid Length 251,713,776 bp 251,965,450 bp
Genome Repeat Length 52,929,734 bp 52,982,656 bp
Genome Unique Length 198,784,041 bp 198,982,794 bp
Model Fit 79.394% 92.8582%
Read Error Rate 0.236534% 0.236534%

http://genomescope.org/genomescope2.0/analysis.php?code=va4PdkFoOZtVXygmISbA

from genomescope.

mschatz avatar mschatz commented on August 13, 2024

from genomescope.

ViriatoII avatar ViriatoII commented on August 13, 2024

I'm also curious about this. Genomescope1 seems to have estimations in line with literature for my species while genomescope2 not (even when multiplying by 2 because of haploid vs diploid)

from genomescope.

mschatz avatar mschatz commented on August 13, 2024

from genomescope.

ViriatoII avatar ViriatoII commented on August 13, 2024

Hi Mike,
That's very kind of you, thank you.
As an example, this D. erucoides is estimated to have ~500 Mbps haploid genome size, 1000 Mbps in diploid size ( Lysák et al.,2009)

Genomescope1 predicts 435 Mbps haploid length, just short of literature, as well as a reasonable 0.65% of heterozygosity:
http://qb.cshl.edu/genomescope/analysis.php?code=izm95ZeGs1WxSvj8bidT

The genomescope2 run (even using max 100 000 coverage) predicts 212Mbps of haploid genome length, and a surprising max estimated heterozygosity of 20%.
http://qb.cshl.edu/genomescope/genomescope2.0/analysis.php?code=cKhv8wWNKBHMEgf3ilHd

I am applying the same pipeline to 20 species, some of which are tetraploid. I'd wish I could just apply the same parameters to all, or at least only treat the tetraploids differently and give them to genomescope2.

Appreciate the help,
Ricardo

from genomescope.

mschatz avatar mschatz commented on August 13, 2024

from genomescope.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.