Comments (3)

foreignsand commented on August 13, 2024

Now that I've been looking at this a bit, I realize I made a mistake with the jellyfish count run: my value for -s was way off. Now, though, I'm having a difficult time determining how to calculate a value for this parameter. The documentation says:

The size parameter (given with -s) is an indication of the number of k-mers that will be stored in the hash. For sequencing reads, this size should be the size of the genome plus the k-mers generated by sequencing errors. For example, if the error rate is e (e.g. Illumina reads, usually e~1%), with an estimated genome size of G and a coverage of c, the number of expected k-mers is G + G*c*e*k.

So if the estimated genome size is 2.2 Gb, the coverage (for my PacBio HiFi sequencing?) was ~80X, the error rate was 1%, and the k-mer size I'm using is 31, should the value for -s be 2200000000 + 2200000000 * 80 * 1 * 31, or 5458200000000? That can't be right. That seems completely insane...
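For reference, the quoted formula can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not part of jellyfish: the function name `estimate_hash_size` is hypothetical, and the sketch assumes the error rate e is meant as a fraction (1% written as 0.01) rather than as the number 1. Under that assumption the same inputs give about 5.7e10 (roughly 57G) instead of 5.4582e12.

```python
def estimate_hash_size(genome_size, coverage, error_rate, k):
    """Expected number of k-mers per the quoted formula: G + G*c*e*k.

    error_rate is assumed to be a fraction (1% -> 0.01), not a percentage.
    """
    return genome_size + genome_size * coverage * error_rate * k


# Values taken from the comment above.
G = 2_200_000_000   # estimated genome size (2.2 Gb)
c = 80              # coverage (~80X)
k = 31              # k-mer size

# Error rate entered as 1 (the calculation in the comment).
size_with_e_as_1 = estimate_hash_size(G, c, 1, k)

# Error rate entered as a fraction, 1% = 0.01 (the assumption of this sketch).
size_with_e_as_fraction = round(estimate_hash_size(G, c, 0.01, k))

print(size_with_e_as_1)         # 5458200000000
print(size_with_e_as_fraction)  # 56760000000
```

Whether 1% is a reasonable error rate for the reads in question is a separate matter that this sketch does not address; it only shows how sensitive the estimate is to the way e is entered.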

from genomescope.

foreignsand commented on August 13, 2024

Continuing on with this, I reran jellyfish count with -s 50G and got the same results.

[Image: GenomeScope_s50g]

Attempting to run jellyfish count with -s 5000G and -s 500G failed due to memory issues, so I'm attempting it with -s 100G. If that doesn't produce something that makes more sense, I'm not entirely sure how to resolve this.

foreignsand commented on August 13, 2024

I'm still getting the same issue. I'd love to know why that might be.
