Git Product home page Git Product logo

Comments (8)

haghshenas avatar haghshenas commented on August 25, 2024 1

I think I was able to reproduce this bug. I'm working on it and will update you.

from haslr.

haghshenas avatar haghshenas commented on August 25, 2024

Hi Jeff,

Sorry that I was busy with other stuff and got back to this so late.
Is the issue fixed? If yes, could you please let me know what the problem was and how you fixed it?

from haslr.

jstrickland63 avatar jstrickland63 commented on August 25, 2024

Hello, I also got this same error. I ran it on a node with 80 CPUs and 1500GB of RAM. I was able to run the sample dataset with no problems. The output is below for the .err file. Please let me know what other information you might need. I will also start tinkering with parameters as jelber2 did in #11 and see if I get lucky.

[NOTE] number of threads: 80

[NOTE] loading contig sequences...
processing file: /scratch2/jlstrck/Snake_HASLR_8March2020/sr_k49_a3.contigs.nooverlap.fa... Done in 7.63 CPU seconds (7.63 real seconds)
loaded 16708266 contigs
elapsed time 8.25 CPU seconds (10.37 real seconds)

[NOTE] calculating kmer frequency of unique contigs
mean: 67.42
elapsed time 9.85 CPU seconds (11.96 real seconds)

[NOTE] loading long read sequences...
processing file: /scratch2/jlstrck/Snake_HASLR_8March2020/lr25x.fasta... Done in 70.21 CPU seconds (70.23 real seconds)
loaded 4213353 long reads
elapsed time 80.08 CPU seconds (82.21 real seconds)

[NOTE] loading alignment between contigs and long reads...
processing file: /scratch2/jlstrck/Snake_HASLR_8March2020/map_contigs_k49_a3_lr25x.paf... Done in 103.05 CPU seconds (103.11 real seconds)
loaded 9169927 alignments
elapsed time 192.75 CPU seconds (218.77 real seconds)

[NOTE] fixing overlapping alignments...
elapsed time 225.54 CPU seconds (251.56 real seconds)

[NOTE] building compact long reads...
elapsed time 229.36 CPU seconds (255.92 real seconds)

[NOTE] building the backbone graph...
elapsed time 239.56 CPU seconds (268.25 real seconds)

[NOTE] cleaning weak edges...
removed 192666 edges
elapsed time 247.01 CPU seconds (277.75 real seconds)

[NOTE] cleaning tips...
removed 1324 tips
elapsed time 254.49 CPU seconds (287.72 real seconds)

[NOTE] cleaning simple bubbles...
removed 11407 simple bubbles
elapsed time 261.54 CPU seconds (296.75 real seconds)

[NOTE] cleaning super bubbles...
removed 265 super bubbles
elapsed time 268.81 CPU seconds (306.35 real seconds)

[NOTE] cleaning small bubbles...
removed 24 small bubbles
elapsed time 275.91 CPU seconds (316.15 real seconds)

[NOTE] calculating long read coordinates between anchors...
elapsed time 6082.05 CPU seconds (394.16 real seconds)

[NOTE] calling consensus sequence between anchors...
terminate called after throwing an instance of 'std::bad_alloc'
what(): std::bad_alloc

from haslr.

palfalvi avatar palfalvi commented on August 25, 2024

Hi @haghshenas !

I also got the same error recently on 35 CPUs and 2800 GB RAM trying to assemble a 2.3Gbp genome with illumina and ont data. Just would like to know if there is any solution or workaround. Would be great to test this assembler!

from haslr.

CIWa avatar CIWa commented on August 25, 2024

Hi,

Thank you very much for offering this great tool! For supporting the issue:
I am using a single node with 2 TB of RAM, for a 6 GB genome, PacBio CLR + Illumina PE data. RAM was okay before the crash (< 60 %). My version is 0.8a1, build successfully from source.

My call:
haslr.py -t 126 -o ./ -g 6g -l pacbio.fa -x pacbio -s R1.fastq R2.fastq

The error messages are:
[29-Jun-2020 10:07:29] assembling long reads using HASLR... failed
ERROR: "haslr_assemble" returned non-zero exit status

And in asm_contigs_k49_a3_c250_lr25x_b500_s3_sim0.85.err:
[NOTE] calling consensus sequence between anchors...
terminate called after throwing an instance of 'std::bad_alloc'
what(): std::bad_alloc

Best wishes,
Isabel

from haslr.

drs avatar drs commented on August 25, 2024

Hi @haghshenas,

I also got the same error trying to assemble a small genome (32.5 Mb) (Thalassiosira pseudonana) on two different system (Ubuntu 20.04 - 64 Gb RAM - 40 Threads and Debian 9 - 400 Gb RAM - 40 Threads).

I use different subsample (15X to 100X coverage) of a large dataset (SRA SRR7762361) to assemble the genome and the only dataset that fails with this error is the dataset with 15X coverage.

The command I used is :

$ haslr.py -o Assembly -g 12.2m -x nanopore -t 40 \
-s Illumina-R1.fastq Illumina-R2.fastq Illumina-Singles.fastq -l Nanopore-15X.fastq

I do not have any clue on why this specific dataset failed or if this is really related to this issue since this seesm to affect large gigabase sized genomes. I hope this additional information can be any valuable to you.

Regards,
Samuel Drouin

from haslr.

koujiaodahan avatar koujiaodahan commented on August 25, 2024

So, what's the solution to "terminate called after throwing an instance of 'std::bad_alloc'"

from haslr.

jessiepelosi avatar jessiepelosi commented on August 25, 2024

I've also run into this issue with the same error during calling consensus sequence between anchors...:

terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc

Has there been any progress on solving this issue since 2020?

Looking at this issue from STAR, it's likely not an issue with available memory but the number of headers in the initial short read assembly. Any advice/fixes?

from haslr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.