Git Product home page Git Product logo

Comments (8)

mjoppich avatar mjoppich commented on September 26, 2024

I can report the very same for the yeast genome ... Reads used: SRR5989373 .

Have you been able to resolve the issue?

If I call it with --ambiguity 0.5 --secondary --min-bin-perc 0.01 --bin-step 0.99 --max-regions 20 --mapq -1 --spliced --chain-min-cov 40 (which according to the help is equivalent), no reads align ...

from graphmap2.

jmaricb avatar jmaricb commented on September 26, 2024

Hi @ydLiu-HIT ,

which reference were you using? Could you send the link or share it?

Thanks

from graphmap2.

mjoppich avatar mjoppich commented on September 26, 2024

Hi @jmaricb

since I got a very similar problem, maybe you could use my case instead:

I used the reads from SRR5989373 together with the ensembl 94 release:

ftp://ftp.ensembl.org/pub/release-94/gtf/saccharomyces_cerevisiae/Saccharomyces_cerevisiae.R64-1-1.94.gtf.gz

ftp://ftp.ensembl.org/pub/release-94/fasta/saccharomyces_cerevisiae/dna/Saccharomyces_cerevisiae.R64-1-1.dna_sm.toplevel.fa.gz

Thanks for looking into this!

from graphmap2.

ydLiu-HIT avatar ydLiu-HIT commented on September 26, 2024

Hi @ydLiu-HIT ,

which reference were you using? Could you send the link or share it?

Thanks

Ensembl genome with version 92 of GRCm38. link: ftp://ftp.ensembl.org/pub/release-92/fasta/mus_musculus/dna/Mus_musculus.GRCm38.dna.primary_assembly.fa.gz

from graphmap2.

jmaricb avatar jmaricb commented on September 26, 2024

@mjoppich @ydLiu-HIT I have located this segmentation fault. It was happening in the ksw2 aligner which would crash for very big references or very big queries. I have added this exception and tried it with this dataset that you linked: https://www.ncbi.nlm.nih.gov/sra/?term=SRR6238555
and it didn't crash.
Can you try with the newest commit and let me know if it still crashes.

from graphmap2.

ydLiu-HIT avatar ydLiu-HIT commented on September 26, 2024

Hi jmaricb:

I just re-run GraphMap2(v0.6.3) with the same read and reference as before, but it still gets as segmentation fault as follow:

[11:54:03 BuildIndexes] Loading reference sequences.
[11:55:35 SetupIndex_] Building the index for shape: '11110111101111'.
[11:55:55 Create] Allocated memory for a list of 1362768835 seeds (128 bits each) (0.00003 sec, diff: 19.92923 sec).
[11:55:55 Create] Memory consumption: [currentRSS = 7801 MB, peakRSS = 7889 MB]
[11:55:55 Create] Collecting seeds.
[11:55:55 Create] Minimizer seeds will be used. Minimizer window is 5.
[12:03:44 Create] [currentRSS = 37193 MB, peakRSS = 49390 MB] Sequence: 44/44, len: 91744698, name: 'chrY'''
[12:03:50 Create] Final memory allocation after collecting seeds: [currentRSS = 37694 MB, peakRSS = 49390 MB]
[12:03:50 Create] Sorting the seeds using 24 threads.
[12:06:33 Create] Generating the hash table.
[12:07:01 Create] Calculating the distribution statistics for key counts.
[12:07:02 Create] Index statistics: average key count = 132.646856, max key count = 3457358.000000, std dev = 1632.888478, percentil (99.00%) (count cutoff) = 1181.000000
[12:07:31 Create] Memory consumption: [currentRSS = 38466 MB, peakRSS = 49390 MB]
[12:07:31 SetupIndex_] Finished building index.
[12:07:31 SetupIndex_] Storing the index to file: '/data/ydliu/Reference/mouse_GRCm38.fa.gmidx'.
[12:13:57 Index] Memory consumption: [currentRSS = 35868 MB, peakRSS = 49390 MB]
[12:13:57 Run] Hits will be thresholded at the percentil value (percentil: 99.000000%, frequency: 1181).
[12:13:57 Run] Minimizers will be used. Minimizer window length: 5
[12:13:57 Run] Reference genome is assumed to be linear.
[12:13:57 Run] One or more similarly good alignments will be output per mapped read. Will be marked secondary.
[12:13:57 ProcessReads] All reads will be loaded in memory.
[12:14:47 ProcessReads] All reads loaded in 49.92 sec (size around 3144 MB). (3213849871 bases)
[12:14:47 ProcessReads] Memory consumption: [currentRSS = 39749 MB, peakRSS = 49390 MB]

[1]+ Segmentation fault (core dumped) ~/software/graphmap2/bin/Linux-x64/graphmap2 align -x rnaseq -r /data/ydliu/Reference/mouse_GRCm38.fa -d /data2/ydliu/ONT_reads/SMRT/mouse/SRR6238555.fasta -o mouse_graphmap2.sam -t 24

from graphmap2.

ydLiu-HIT avatar ydLiu-HIT commented on September 26, 2024

The reference I was using: https://drive.google.com/file/d/1OwfUcsJ8iqvuKOR0UJpb1PYRYCl1pLbN/view?usp=sharing

from graphmap2.

jmaricb avatar jmaricb commented on September 26, 2024

I am looking into this right now, again. As I can see from your comment the tool crashed right after loading reads into the memory, before aligning single read, right? Right not that doesn't happen for me. It aligns reads slowly, but it hasn't crash yet. I will try to see what happens and will let you know.

from graphmap2.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.