Comments (8)
I can report the very same for the yeast genome ... Reads used: SRR5989373 .
Have you been able to resolve the issue?
If I call it with --ambiguity 0.5 --secondary --min-bin-perc 0.01 --bin-step 0.99 --max-regions 20 --mapq -1 --spliced --chain-min-cov 40
(which according to the help is equivalent), no reads align ...
from graphmap2.
Hi @ydLiu-HIT ,
which reference were you using? Could you send the link or share it?
Thanks
from graphmap2.
Hi @jmaricb
since I got a very similar problem, maybe you could use my case instead:
I used the reads from SRR5989373 together with the ensembl 94 release:
ftp://ftp.ensembl.org/pub/release-94/gtf/saccharomyces_cerevisiae/Saccharomyces_cerevisiae.R64-1-1.94.gtf.gz
ftp://ftp.ensembl.org/pub/release-94/fasta/saccharomyces_cerevisiae/dna/Saccharomyces_cerevisiae.R64-1-1.dna_sm.toplevel.fa.gz
Thanks for looking into this!
from graphmap2.
Hi @ydLiu-HIT ,
which reference were you using? Could you send the link or share it?
Thanks
Ensembl genome with version 92 of GRCm38. link: ftp://ftp.ensembl.org/pub/release-92/fasta/mus_musculus/dna/Mus_musculus.GRCm38.dna.primary_assembly.fa.gz
from graphmap2.
@mjoppich @ydLiu-HIT I have located this segmentation fault. It was happening in the ksw2 aligner which would crash for very big references or very big queries. I have added this exception and tried it with this dataset that you linked: https://www.ncbi.nlm.nih.gov/sra/?term=SRR6238555
and it didn't crash.
Can you try with the newest commit and let me know if it still crashes.
from graphmap2.
Hi jmaricb:
I just re-run GraphMap2(v0.6.3) with the same read and reference as before, but it still gets as segmentation fault as follow:
[11:54:03 BuildIndexes] Loading reference sequences.
[11:55:35 SetupIndex_] Building the index for shape: '11110111101111'.
[11:55:55 Create] Allocated memory for a list of 1362768835 seeds (128 bits each) (0.00003 sec, diff: 19.92923 sec).
[11:55:55 Create] Memory consumption: [currentRSS = 7801 MB, peakRSS = 7889 MB]
[11:55:55 Create] Collecting seeds.
[11:55:55 Create] Minimizer seeds will be used. Minimizer window is 5.
[12:03:44 Create] [currentRSS = 37193 MB, peakRSS = 49390 MB] Sequence: 44/44, len: 91744698, name: 'chrY'''
[12:03:50 Create] Final memory allocation after collecting seeds: [currentRSS = 37694 MB, peakRSS = 49390 MB]
[12:03:50 Create] Sorting the seeds using 24 threads.
[12:06:33 Create] Generating the hash table.
[12:07:01 Create] Calculating the distribution statistics for key counts.
[12:07:02 Create] Index statistics: average key count = 132.646856, max key count = 3457358.000000, std dev = 1632.888478, percentil (99.00%) (count cutoff) = 1181.000000
[12:07:31 Create] Memory consumption: [currentRSS = 38466 MB, peakRSS = 49390 MB]
[12:07:31 SetupIndex_] Finished building index.
[12:07:31 SetupIndex_] Storing the index to file: '/data/ydliu/Reference/mouse_GRCm38.fa.gmidx'.
[12:13:57 Index] Memory consumption: [currentRSS = 35868 MB, peakRSS = 49390 MB]
[12:13:57 Run] Hits will be thresholded at the percentil value (percentil: 99.000000%, frequency: 1181).
[12:13:57 Run] Minimizers will be used. Minimizer window length: 5
[12:13:57 Run] Reference genome is assumed to be linear.
[12:13:57 Run] One or more similarly good alignments will be output per mapped read. Will be marked secondary.
[12:13:57 ProcessReads] All reads will be loaded in memory.
[12:14:47 ProcessReads] All reads loaded in 49.92 sec (size around 3144 MB). (3213849871 bases)
[12:14:47 ProcessReads] Memory consumption: [currentRSS = 39749 MB, peakRSS = 49390 MB]
[1]+ Segmentation fault (core dumped) ~/software/graphmap2/bin/Linux-x64/graphmap2 align -x rnaseq -r /data/ydliu/Reference/mouse_GRCm38.fa -d /data2/ydliu/ONT_reads/SMRT/mouse/SRR6238555.fasta -o mouse_graphmap2.sam -t 24
from graphmap2.
The reference I was using: https://drive.google.com/file/d/1OwfUcsJ8iqvuKOR0UJpb1PYRYCl1pLbN/view?usp=sharing
from graphmap2.
I am looking into this right now, again. As I can see from your comment the tool crashed right after loading reads into the memory, before aligning single read, right? Right not that doesn't happen for me. It aligns reads slowly, but it hasn't crash yet. I will try to see what happens and will let you know.
from graphmap2.
Related Issues (20)
- No reads are aligned? HOT 14
- Segmentation Fault on Yeast Transcriptome HOT 9
- clang error HOT 7
- Calling help return exit-code 1
- Sam file not create it killed HOT 1
- Error building with g++ 9.3
- fail to install the graphmap2 v0.6.4 HOT 6
- Graphmap2 index in shared memory
- make installation error
- Segmentation fault
- Header not contain PG line and duplicate SQ line
- Build fails on Linux ARM64 HOT 2
- `sse` should be used only on x86_64 HOT 1
- Alignment failed
- IGV not finding sequence from bam file generated from a graphmap2 sam file
- Memory up to 140GB!!!
- Please tag a release HOT 4
- Graphmap2 can't align dRNAseq reads with U HOT 1
- Counting alignment operations error HOT 6
- Compile error? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from graphmap2.