Comments (7)
Hi. What version are you running? I think there was a bug in v1.3.4 that prevented merging of translocations. Hopefully v1.3.5 should fix things
from dysgu.
Closing this for now as I think it should be fixed
from dysgu.
Hello, as of version 1.3.9 I do not believe the merging is fixed for translocations. For example, I get the following lines (first 8 columns) in my merged vcf file:
2L 54217 467217 G . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54218;CHR2=2R;CHR2_POS=716508;GRP=11060;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=4;WR=0;PE=4;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.129;MaxPROB=0.129
2L 54231 567511 T . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54232;CHR2=2R;CHR2_POS=716509;GRP=199545;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=5;WR=0;PE=5;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.125;MaxPROB=0.125
2L 54240 978912 T . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54241;CHR2=2R;CHR2_POS=716509;GRP=74302;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=4;WR=0;PE=4;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.135;MaxPROB=0.135
2L 54240 365308 T . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54241;CHR2=2R;CHR2_POS=716522;GRP=66880;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=3;WR=0;PE=3;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.12;MaxPROB=0.12
2L 54244 735714 C . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54245;CHR2=2R;CHR2_POS=716515;GRP=154395;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=21;WR=0;PE=21;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.185;MaxPROB=0.185
2L 54255 1066664 A . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54256;CHR2=2R;CHR2_POS=716515;GRP=75082;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=11;WR=0;PE=11;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.134;MaxPROB=0.134
2L 54279 838209 T . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54280;CHR2=2R;CHR2_POS=716508;GRP=220145;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=5;WR=0;PE=5;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.101;MaxPROB=0.101
2L 54365 707335 A . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54366;CHR2=2R;CHR2_POS=716512;GRP=137288;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=6;WR=0;PE=6;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.133;MaxPROB=0.133
2L 54791 19056 C . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54792;CHR2=2R;CHR2_POS=716519;GRP=97622;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=7;WR=0;PE=7;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.153;MaxPROB=0.153
2L 54830 86198 A . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54831;CHR2=2R;CHR2_POS=716552;GRP=90866;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=4;WR=0;PE=4;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.084;MaxPROB=0.084
2L 54839 37783 A . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54840;CHR2=2R;CHR2_POS=716518;GRP=38716;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=6;WR=0;PE=6;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.136;MaxPROB=0.136
2L 54864 145869 G . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54865;CHR2=2R;CHR2_POS=716522;GRP=114336;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=4;WR=0;PE=4;SR=0;SC=0;BND=0;LPREC=1;RT=pe;MeanPROB=0.11;MaxPROB=0.11
I believe they should all be merged as a single translocation. I ran dysgu merge with "--merge-dist 1000". Thanks for any help.
from dysgu.
Hi @tbenavi1,
Sorry I thought I had fixed this issue. I will take a look now
from dysgu.
Ive uploaded a patch which I think resolves the issue, however you will have to build from source until the patch is released to pypi for pip install dysgu
.
Also I think the option --merge-within True
might help when using the merge command - this performs an extra round of merging at the sample level before merging between samples. It can help increase the degree of merging
I managed to recreate the problem from the data you sent (the dummy vcf file is attached as a zip):
Running dysgu merge --merge-within True issue22.vcf issue22.vcf
resulted in two vcf records in the output:
2L 54217 2 G <TRA> . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54218;CHR2=2R;CHR2_POS=716508;GRP=11060;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=8;WR=0;PE=0;SR=0;SC=8;BND=8;LPREC=1;RT=pe;MeanPROB=0.657;MaxPROB=0.657 GT:GQ:MAPQP:SU:WR:PE:SR:SC:BND:COV:NEIGH10:PS:MS:RMS:RED:BCC:FCC:ICN:OCN:PROB 1/1:14:42.75:4:0:0:0:4:4:7.84:5:1:3:83:11:9:0.086:2.92:0.25:0.657 1/1:14:42.75:4:0:0:0:4:4:7.84:5:1:3:83:11:9:0.086:2.92:0.25:0.657 2L 54791 3 C <TRA> . lowProb SVMETHOD=DYSGUv1.3.9;SVTYPE=TRA;END=54792;CHR2=2R;CHR2_POS=716519;GRP=97622;NGRP=1;CT=3to5;CIPOS95=0;CIEND95=0;GC=0.0;NEXP=0;STRIDE=0;EXPSEQ=;RPOLY=0;OL=0;SU=8;WR=0;PE=0;SR=0;SC=8;BND=8;LPREC=1;RT=pe;MeanPROB=0.657;MaxPROB=0.657 GT:GQ:MAPQP:SU:WR:PE:SR:SC:BND:COV:NEIGH10:PS:MS:RMS:RED:BCC:FCC:ICN:OCN:PROB 1/1:14:42.75:4:0:0:0:4:4:7.84:5:1:3:83:11:9:0.086:2.92:0.25:0.657 1/1:14:42.75:4:0:0:0:4:4:7.84:5:1:3:83:11:9:0.086:2.92:0.25:0.657
issue22.vcf.zip
from dysgu.
Thank you so much! I'll wait to double check once the patch is released to pypi. Feel free to close the issue for now. I can reopen if I still find an issue.
from dysgu.
v1.3.10 is now on pypi, hopefully that should resolve the issue. Closing this for now
from dysgu.
Related Issues (20)
- Error with --search option HOT 11
- Generating Alternative Reference HOT 16
- Run OSError: [Errno 24] Too many open files Mac OS M HOT 4
- OverflowError: can't convert negative value to size_t HOT 2
- Dysgu filter IndexError: string index out of range HOT 6
- long reads default mapq lowered to 1: help text for dysgu call still says pacbio and nanopore mode has --mq 20 HOT 1
- When will docker image with new release be available? HOT 1
- Got an warning when Loading Model in "dysgu run" HOT 1
- problems genotyping, dysgu run --sites HOT 3
- clarification needed on RG and samples HOT 4
- Getting SV length in dysgu output vcf HOT 3
- _pickle.UnpicklingError: invalid load key, 'A'. Failed to read from standard input: unknown file type HOT 2
- Subject: Inquiry on Benchmarking DEL and INS Events with dysgu Pipelines. HOT 35
- TypeError: an integer is required when using --sites option and manta.vcf HOT 6
- When combining a large number of samples, the speed is very slow HOT 13
- When merging a large number of samples, the process is very slow
- Long run time HOT 13
- Parameters for R9 Guppy2, 4, 6 HOT 4
- Process_KILLED HOT 7
- Paired-end reads calling sv HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dysgu.