Git Product home page Git Product logo

woof's Introduction

woof's People

Contributors

pdiakumis avatar

Watchers

 avatar  avatar

woof's Issues

Read input as json

Instead of a headerless tsv, create a json with the ability to use key-values.

Improve comparison interface

Examples:

DNA

bcbio

woof compare 
  --mode bcbio-dna
  --bc1 /path/to/run1/sampleA/final
  --bc2 /path/to/run2/sampleA/final 

umccrise

woof compare
  --mode umccrise
  --um1 /path/to/run1/sampleA/umccrised
  --um2 /path/to/run2/sampleA/umccrised

RNA:

woof compare 
  --mode bcbio-rna
  --bc1 /path/to/run1/sampleA/final
  --bc2 /path/to/run2/sampleA/final 

Questions

  • How to detect genome assembly (hg37 vs. hg38)

Might be able to get that info from the bcbio config

Copy recursively when dir exists

Fix recursive copy function. Getting the below when re-starting and the WDL files already exist:

Directory not copied. Error: [Errno 17] File exists : woof/work/wdl'

Explore MuTect2 FPs compared to other callers

From Oliver (https://umccr.slack.com/archives/C025TLC7D/p1553825820038500):

Diff bcbio 1.1.3 / 1.1.4. Not surprised we're seeing those changes. A fair number of those 'false positives' (new MuTect2 calls) might actually now just be calls that VarDict/Strelka2 already had.
Which made me think of the easiest evaluation metric: how many of the 'new' (potential FP) calls for a caller have support from the other callers? And respectively, how many of the false negative calls (lost in the new version) are unique to that caller?

Read Cromwell + AWS Batch docs

Automate Manta SV isec

Currently all happens withing Rmd report.
Workflow would go like:

  • Manta1 + Manta2
  • isec
  • Eval Table, Circos with FP/FN

Normalise VCFs

From https://github.com/vladsaveliev/vcf_stuff/blob/master/README.md

  1. Split multi-allelic variants into single sample records.

For instance, split one record

#CHROM  POS     ID      REF     ALT
1       10       .      A       T,C

Into 2 separate records

#CHROM  POS     ID      REF     ALT
1       10       .      A       T
1       10       .      A       C

For that, we are using vt tools:

vt decompose -s vcf_file
  1. Decompose biallelic block substitutions.

For instance, split the following one records:

#CHROM  POS     ID      REF     ALT
1       20      .       AG      CT       

into 2 separate ones:

#CHROM  POS     ID      REF     ALT
1       20       .      A       C
1       20       .      G       T

We are using for that vcflib's vcfallelicprimitives:

vcfallelicprimitives -t DECOMPOSED --keep-geno vcf_file
  1. Left-align and normalize indels, check if REF alleles match the reference.

For instance, given that the reference chromosome 1 starts with GCTCCG, split the following records

#CHROM  POS     ID      REF     ALT
1       2       .       CTCC    CCC,C,CCCC

into the following 3:

#CHROM  POS     ID      REF     ALT
1       1       .       GCTC    G
1       2       .       CT      C
1       3       .       T       C

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.