Git Product home page Git Product logo

duartemolha / convading_reload Goto Github PK

View Code? Open in Web Editor NEW

This project forked from molgenis/convading

1.0 1.0 3.0 7.19 MB

Fork of the CoNVaDING software - Copy Number Variation Detection In NGS Gene panels was designed for small (single-exon) copy number variation (CNV) detection in high coverage NGS data

License: GNU Lesser General Public License v3.0

Perl 30.24% Shell 0.25% HTML 69.51%

convading_reload's People

Stargazers

 avatar

Watchers

 avatar  avatar

convading_reload's Issues

Get off_target complement of input bed tile

as a first step in the detection of copy number on off-target we need to create a complement bed file of input data.

This required the input by the user of the genome assembly or alternatively, we can try and autodetect from the BAM file inputs.

As a possible improvement to this method I will investigate removing from the complement region of homology to the targeted regions (since these will, by definition suffer some amplification bias (on non-amplicon based methods)

Code crashes on Grch38 reference

Code crashes on Grch38 reference available in the GATK bundle. this is due to the fact this contains 'chr' style chromosomes and 'HLA-' type contigs

Ideally the code below should just read the header to an hash and validate each line of input to that hash. and if not the line/linenumber and maybe the chromosomes matched against.

https://github.com/duartemolha/CoNVaDING_reload/blob/master/CoNVaDING.pl#L2845-L2914

Lazy fix:
https://github.com/duartemolha/CoNVaDING_reload/blob/master/CoNVaDING.pl#L2874 change to if ($chr =~ m/^chr.+|^HLA-.+/gs) { and test.

CBS log ratio CNV detection

investigate the possibility of adding CBS log ratio CNV detection using the normalised counts.

I think it would be interesting to add another orthogonal method of CNV detection that can be used to confirm or give additional support to the aberrations detected.

make norm-count files include the description of how those norm counts were calculated

Currently when creating the norm-count files from input bam files, the user can choose to either keep or remove duplicates from the input. For amplicon based methods, we do not want to remove duplicates, however for hybridization-based methods we do.

I think the norm-count files should contain within them the information about how those counts where achieved, especialy if the input was filtered for duplicates or not.

This will in turn enable automatic selection of only controls that where calculated the same way as the sample we want to analyse.

For example, we could have a controls folder with 60 control samples (30 for amplicon based inputs) and 30 of hybridization inputs.

When user would select a new sample to analyse and included -rmdups parameter, the script can then read the controls folder and only select the best controls from the subset of 30 that are compatible.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.