Comments (8)

ericminikel avatar ericminikel commented on August 19, 2024 1

@neilfws they're in there as separate lines, i.e. the dataset is non-unique on chrom, pos, ref:

$ cat output/clinvar.tsv | cut -f1,2,3 | uniq -c | sort -nr | head
  15 3  37067119    T
   8 20 62046495    C
   4 X  153296529   G
   4 3  37089131    A
   4 3  37056036    G
   4 2  47702410    G
   4 2  189860905   G
   4 17 41276033    C
   4 17 41244936    G
   4 17 41219627    G
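The same check can be sketched in Python. This is a minimal illustration, assuming the file is tab-delimited and that its first three columns are chrom, pos and ref, as the `cut -f1,2,3` above implies. The function name `count_site_duplicates` and the sample rows are hypothetical, not taken from the repository or from ClinVar.

```python
import csv
import io
from collections import Counter


def count_site_duplicates(tsv_text, key_columns=3, top=10):
    """Count rows that share the same leading key columns.

    Assumption: the first `key_columns` fields are chrom, pos, ref.
    Returns (key, count) pairs for keys seen more than once,
    most frequent first.
    """
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    counts = Counter(tuple(row[:key_columns]) for row in reader if row)
    return [(key, n) for key, n in counts.most_common(top) if n > 1]


# Toy rows (made up for illustration): chrom, pos, ref, alt
sample = "\n".join([
    "1\t100\tA\tG",
    "2\t200\tC\tT",
    "1\t100\tA\tC",
    "1\t100\tA\tT",
    "2\t200\tC\tG",
    "3\t300\tG\tA",
])

duplicates = count_site_duplicates(sample)
for key, n in duplicates:
    print(n, *key)
```

Unlike `uniq -c`, which only merges adjacent identical lines, the `Counter` does not depend on the input being sorted.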

from clinvar.

dakl avatar dakl commented on August 19, 2024

I've now made a first version of the converter available at dakl/macarthur-clinvar-to-vcf. Feel free to check it out. It currently ignores the mut column from the TSV altogether until I know what to do with it.

thanks

ericminikel avatar ericminikel commented on August 19, 2024

mut tells you which allele, the reference or alternate, is the one to which the annotations (e.g. pathogenic assertion) refer.
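As a rough sketch of how a downstream consumer might use the column: the helper below picks the allele that the annotations refer to. It assumes that mut holds the literal string "REF" or "ALT"; the thread does not state the actual encoding, and the function name `asserted_allele` is hypothetical.

```python
def asserted_allele(ref, alt, mut):
    """Return the allele that the annotations refer to.

    Assumption: `mut` is the literal string "REF" or "ALT".
    The thread does not confirm this encoding.
    """
    if mut == "ALT":
        return alt
    if mut == "REF":
        return ref
    raise ValueError(f"unexpected mut value: {mut!r}")


# Toy values, not taken from ClinVar
print(asserted_allele("T", "C", "ALT"))  # the alternate allele
print(asserted_allele("T", "C", "REF"))  # the reference allele
```

Raising on any other value keeps an unexpected encoding from being silently treated as one allele or the other.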

I guess we should add a data dictionary!

neilfws avatar neilfws commented on August 19, 2024

What about cases where a variant has two or more alternate alleles? These do not seem to be represented in this TSV file.

hujingchu avatar hujingchu commented on August 19, 2024

@ericminikel
Hi, where can I find an explanation of how 'mut' is determined? ClinVar does not seem to provide this information.
Thanks

ericminikel avatar ericminikel commented on August 19, 2024

@hujingchu mut was originally (still is?) in the ClinVar XML file, parsed here: https://github.com/macarthur-lab/clinvar/blob/master/src/parse_clinvar_xml.py#L62. I hear the XML format has changed, though, and I haven't kept up, so I can't say whether it is still in there. Ask @bw2; he's taken over development of this repo.

hujingchu avatar hujingchu commented on August 19, 2024

@ericminikel thank you, I saw the annotations; the method is a little strange.
Looking forward to @bw2's advice.

bw2 avatar bw2 commented on August 19, 2024

@hujingchu @ericminikel
After spending some time looking into this, I think the mut column is outdated.
The current spreadsheet for submitting variants to ClinVar doesn't have instructions for submitting assertions about the REF allele, so I'll remove the mut column from future releases.
