Git Product home page Git Product logo

Comments (7)

ericminikel avatar ericminikel commented on July 19, 2024

Interesting... is that field new? I haven't seen it before. At a quick glance it looks like there are >80K values in the file, not sure how many unique variants are actually annotated:

$ gunzip -c ClinVarFullRelease_2016-08.xml.gz | grep ModeOfInheritance | wc -l
   84328

Here are counts of the values in that field:

$ gunzip -c ClinVarFullRelease_2016-08.xml.gz | grep ModeOfInheritance | sed 's/.*\">//' | sed 's/<\/.*//' | sort | uniq -c
51208 Autosomal dominant inheritance
22387 Autosomal recessive inheritance
 951 Autosomal unknown
  49 Codominant
 127 Mitochondrial inheritance
 108 Other
  14 Sex-limited autosomal dominant
1209 Somatic mutation
 317 Sporadic
 947 X-linked dominant inheritance
4542 X-linked inheritance
 997 X-linked recessive inheritance
   5 Xlinked NA
  24 Xlinked recessive
  48 Xlinked unknown
   1 Y-linked inheritance
 158 autosomal dominant
  90 autosomal recessive
1146 autosomal unknown

@bw2 would you by any chance have time to look into this more? If ClinVar is really going to start filling in this field for a large subset of variants then I imagine this is something we'd want to support.

from clinvar.

vigneshravi avatar vigneshravi commented on July 19, 2024

Yes, our group is also interested in getting these fields annotated with the final TSV results.

THank you!

from clinvar.

bw2 avatar bw2 commented on July 19, 2024

Thanks for pointing out ModeOfInheritence. I've added it to the flatfile and vcf as inheritance_modes. The source for it appears to be an optional field in the clinvar submission spreadsheet - currently 19% of variants have it populated.

Also while looking at this, I noticed a couple other fields I hadn't seen before so I added them also:
disease mechanism - this might also be coming from the clinvar submissions spreadsheet (though I couldn't find a field for it).
prevalence and age of onset - clinvar takes these from orphanet.
xrefs - misc. other cross-reference info and ids packed into this column as key:value;key:value;...

I've created a pull request with these changes: #12
Any feedback on the code or data files is appreciated.

from clinvar.

vigneshravi avatar vigneshravi commented on July 19, 2024

Thank you for the additions. This is definitely helpful. Could you also add the origin info to it? It has germline, de novo and somatic as values for it.

from clinvar.

bw2 avatar bw2 commented on July 19, 2024

sure, since the table is changing, I'll add this one also.

from clinvar.

bw2 avatar bw2 commented on July 19, 2024

I've added the 'origin' column to PR #12 , and also put in a new 'gold_stars' column which just maps a variant's review_status to a number between 0 - 4. Any feedback is appreciated.

from clinvar.

vigneshravi avatar vigneshravi commented on July 19, 2024

Thank you again! This is very useful..

from clinvar.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.