Git Product home page Git Product logo

gatk4-basic-joint-genotyping's Introduction

gatk4-basic-joint-genotyping

Basic joint genotyping with GATK4. NOT Best Practices, only for teaching/demo purposes.

Inputs and outputs

Required inputs

  • One or more per-sample GVCF files (.g.vcf), provided as an array
  • Genomic resources: reference genome in FASTA format (.fasta) and its accessory files (.fasta.fai and .dict)
  • List of intervals to process in GATK intervals list format (.list)

Optional inputs

  • Resourcing and environment parameters including memory, disk space and container are all customaizable

Outputs

  • A multi-sample VCF of variants joint-called across the cohort, block-gzipped (.gz) with tabix index (.gz.tbi)

Overview of the pipeline

This workflow consists of four steps:

RenameAndIndexFile

Ensures that the input GVCF files have the appropriate file extensions (.g.vcf.gz) and creates an index file (.tbi).

  • Per file, scattered by input file
  • Expects an input GVCF
  • Outputs a copy of the GVCF (renamed if it did not have the right extension) and its index file.

ImportGVCFs

Imports data from GVCF into a GenomicsDB datastore

  • Across all inputs, scattered by genome interval
  • Expects an array of input GVCFs
  • Outputs a tarred GenomicsDB datastore

GenotypeGVCFs

Applies joint genotyping to all samples present in the datastore

  • Across all inputs, scattered by genome interval
  • Expects a tarred GenomicsDB datastore
  • Outputs a VCF file with variant calls made across the cohort

MergeVCFs

Merges VCF files across intervals generated by the scatter above

  • Across genomic intervals
  • Expects an array of per-interval VCFs
  • Outputs the final cohort VCF

gatk4-basic-joint-genotyping's People

Contributors

jin0008 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.