Git Product home page Git Product logo

skirt's Introduction

SKIRT

Description

This is a project for a pipeline of high-resolution KIR allele annotations on human genome assemblies.

It uses several libraries including argparse, pandas, re, gzip, and Biopython. The script provides options for outputting in different formats, including BED, VCF, and HAP(CSV).

Library Installation

This project requires Python 3.6+ and the following Python libraries installed:

  • BioPython
  • pandas
  • argparse

You can install these packages using pip:

pip install biopython pandas argparse

To install this project, clone the SKIRT repository to your local machine:

git clone https://github.com/calvinckhung/skirt.git

Export your SKIRT directory

echo "export SKIRT_WD=<Your SKIRT directory>"

Update IPDKIR

We already include the 2.12.0 IPDKIR in the project. In case you want to update from scratch, clone the IPDKIR repository to the SKIRT directory:

cd skirt
git clone https://github.com/ANHIG/IPDKIR.git

This will overwrite the included IPDKIR files. Be careful.

Install Minimap2

For installation of minimap2, refer to its GitHub page https://github.com/lh3/minimap2. Ensure your $PATH variable contains the path to the minimap2 executable binary. Otherwise, export this variable:

echo "export MM2_BIN_PATH=<Your minimap2 binary path>"
echo "export PATH=$PATH:$MM2_BIN_PATH" >> ~/.bashrc

Install BLAST+

Don't forget to install tblastn, part of the BLAST+ suite:

cd ~/bin
wget ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/2.14.0/ncbi-blast-2.14.0+-x64-linux.tar.gz
tar -xzf ncbi-blast-2.14.0+-x64-linux.tar.gz
echo "export BLAST_BIN_PATH=<Path to tblastn executable binary>"
echo "export PATH=$PATH:$BLAST_BIN_PATH" >> ~/.bashrc

For a detailed installation description of BLAST+, please refer to https://www.ncbi.nlm.nih.gov/books/NBK569861/ If you have BLAST+ installed already, don't forget to specify your path as above.

Usage of SKIRT

To run SKIRT KIR allele annotation for your own assembly sequence, run below command under SKIRT directory:

./scripts/miniskirt.sh <path to the assembly sequence (.fa or .fa.gz)> <output path>

You will get *.BED, *.VCF and *.Hap.CSV files. Usually takes between 5~20 minutes.

skirt's People

Contributors

calvinckhung avatar michael-ford avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.