Git Product home page Git Product logo

bjornwallner / proqdock Goto Github PK

View Code? Open in Web Editor NEW
9.0 3.0 5.0 29.19 MB

ProQDock is a Protein-Protein docking scoring program that uses high level coordinate driven features compiled and trained by a Support Vector Regression algorithm. The current version has 13 features: both all-atom and interface features. ProQDock should also serve to develop hybrid methods in future attempts to solve this 'hard' problem.

Perl 5.44% Fortran 12.16% Shell 2.80% C++ 68.60% C 1.35% Makefile 0.08% Python 9.57%

proqdock's Introduction

ProQDock

Requires Python 3.x, absl-py and numpy

  1. sc from ccp4 (http://www.ccp4.ac.uk/) [executable_name: sc]
  2. delphi v.6.2 (http://compbio.clemson.edu/delphi) [executable_name: delphi95]
  3. Rosetta (https://www.rosettacommons.org/software/)
  4. ProQ2 (https://github.com/bjornwallner/ProQ_scripts/)
  5. needle from EMBOS (http://emboss.sourceforge.net/download/)

Installation

$ git clone https://github.com/bjornwallner/ProQDock
$ cd ProQDock
$ ./compileF # Assumes you have gfortran installed

edit the flagfile and change the fullpaths for the following fields according to your system and save it as myflagfile:

--sc_path=/software/presto/software/CCP4/7.1.016-foss-2019b-SHELX-ARP-8.0-1/ccp4-7.1/bin/
--delphi_path=/proj/wallner/users/x_bjowa/local/DelPhi_Linux_SP_F95/
--rosetta=/proj/wallner/apps/rosetta/Rosetta/main/
--svm_path=/proj/wallner/users/x_bjowa/local/svm_light_linux64/
--proqpath=/proj/wallner/users/x_bjowa/github/ProQ_scripts/bin/

Running:

    1. The coordinate (PDB) file for the model
    2. The full-length target sequence (in FASTA format) 
  • The model must be either the full-length sequence or subset of the target sequence
  • i.e., Any and all sub-sequence present in the model must be present in the tareget
  • PDB file MUST contain corrdinates of geometrically fixed Hydrogen atoms
  • preferably fixed by REDUCE v.2 or atleast compatible with the REDUCE format (http://kinemage.biochem.duke.edu/downloads/software/reduce/)

Preparatory Step:

The full length (FASTA) target sequence (one letter amino acid code followed by a header starting with >) should be prepared and provided alongside the coordinate (PDB) file for the model.

You can generate the fasta sequence by using:

$ EXEC/pdb2fasta.pl <input.pdb>  >  input.pdb.fasta

Run Step:

$ ./run_ProQDock.py <model.PDB> <full-length.fasta> --flagfile myflagfile

where,

If you intend to Run ProQDock for a list of PDB files (models) corresponding to the same target sequence, then, SKIP repeating the profile generation step more than once, by providing a [consistent basename] for the fasta file.

Experimental --AF flag will use the average plddt from AlphaFold model instead of ProQ2.

$ ./run_ProQDock.py <model.PDB> --AF --flagfile myflagfile

The following bash script which runs on a set of (example) models kept at ./EXAMPLE_CASE_STUDY/

$ ./runex.bash

Example Output for a single PDB file:

$ ./run_ProQDock.py ./TESTPDBS/1u07.pdb ./TEMP/1u07.pdb.fasta --flagfile flagfile
                                                  ....                   ....                                      ..      
                                               .n~8888888nx           .xH888888Hx.                            < .z@8"`      
     .d``            .u    .          u.     :88>'8888888888:       .H8888888888888:           u.              !@88E        
     @8Ne.   .u    .d88B :@8c   ...ue888b   :8888 "*888888888k      888*"""?""*88888X    ...ue888b        .    '888E   u    
     %8888:u@88N  ="8888f8888r  888R Y888r  '88888.         "8>    'f     d8x.   ^%88k   888R Y888r  .udR88N    888E u@8NL  
      `888I  888.   4888>'88"   888R I888>   ?88888          'X    '>    <88888X   '?8   888R I888> <888'888k   888E`"88*"  
       888I  888I   4888> '     888R I888> ?  %888!           !     `:..:`888888>    8>  888R I888> 9888 'Y"    888E .dN.   
       888I  888I   4888>       888R I888>  ".:88"            !            `"*88     X   888R I888> 9888        888E~8888   
     uW888L  888'  .d888L .+   u8888cJ888     xHH8Hx.        .X  :    .xHHhx.."      !  u8888cJ888  9888        888E '888&  
    '*88888Nu88P   ^"8888*"     "*888*P"    :888888888hx....x\8..X   X88888888hx. ..!    "*888*P"   ?8888u../   888E  9888. 
    ~ '88888F`        "Y"         'Y"      :~  `"8888888888!`'8888  !   "*888888888"       'Y"       "8888P'  '"888*" 4888" 
       888 ^                                      `""*8*""`   "*"          ^"***"`                     "P'       ""    ""   
       *8E                                                                                                                  
       '8>                                                                                                                  
        "
=================================================================================================================================
Feature   Range  Direction  Description                                            Feature_Type   Stability   Native_values (DB3)
---------------------------------------------------------------------------------------------------------------------------------
EC:       [-1,1] Positive   Electrostatic balance at the interface                 Interface      Variable    0.20 (+/-0.38)
Sc:       [-1,1] Positive   Geometric / Steric fit at the interface                Interface       Stable     0.68 (+/-0.17)
rGb:      [-1,1] Positive   Distribution of hydrophobicity with respect to burial  All_Atom        Stable     0.06 (+/-0.02)
Ld:       [0, 1] Ambiguous  Link (packing) densely at the interface                Interface       Stable     0.15 (+/-0.17)
nBSA:     [0, 1] Ambiguous  Size of the interface relative to the whole complex    Interface      Variable    0.07 (+/-0.05)
Fintres:  [0, 1] Ambiguous  Size of the interface relative to the whole complex    Interface      Variable    0.19 (+/-0.13)
CPscore:  [0, 1] Positive   inter-residue contacts preference                      Interface      Variable    0.35 (+/-0.17)
CPM:      [0, 1] Positive   Likelihood of (Sc,EC) given nBSA compared to native    Interface      Variable    0.80 (+/-0.18)
rTs:      [0, 1] Negative   Energetic Stability of the whole molecule              All_Atom        Stable     0.22 (+/-0.21)
Isc:      [0, 1] Negative   Binding energy                                         Interface      Variable    0.20 (+/-0.33)
Erep:     [0, 1] Negative   Unfavourable Energy due to atomic Clashes              All_Atom        Stable     0.41 (+/-0.15)
Etmr:     [0, 1] Negative   Energetic Stability without condiderating clashes      All_Atom        Stable     0.28 (+/-0.16)
ProQ2:    [0, 1] Positive   Overall Structural Quality                             All_Atom       Variable    0.80 (+/-0.12)
---------------------------------------------------------------------------------------------------------------------------------
ProQDock  [0, 1] Positive   The predicted DockQ score, Basu and Wallner, 2016, https://doi.org/10.1371/journal.pone.0161879
   DockQ Statistics on CAPRI data:  
    0    <  DockQ <  0.23 - Incorrect
    0.23 <= DockQ <  0.49 - Acceptable quality
    0.49 <= DockQ <  0.80 - Medium quality
            DockQ >= 0.80 - High quality
=================================================================================================================================
EC=0.259
Sc=0.495
rGb=0.028
Ld=0.102
nBSA=0.054
Fintres=0.193
CPscore=0.161
CPM=0.422
Isc=0.923
rTs=0.898
Eatr=0.085
Erep=0.549
Etmr=0.873
ProQ2=0.794
==========================
ProQDock=0.266

Reference

  Finding correct protein–protein docking models using ProQDock
  Sankar Basu and Björn Wallner* 
  Bioinformatics (2016) 32 (12): i262-i270. 
  doi: 10.1093/bioinformatics/btw257

The article is avialable online here: http://bioinformatics.oxfordjournals.org/content/32/12/i262.abstract

proqdock's People

Contributors

bjornwallner avatar nemo8130 avatar

Stargazers

 avatar Kevin D. Oliphant avatar  avatar Akshay avatar xvazquezc avatar Kristian Want avatar Arita Zhang avatar  avatar Alexander Goncearenco avatar

Watchers

James Cloos avatar  avatar  avatar

proqdock's Issues

error!!

File "run_ProQDock.py", line 185
logging.info(f"Found these chains: {sorted(pdb_data['pdb_chains'].keys())}")
^
SyntaxError: invalid syntax

How do I resolve this error

needle from the EMBOSS package needs to be installed

I have installed EMBOSS but "needle from the EMBOSS package needs to be installed" is still displayed when I run example file.
Furthermore I export EMBOSS's 'bin' in the area, it still not work.
How can I fix it.
Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.