Requires PERL (v.5.8 or higher), and a fortran90 compiler (prefered: ifort) and six additional packages to be pre-installed
- sc from ccp4 (http://www.ccp4.ac.uk/) [executable_name: sc]
- delphi v.6.2 (http://compbio.clemson.edu/delphi) [executable_name: delphi95]
- EDTSurf (http://zhanglab.ccmb.med.umich.edu/EDTSurf/) [executable_name: EDTSurf]
- Rosetta (https://www.rosettacommons.org/software/)
- ProQ2 (https://github.com/bjornwallner/ProQ_scripts/)
- needle from EMBOS (http://emboss.sourceforge.net/download/)
$ git clone https://github.com/bjornwallner/ProQDock
$ cd ProQDock
$ chmod +x compileF
$ ./compileF <fortran90-compiler> (Default: ifort)
open ProQDock
in any text editor and change the fullpaths for the following fields according to your system:
- sc_path=/software/apps/ccp4/ccp4-6.5.0/ccp4-6.5/bin
- delphi_path=/home/x_sabas/bin
- rosetta_path=/proj/wallner/users/x_bjowa/github/Rosetta/main/source/bin
- rosetta_db=/proj/wallner/users/x_bjowa/github/Rosetta/main/database
- svm_path=/home/x_sabas/proj/svm_light
- ESpath=/home/x_sabas/proj/EDTSurf
- proqpath=/home/x_sabas/proj/BACKUP/ProQ_scripts/bin
1. The coordinate (PDB) file for the model
2. The full-length target sequence
- The model must be a perfect or imperfect subset of the target sequence
- i.e., Any and all sub-sequence present in the model must be present in the tareget
- PDB file MUST contain corrdinates of geometrically fixed Hydrogen atoms
- preferably fixed by REDUCE v.2 or atleast compatible with the REDUCE format (http://kinemage.biochem.duke.edu/downloads/software/reduce/)
The full length (FASTA) target sequence (one letter amino acid code followed by a header starting with >) should be prepared and provided alongside the coordinate (PDB) file for the model.
You can generate the fasta sequence by using:
$ ProQDock/EXEC/pdb2fasta.pl <input.pdb> > input.pdb.fasta
$ ProQDock/ProQDock <model.PDB> <fulllength.fasta>
where,
-
model.pdb: The input pdb (coordinate file in Brrokheaven format; http://www.ccp4.ac.uk/html/procheck_man/manappb.html) file
-
fulllength.fasta: The fulllength fasta sequence of the target
-
optional arguments: -d (0/1) : delphi_mode:
0: Single_internal_dielectric (epsilon=2) 1: Multidielctric mode (internal dielectric modeled as a Gaussian Function)
If you intend to Run ProQDock for a list of PDB files (models) corresponding to the same target sequence, then, SKIP repeating the profile generation step more than once, by providing a [consistent basename] for the fasta file.
Launch the following bash script which runs on a set of (example) models kept at ./EXAMPLE_CASE_STUDY/
$ ./runex.bash
EXAMPLE OUTPUT:
$ cat EXAMPLE_CASE_STUDY/D1OYV-models.status.results
Model fnat LRMS iRMS CAPRI_class DockQ ProQDock
D1OYV-a102a 0.000 35.348 15.846 Incorrect 0.02118 0.05908
D1OYV-a104c 0.000 44.703 16.626 Incorrect 0.01432 0.21024
D1OYV-a106c 0.000 44.600 18.136 Incorrect 0.01395 0.03890
D1OYV-a115c 0.000 31.946 11.941 Incorrect 0.02722 0.02730
D1OYV-a61c 0.652 6.820 2.334 Acceptable 0.51761 0.51060
D1OYV-a61d 0.630 6.574 2.450 Acceptable 0.50960 0.46046
D1OYV-a81d 0.587 6.788 2.481 Acceptable 0.48841 0.42648
D1OYV-a41b 0.717 3.619 1.428 Medium 0.69617 0.48751
D1OYV-a62c 0.707 3.991 1.628 Medium 0.66168 0.55795
D1OYV-a82d 0.707 4.347 1.593 Medium 0.65639 0.48664
Example Output for a single PDB file:
$ ./ProQDock ./TESTPDBS/1u07.pdb ./TEMP/1u07.pdb.fasta
===========================================================================================
Your Input: 1u07.pdb
The file has a Proper extension (.pdb / .PDB)
Normal Delphi (single internal dielectric, epsilon = 2.0) will be executed
===========================================================================================
Total number of residues in the Complex: 178
EC=0.11
Sc=0.767
rGb=0.07170
Ld= 0.14815
nBSA=0.11534
Fintres=0.28652
CPscore= 0.48131
CPM= 0.89454
rTs=0.08767
Isc=0.00002
Erep=0.27086
Etmr=0.21257
ProQ= 0.845
======================== The ProQDock Score ====================
0.85573
================================================================
Feature Range Favorable Directions Description Feature_Type Stability** Native_values (DB3)
EC: [-1,1] Higher the Better Electrostatic balance at the interface Interface Variable 0.20 (+/-0.38)
Sc: [-1,1] Higher the Better Geometric / Steric fit at the interface Interface Stable 0.68 (+/-0.17)
rGb: [-1,1] Higher the Better Distribution of hydrophobicity with respect to burial All_Atom Stable 0.06 (+/-0.02)
Ld: [0, 1] Ambiguous Link (packing) densely at the interface Interface Stable 0.15 (+/-0.17)
nBSA: [0, 1] Ambiguous Size of the interface relative to the whole complex Interface Variable 0.07 (+/-0.05)
Fintres: [0, 1] Ambiguous Size of the interface relative to the whole complex Interface Variable 0.19 (+/-0.13)
CPscore: [0, 1] Higher the Better Preference of different inter-residue contacts at the interface Interface Variable 0.35 (+/-0.17)
CPM: [0, 1] Higher the Better Likelyhood of (Sc,EC) as a function of nBSA compared to native Interface Variable 0.80 (+/-0.18)
rTs: [0, 1] Lower the Better Energetic Stability of the whole molecule All_Atom Stable 0.22 (+/-0.21)
Isc: [0, 1] Lower the Better Binding energy Interface Variable 0.20 (+/-0.33)
Erep: [0, 1] Lower the Better Unfavourable Energy due to atomic Clashes All_Atom Stable 0.41 (+/-0.15)
Etmr: [0, 1] Lower the Better Energetic Stability without condiderating clashes All_Atom Stable 0.28 (+/-0.16)
ProQ: [0, 1] Higher the Better Overall Structural Quality All_Atom Variable 0.80 (+/-0.12)
** In Correct Models
Finding correct protein–protein docking models using ProQDock
Sankar Basu and Björn Wallner*
Bioinformatics (2016) 32 (12): i262-i270.
doi: 10.1093/bioinformatics/btw257
The article is avialable online here: http://bioinformatics.oxfordjournals.org/content/32/12/i262.abstract