Git Product home page Git Product logo

sambinder's Introduction

SAMbinder

Prediction of SAM binding sites

SAMbinder is a Python-based tool for predicting SAM interacting residue in a protein chain. It comprises of number of machine learning models for example, SVC model, Random Forest model, Artificial Neural Network model, which is implemented using Scikit package. These models are developed using widely used features like (i) Binary Profile of patterns and (ii) Evolutionary Information in the form of PSSM matrix generated using PSI-BLAST. Results are produced in the form of propensity score in between value 0-9. Residues showing the propensity score equal or above the selected threshold are said to be “Interacting” whereas residues showing value lesser than selected threshold are said to be “Non-Interacting”. Prediction model developed using evolutionary information where SVC machine learning technique was implemented performed best in our study.

Reference

Agrawal P, Mishra G, Raghava G P S (2020) SAMbinder: A Web Server for Predicting S-Adenosyl-L-Methionine Binding Residues of a Protein From Its Amino Acid Sequence. Frontiers in Pharmacology 10:1690

Web server

https://webs.iiitd.edu.in/raghava/sambinder/

PIP Installation

PIP version is also available for easy installation and usage of this tool. The following command is required to install the package

pip install sambinder

To know about the available option for the pip package, type the following command:

sambinder -h

Installation

Command for downloading SAMbinder

git clone https://github.com/raghavagps/sambinder

SAMbinder is an open source Python based software, which operates depending on the Python environment (Python version 3.3 or above) and can be run on multi-OS systems (such as Windows, Linux and Mac operating systems). Before running SAMbinder, user should make sure of all the following packages are installed in their Python environment: sys, wget, os, shutil, scipy, numpy(), pandas(), sklearn version 0.19.1, math and re. For convenience, we strongly recommend users to install the Anaconda Python 3.3 (or above) in their local machine. The software can be freely downloadable from https://www.anaconda.com/download/ . User also needs to download the model directories and blastpr floder in order to run the prediction. Please run the commands given below to download the model folders and blastpr folder

COMMANDS

wget -c http://webs.iiitd.edu.in/gpsr2/blastpr.zip 
wget -c http://webs.iiitd.edu.in/gpsr2/sam_models.zip  

After downloading these folders, untar these folders in the present directory using the command

unzip blastpr.zip
unzip sam_models.zip

For users who want to do prediction by using our SAMbinder package

cd to the “SAMbinder” folder which contains SAMbinder.py. This is the main code which is used to run machine learning and do prediction of SAM interacting residue in a given target protein. It allows users to do prediction using 5 different machine learning models. For more information, run:

python3 sambinder.py -h

Examples for users to do SAM interacting residue prediction.

The input protein sequence for SAMbinder.py should be in fasta format. Please find the example in example folder. The following parameters are required by SAMbinder.py

COMMAND

python3 sambinder.py -i <input_file> -o <output_file> -m <method> -t <threshold>

where,

  • <input_file>: Input file having sequence file in FASTA format
  • <output_file>: Output file generated by SAMbinder having prediction result
  • <threshold>: User defined threshold score (between 0-9)
  • <method>: Machine Learning method and the type of input feature it used

The value of method can be between 1-6 with each numeral representing the following prediction methods:

  1. Binary SVC
  2. Binary Random Forest
  3. Binary MLP
  4. Binary KNN
  5. PSSM SVC
  6. PSSM Random Forest

For more information type the following command

python3 sambinder.py –h

In our package, we have provided 6 different machine learning models which utilizes different features.

  • Method '1' is Support Vector Classifier which utilizes binary profile of the pattern as an input feature.
  • Method '2' is Random Forest Classifier which also utilizes binary profile of the pattern as an input feature.
  • Method '3' is Artificial Neural Network model developed using binary profile of the pattern as input feature.
  • Method '4' is K Nearest Neighbor method developed using binary profile of the pattern as input feature.
  • Method '5' is Support Vector Classifier which utilizes evolutionary information in the form of PSSM profile as an input feature.
  • Method '6' is Random Forest classifier which also utilizes evolutionary information in the form of PSSM profile as an input feature. The PSSM profile is generated using PSI-BLAST by running against the SwissProt database.

sambinder's People

Contributors

mishragauravgm avatar raghavagps avatar

Stargazers

 avatar

Watchers

 avatar  avatar

sambinder's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.