Git Product home page Git Product logo

bambi2b's Introduction

BAMBI2b: Bayesian inference of sequence motifs with 2-block symmetry

Bacterial TF binding sites are usually symmetrical in sequence where the conserved (core) segments (blocks) are separated by a stretch of uninformative nucleotides and may be shifted from both ends (Figure 1). BAMBI-2b can estimate the length and location of such segments to better capture symmetric TF motifs.

Files: The file main.cpp needs to be compiled. This file has two dependencies: database.h and database.cpp, which must be in the same directory with the file main.cpp. To compile the program type “make” in command line within the same directory. The executable “BAMBI2b” will be generated.

The input sequences should be given in fasta format. The example data (sequences.fasta) includes the upstream sequences of a set of E. coli genes which have similar gene expression profiles in certain experiments, i.e., dinB, yafN, dinG, sulA, dinI, umuD, ydjM, yebG, recA, lexA, and recN, respectively.

Usage: ./BAMBI2b -i sequences.fasta -o bambi motifs.txt -P 10000 -LM 19 -UM 21 -LB 1 -UB 5 -LF 0 -UF 3 -r0 1 -r1 300 -s0 1 -s1 1 -s2 1 -d 0.25,0.26,0.25,0.24 -n 2 -sm
-i : (REQUIRED) (Path to) Input file name (in fasta format)
-o : (optional) Output file name (default: bambi motifs.txt)
-P : (optional) The number of samples to estimate the motif PWM from the sequence set. It is recommended to use 50 times the length of the longest input sequence (50 × (max sequence length) )
-LM : (optional) Lower limit for motif length M (default: 16)
-UM : (optional) Upper limit for motif length (default: 26)
-LB : (optional) Lower limit for the length of conserved block B (default: 0)
-UB : (optional) Upper limit for the length of conserved block B (default: UM/2)
-LF : (optional) Lower limit for the length of flanking uninformative sequence (default: 0)
-UF : (optional) Upper limit for the length of flanking uninformative sequence (default: UM - 2)
-r0 : (optional) The prior information regarding the amount of instances of the motif in each sequence. Unless specific knowledge is available, it is recommended to use 1 for r0 and to set r1 to be the average length of the input sequences (default: 1)
-r1 : (optional) (default: average length of input sequences)
-s0 : (optional) Expected number of instances with palindromic symmetry per input sequence (default: 1)
-s1 : (optional) Expected number of instances with inverted-repeat symmetry per input sequence (default: 1)
-s2 : (optional) Expected number of instances with direct-repeat symmetry per input sequence (default: 1)
-d : (optional) Background nucleotide distribution (default: 0.25,0.25,0.25,0.25)
-n : (optional) Number of repeated runs (default: 1)
-sm : (optional) Flag for searching minor (secondary) sites via maximum likelihood estimation (default: false)
-op : (optional) Block similarity metric. Use the option 2 for a faster block symmetry estimation. Option 2 ignores the default cross-entropy calculations between blocks, and only counts the symmetric base-word ratios. (default: 1)

-- For more detail, please see Readme.pdf --

Please cite:
Abdulkadir Elmas, Xiaodong Wang and Michael S. Samoilov
Reconstruction of novel transcription factor regulons through inference of their binding sites
BMC Bioinformatics 2015 16:299
https://doi.org/10.1186/s12859-015-0685-y

bambi2b's People

Contributors

aelmas avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.