motif-finder's Introduction

Motif-finder

Bioinformatic tool for finding cliques in biological sequences.

This program loads FASTQ format sequences and uses a heuristic algorithm to find the best sequence motif.

How it works

First split your FASTQ sequences into separate files with the nucletide sequence in one file and quality in the other. Example files are available in the repository.

The program first reads the two files:

Then, it prompts you to enter the values of the k-mer size on which you want to find your biggest clique, as well as the thershold of the quality of reads. After that, a graph is built, with nodes being the representation of a subsequence the size of the k-mer input, where edges are made based on the quality threshold to other, simillar nodes.

A heuristic algorithm starts to analyse the graph, finding the best clique between the input sequences. The algorithm finds the clique in a polynomial time.

Recommend Projects

wojdob / motif-finder Goto Github PK

motif-finder's Introduction

Motif-finder

How it works

motif-finder's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent