Git Product home page Git Product logo

inferreg's Introduction

InferReg

Introduction

InferReg is a GNN-based tool for inferring gene regulatory networks from high-throughput genomic data. It leverages the latest advances in machine learning to provide users with an accurate and efficient way to predict regulatory interactions between transcription factors (TFs) and target genes.

Usage

Here is a step-by-step guide on how to use InferReg:

environment setup

Users of InferReg need to correctly set up Python 3, pandas, PyTorch, and PyTorch Geometry (PyG) in the local environment. To facilitate a seamless installation process, please refer to the following comprehensive guides tailored for each library:

  1. Python3 >= 3.8.
  2. pandas: Pandas can be easily installed using pip, Python's package installer.
  3. PyTorch: To install PyTorch with support for your specific hardware (CPU or GPU) and operating system, visit the official installation page: https://pytorch.org/get-started/locally/
  4. PyTorch Geometric (PyG): For installing PyG, a library dedicated to deep learning on irregularly structured data such as graphs, consult its official documentation at: https://pytorch-geometric.readthedocs.io/en/latest/index.html

Data Preparation

Prepare the following files in the code/data/raw directory:

  • gene_expression.csv: A CSV file containing gene expression data with gene names in rows and sample names in columns. Below is a sample snapshot of the file: gene_expression file sample snapshot
  • edges.tsv: A TSV file with edge information, where each row represents a TF and its target genes. Below is a sample snapshot of the file: edges file sample snapshot
  • pos_edges.tsv: A TSV file containing positive edge information validated by ChIP-Seq data. pos_edges file sample snapshot

Sample data files(Arabidopsis) are provided at the https://doi.org/10.5281/zenodo.11176206

Data Preprocessing

Navigate to the code/src/data_preprocess directory and run python data_loader.py to load the input data.

Model Training

To train the model, execute python model_fit.py from the code directory. There is no need to set any parameters.

Prediction

To make a prediction, run python predict.py from the code directory. The predicted regulatory relationships will be saved in the data/predicted/at_edges.tsv file. The path data/predicted will be automatically generated upon the first execution.

inferreg's People

Contributors

shengtudai2 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.