Git Product home page Git Product logo

gowalk's Introduction

GOwalk is a personal project influenced by GeneWalk https://churchman.med.harvard.edu/genewalk

This version constructs a PPI of gene products modified by the ubiquitin-like interferon-stimulated gene 15 (ISG15) connected to the associated enriched GO-terms acyclic graph using NetworkX.

GOwalk receives published proteomic data sets and performs random walks using node2vec generating latent representations of how each gene in the list relates to one another, and to their overall function.

The goal is to provide contextual insights for high-throughput data, offering important descriptions that help guide decision-making in research.

Why make it?

Whereas reductionism has yielded splendid results in science, there is an important sense in which it is artificial, and in this sense false. By starting from wholes and moving ‘down’ into parts, one is moving in the opposite direction from the way matters arise. Ursula Goodenough, The Sacred Emergence of Nature

Access to higher dimensional insights: machine-readable ontologies are used to identify interesting patterns and associations among your genes.

Emergene provides some cross-compatibility with R's bioconductor: This work is heavily influenced by tools such as GoSemSim, RRVGO, and g:Profiler.

How do I use it?

1. Initialize Enrichment() object with a project name and gene list

from enrich import *

name = 'project_name'
file = 'path_to_genes.csv'

data = Enrichment(name, file)   # automatically creates project directory

2. Navigating your genes

Once the object is initialized, you can view various attributes such as:

data.name   # project name
data.genes  # straight forward
data.pairwise   # PPIs from STRINGdb
data.network    # networkx representation
data.sub_network    # dict of spectral clusters as cluster:gene sublist pairs

3. Performing enrichment analysis

data.get_gprofiler()    # stores enrichment data from gprofiler as data.enrichment
data.get_semantics(ont='BP, MF, or CC')    # returns a similarity matrix (sem_sim) and semantically reduced df (sem_red)

data.enrichment
data.sem_sim
data.sem_red

4. Visualizing the data

In progress

Dependencies

Future attributes to be added

  • Notebook format
  • Shell compatibility
  • Semantic similarity analysis
  • Semantic reduction
  • Class with visualization methods
  • Save states for easy re-loading
  • More species

gowalk's People

Contributors

gaelanm avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.