Git Product home page Git Product logo

cs5228_project's Introduction

Exploring Gene Expression Data Using Weighted Gene Co-expression Network Analysis Across Multiple Cancer Types

CS5228 Knowledge Discovery and Data Mining

A.V. AKILA RAVIHANSA PERERA โ€“ A0212216X

TRAN KHANH HUNG โ€“ A0212253W

Method

  • WGCNA

Weighted gene correlation network analysis (WGCNA) is a powerful method that uses a topological overlap module approach for constructing co-expression networks based on gene expression data. This method involves reconstructing gene co-expression modules and summarizing modules using module eigengenes (ME) and intramodular hub genes.

  • Gene Enrichment and Pathway Analysis

Biologically interesting modules were identified using Fisher's exact test. The overlapping and union sets of genes from theses interesting gene module pairs were subjected to Gene Set Enrichment Analysis using topGO package).

Dataset

Three gene expression datasets for three cancer types (GBM, OV, BRCA) were selected from TCGA (The Cancer Genome Atlas).

Analysis

Data Exploration

Description BRCA GBM OV
Filter threshold (CV) 0.5 0.5 0.5
Sample Clustering Dendrogram
deepSplit 3 3 3
minClusterSize 30 30 30
Number of gene modules 19 19 20



Scale Free Topology Model Mean Connectivity Selected Soft Threshold
9
3
6

Clustering Tree

Genes Module Eigengenes
GBM GBM
GBM GBM
GBM GBM



Gene Expression Network

Network Heatmap Eigengene Adjacency Heatmap Eigengene Dendrogram

Pairwise Analysis of Gene Modules

Overview

Item BRCA and GBM GBM and OV OV and BRCA
--- Intersection ---
Name black_darkred_genes.txt black_brown_genes.txt turquoise_black_genes.txt
Lowest p-value 0 0 0
--- Unique (A) ---
Name black_genes.txt black_genes.txt turquoise_genes.txt
Lowest p-value 0 0 0
--- Unique (B) ---
Name darkred_genes.txt brown_genes.txt black_genes.txt
Lowest p-value 0 0 0



Gene Overlap Across Module Pairs

BRCA and GBM GBM and OV OV and BRCA
BRCA_GBM GBM_OV OV_BRCA
BRCA_GBM GBM_OV OV_BRCA



Project Structure and Run Instructions

  • Download and extract datasets to ./data directory

  • Install dependencies (R packages)

    • Run 0_install_dependencies.R
  • Gene filtering

    • Run 1_wgcna_cluster.R
  • Build gene expression network and identify gene modules

    • Run 2_module_detection.R
  • Generate gene network plots

    • Run 3_network_visualization.R
  • Gene Enrichment Analysis

    • Run 4_gene_enrichment.R

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.