Git Product home page Git Product logo

moclim's Introduction

MoCLIM: Towards Accurate Cancer Subtyping via Multi-Omics Contrastive Learning with Omics-Inference Modeling

This repository contains the implementations of MoCLIM, presented at the 32nd ACM International Conference on Information and Knowledge Management (CIKM '23).

Original paper: [https://dl.acm.org/doi/10.1145/3583780.3614970].

Presentation&Slide: [https://youtu.be/26uYBmsyiLM]

MoCLIM, developed by Ziwei Yang, Zheng Chen, Yasuko Matsubara, and Yasushi Sakurai, introduces a novel approach to cancer subtype identification using multi-omics contrastive learning with omics-inference modeling.

MoCLIM Framework Overview
image

An overview of the MoCLIM workflow: (1) MoCLIM takes multi-omics data as input. (2) Omics-specific encoders parallelly learn latent features for each omics source. (3) Multi-omics contrastive learning with a contrastive anchor integrates the learned features. The clustering is implemented on the integrated feature. (4) Comprehensive biomedical evaluations following the feature learning help users understand the results generated by MoCLIM.


Background

Each specific cancer comprises multiple subtypes. These subtypes refer to groups of patients with specific biochemical mechanisms that require tailored therapeutic strategies.

Different cancer subtypes often share the same morphological traits. This can result in high similarity in histopathological images. However, the differences can be found across various omics levels.

Omics Data for Cancer Subtyping
image

Motivation

We position MoCLIM in a biological axiom: genome-wide transcriptomics analysis is the mainstay of omics studies.

In this schematic diagram, various omics information, including genomics (DNA) and proteomics (protein) data, are interconnected through processes like gene expression and regulation at the mRNA (transcriptomics) level.

Biological Observation: Transcriptomics as a Focal Point
image

Cancer Subtyping Results & Biomedical Evaluations

Experimental results on six cancer datasets demonstrate that our approach significantly improves data fit and subtyping performance in fewer high-dimensional cancer instances.

Moreover, our framework incorporates various medical evaluations as the final component, providing high interpretability in medical analysis.

Cancer Subtyping Examples
image
Gene Set Enrichment Analysis Examples
image

Implementation Details

The implementation details and code can be found in the "MoCLIM_main". Make sure to follow the setup instructions below to run the code successfully.

Setup

To set up the project environment, install the required dependencies using pip:

pip install -r requirements.txt

If you're using a CUDA version other than 10.2, please ensure PyTorch is installed for the appropriate CUDA version. Refer to the instructions for detailed instructions.

Citation

If you find our work helpful for your research, please consider citing our paper:

@inproceedings{MoCLIM,
author={Yang, Ziwei and Chen, Zheng and Matsubara, Yasuko and Sakurai, Yasushi},
booktitle = {Proceedings of the 32nd ACM International Conference on Information and Knowledge Management}, 
title={MoCLIM: Towards Accurate Cancer Subtyping via Multi-Omics Contrastive Learning with Omics-Inference Modeling}, 
year={2023},
series = {CIKM '23}
pages={2895โ€“2905}}

Thank you for your interest in our research. For any questions or inquiries, feel free to reach out to us.

moclim's People

Contributors

yangziwei96 avatar chenzrg avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.