greenelab / biobombe Goto Github PK

View Code? Open in Web Editor NEW

63.0 6.0 25.0 2.33 GB

BioBombe: Sequentially compressed gene expression features enhances biological signatures

Home Page: https://greenelab.github.io/BioBombe/

License: BSD 3-Clause "New" or "Revised" License

Jupyter Notebook 72.80% Python 1.57% R 1.29% Shell 0.07% HTML 24.27%

gene-sets msigdb gene-expression tcga compression hetnet biobombe network autoencoder

biobombe's People

Contributors

Stargazers

Watchers

biobombe's Issues

Restructure visualize_genesets.R

Currently the results are being read in for all gene sets. They should be read in once, and then visualized and subset.

Change y axis label for Feature Rank Plot

The plot generated here needs an updated y axis label. It should read: "Absolute Rank Enrichment"

Add t-test for NBL Cell lines in MYCN amplification signature application

Update Supplementary Figure 3 - Correlation Summary

Switch panels c and d with a and b

Validate MYCN status in NBL Cell Lines

Related with #163

Resources include https://www.nature.com/articles/sdata201733/tables/3 and https://figshare.com/articles/STAR-reads/7613975

Update GTEx Supplementary Figure

Currently, A and B are plotted on the same row with two columns. I need to make two rows and 1 column instead

Add GTEx Module README and analysis bash script

Update GTEx Geneset Panels C and D

After changes are merged in #125, the function plot_gene_set() will change. I will need to rerun the visualize notebook in the gtex module after the update

Find Sex Feature in TCGA

Related to #163 as was previously done in GTEx. Also, box plots can be changed to display different correlations with transformed data in both cases as well

Add Ensemble Analysis to TCGA FIgure

We are interested in comparing ensemble VAE performance to ensemble multi-algorithm performance in cancertype and mutation prediction.

I removed the colorblindr dependency in #13 because the package is not currently a conda recipe. Adding back this dependency will require a conda-forge pull request that I will save for a later date.

Update TCGA Supplementary Figure

Switch panels A and B - also, the two panels currently in A are not in the correct order

Add Vince as an Author

Will need to update the author list (and title) on the website once new preprint is posted

related to #181

cc @vincerubinetti

Rename Module 6 to `6.biobombe-projection`

Remove Redundant Supplementary TCGA Figures

A couple figures are redundant - added with different names

Add Supplementary Table for Signature Genes

Need to track Neutrophils_HPCA_2 and Monocytes_FANTOM_2 genes

Remake Supplementary Figure 1

Need to update with strip text background color - also should make it so it can be in portrait orientation

Visualize Max Score Feature by Dimension + Algorithm

I have biobombe scores for many datasets by collections - plot z dimension of max feature

Move E and F of GTEx Figure to Supplement

Sample Correlation for HGSC Subtypes

Could be interesting to observe the correlation pattern for OV

https://github.com/greenelab/interpret-compression/blob/master/4.analyze-components/figures/TCGA/sample-correlation/sample-type/sample-correlation_OV_TCGA_signal_pearson.png?raw=true

Split out by HGSC subtype assignment

Update Figure 5 - TCGA Classify

Should add points representing raw data in panel C. What is the performance and percent zero coefficients?

Describe Directory Structure in Module README

a more complete description of the directory tree structure will help orient a new viewer to the results.

Update Figure 6 - Coverage Analysis

I don't think I need to label all facets - probably just A, B, and C is sufficient

Reorder Modules to Squish in New Module 7

I am adding a new module 7 in #71 - i will need to update the other module numbers (GTEX and TCGA)

Convert z score to p value and bonferroni correct in k dimension by geneset top feature plot

related to #108

Split GTEx Figure G and H into New Supplementary Figure

This Figure is large. Panels G and H can be moved to a supplement.

Update GTEx Figure to Include Correlation

Add correlation estimates for panels E and F

Update GTEx Supplementary Figure

Need to lower case panel labels and move z to k

https://github.com/greenelab/BioBombe/blob/master/8.gtex-interpret/1.visualize-gtex-blood-interpretation.ipynb

Make Mission of Predicting Cancer-Types and Mutations Clear

Need to write carefully about this point in README (see #90) and especially in the manuscript

Add analysis to supplementary TCGA classification

Need to predict with top 1 feature
also determine which z the features are coming from

Can also split out "monocyte" vs other in that plot

Is it worth creating a lookup table with colors -- HEX code as you've done before?

It will be good to update HEX colors in a table lookup. Also related to #14

greenelab / biobombe Goto Github PK

biobombe's People

Contributors

Stargazers

Watchers

Forkers

biobombe's Issues

Recommend Projects

Recommend Topics

Recommend Org