The egem from sriram-lab

Fix leaks in exchange and demand reactions

The exchange and demand reactions have leaks that need to be resolved.

Correlation plots between gene expression and histone markers

Fix medium conditions

Describe the bug
@ScottCampit left out some medium components because few cell lines were grown in those medium conditions and their glucose concentration was similar to that of another medium condition. It is now apparent that these assumptions do not hold, so the medium conditions need to be fixed.

Steps to take

Fix medium conditions for those that point to either RPMI or DMEM
Add new DMEM conditions that include:
- (+) gln
- (-) gln
- (++) glc
- (+) glc
- (-) glc
- (+) pyr
- (-) pyr
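
One way to enumerate the new DMEM conditions is to cross the listed levels. The sketch below assumes a full factorial design (every gln level paired with every glc and pyr level), which the issue does not state. The `LEVELS` dictionary, the function name, and the label format are illustrative only, and no concentrations are implied.

```python
from itertools import product

# Levels copied from the list above: "++" high, "+" present, "-" absent.
LEVELS = {
    "gln": ["+", "-"],
    "glc": ["++", "+", "-"],
    "pyr": ["+", "-"],
}


def dmem_condition_names(levels=LEVELS):
    """Return one label per combination of levels (assumed full factorial)."""
    components = list(levels)
    names = []
    for combo in product(*(levels[c] for c in components)):
        parts = [f"({lvl}) {comp}" for comp, lvl in zip(components, combo)]
        names.append("DMEM " + ", ".join(parts))
    return names


conditions = dmem_condition_names()
print(len(conditions))   # number of generated condition labels
print(conditions[0])     # first label
```

If only a subset of combinations is actually needed, the list can be filtered after generation rather than changing the enumeration.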

Complete end of summer checklist

To do:

clean up the code into manageable snippets (4-5hrs)
Turn all code that is not already in a function into functions. Clean up the inefficient parts of your code. Format your code to follow PEP 8.
generalize the code so that it works with any dataset for gene expression, proteomics, whatever (1-2hrs)
Same as above, but in addition to making it a function, have the script automate several processes, including mapping gene symbols.
vectorize the code (1-2 hrs)
There are tons of instances where you're dynamically updating variables. This is computationally inefficient. Instead, we should use vectorized implementations via numpy and pandas.

A useful resource: https://towardsdatascience.com/python-vectorization-5b882eeef658
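
As a minimal illustration of the vectorization item, the sketch below computes the same z-score twice: once by growing a list inside a loop, and once as a single numpy array expression. The function names and the sample values are made up for the example and are not taken from the repository.

```python
import numpy as np


def zscore_loop(values):
    """Loop version: the result list is updated one element at a time."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = var ** 0.5
    out = []
    for v in values:
        out.append((v - mean) / std)
    return out


def zscore_vectorized(values):
    """Vectorized version: one array expression, no Python-level loop."""
    arr = np.asarray(values, dtype=float)
    # ndarray.std() uses the population formula by default, matching the loop.
    return (arr - arr.mean()) / arr.std()


expression = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # illustrative values
assert np.allclose(zscore_loop(expression), zscore_vectorized(expression))
```

The same pattern carries over to pandas, where column-wise arithmetic on a DataFrame replaces row-by-row updates.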

metabolic_sensitivity.m dynamic range

Things to do:

Capture epsilons for each reaction of interest with the highest dynamic range
Set epsilons for each reaction to that value
Obtain values and evaluate results
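
The first two steps can be sketched as below. The sketch is in Python rather than MATLAB, and it assumes that "dynamic range" means the spread (max minus min) of a reaction's flux across conditions at a given epsilon. The issue does not define the term, so that definition, the nested-dictionary layout, and the function name are all assumptions.

```python
def best_epsilon_per_reaction(flux):
    """Pick, per reaction, the epsilon with the widest flux spread.

    flux[reaction][epsilon] is a list of flux values across conditions.
    Returns {reaction: (epsilon, dynamic_range)}, where dynamic_range is
    max - min of those values (an assumed definition).
    """
    best = {}
    for reaction, by_epsilon in flux.items():
        ranges = {eps: max(vals) - min(vals) for eps, vals in by_epsilon.items()}
        eps = max(ranges, key=ranges.get)
        best[reaction] = (eps, ranges[eps])
    return best


# Made-up numbers, for illustration only.
example = {
    "rxn_A": {1e-3: [0.0, 0.1, 0.2], 1e-2: [0.0, 0.5, 1.5]},
    "rxn_B": {1e-3: [1.0, 3.0], 1e-2: [2.0, 2.5]},
}
chosen = best_epsilon_per_reaction(example)
```

Each reaction's chosen epsilon can then be written back into the per-reaction epsilon vector before rerunning the analysis.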

Issue with the SRA case in metabolic_sensitivity

Describe the bug
The reduced cost line in metabolic sensitivity returns an output error. The value that I want stored (which in some cases is 0) is returned as an empty array. This causes a size error.

To Reproduce
Run the code as written in the scampit branch

Expected behavior
I would expect the empty value to be filled in as 0 when the reduced cost is in fact 0.
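
The expected behavior amounts to a guard that substitutes 0 for an empty result before it is stored. The sketch below shows that guard in Python rather than in the MATLAB code itself. The function name is hypothetical, and it assumes that an empty result really does mean a reduced cost of 0, which should be confirmed against the solver output.

```python
def reduced_cost_or_zero(values):
    """Return the first reduced cost, or 0.0 when the lookup came back empty.

    Assumes an empty result stands for a reduced cost of zero.
    """
    values = list(values)
    return float(values[0]) if values else 0.0


stored = [reduced_cost_or_zero(v) for v in ([], [0.25], [0.0])]
```

Storing a scalar in every case keeps the output array at a fixed size, which avoids the size error.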

- [ ] Add individual histone markers as individual nodes in the metabolic model

- [ ] Be able to add new lipid sources to the metabolic model if they are among the medium components

Look through the medium lists to see whether any media contain lipids
If so, be able to modify the substrate uptake rate

Exploratory analysis of histone marker and metabolic gene expression

Mine metabolic gene data from GEO
Create heatmap showing correlation between gene expression and histone marker from the CCLE dataset
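
The values behind such a heatmap can be sketched as a gene-by-marker correlation table. The example assumes Pearson correlation (the issue does not say which measure is intended) and assumes that each gene and each marker is a list of values over the same cell lines in the same order. All names and numbers are placeholders, not CCLE data.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Raises ZeroDivisionError if either sequence has zero variance.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def correlation_table(genes, markers):
    """Return {gene: {marker: r}} for every gene and marker pair."""
    return {
        gene: {marker: pearson(g_vals, m_vals) for marker, m_vals in markers.items()}
        for gene, g_vals in genes.items()
    }


# Placeholder values across four hypothetical cell lines.
genes = {"gene_1": [1.0, 2.0, 3.0, 4.0]}
markers = {"marker_1": [2.0, 4.0, 6.0, 8.0], "marker_2": [4.0, 3.0, 2.0, 1.0]}
table = correlation_table(genes, markers)
```

The nested dictionary maps directly onto a heatmap, with genes as rows and histone markers as columns.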

FBA case in metabolic_sensitivity.m does not output correct metabolic fluxes

I wrote code that outputs the metabolic fluxes for various medium conditions in three separate cases. One case uses FBA as the optimization method for calculating the metabolic fluxes. However, its output does not match the results in Shen et al., 2019.

To test:

Use the acetylation model in the same analysis
- If there is an error, then there is an issue with your code
Use the new eGEM in the Shen et al. analysis
- If there is an error, then there is an issue with your model

Stratify CCLE cell lines into a high flux group and a low flux group to perform analyses similar to those in Shen et al.
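
A minimal sketch of the stratification, assuming a median split on a single flux value per cell line. The issue does not say how the groups should be defined, so the cutoff is a placeholder, and the function name and cell line labels are hypothetical.

```python
from statistics import median


def stratify_by_flux(flux_by_cell_line):
    """Split cell lines into high and low flux groups at the median flux.

    Cell lines exactly at the median go to the low group (an arbitrary choice).
    """
    cutoff = median(flux_by_cell_line.values())
    high = sorted(name for name, f in flux_by_cell_line.items() if f > cutoff)
    low = sorted(name for name, f in flux_by_cell_line.items() if f <= cutoff)
    return high, low


# Made-up flux values for four hypothetical cell lines.
fluxes = {"line_a": 0.1, "line_b": 0.9, "line_c": 0.4, "line_d": 0.6}
high_group, low_group = stratify_by_flux(fluxes)
```

A tertile or quartile split would follow the same shape if the middle of the distribution needs to be excluded.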

- [ ] Use RECON3 as the base metabolic model for lipid reactions

Show all reactions even when performing histone only optimizations

Add PTM nodes into metabolic network

To add these metabolic nodes, I need to look for literature evidence about

the histone PTM, and
the readers and writers that influence the histone markers

Lauren:

H3K27

Scott:

H3K4
H3K9

Later...

H3K56
H3K79

Histone reader, writer, and eraser expression and individual histone marker expression

Now that you have extracted more genes that correspond to histone writers and erasers, there are several more tasks we can do for more in-depth analyses:

Extract synonyms for each writer and eraser. This will increase the number of hits in your analysis
- Store as a json file and read into Python as a dictionary. This is the first step and will help you make this into a function
- Write your code as a function that reads in either txt or json files and outputs the visualization as an svg or png file. Make it so that users can specify whether they want an svg or png output.
Get reader genes and see if there is a correlation. My guess is that there will be a (weak) correlation
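
The file-handling part of the function described above might look like the sketch below. It covers only reading the synonyms and validating the requested output format, with the plotting left out. The file layouts are assumptions: a json file holding a `{gene: [synonyms]}` mapping, or a txt file with one tab-separated line per gene. The function names are hypothetical.

```python
import json
from pathlib import Path

ALLOWED_FORMATS = {"svg", "png"}


def load_synonyms(path):
    """Read gene synonyms into a dict of {gene: [synonym, ...]}.

    A .json file is expected to hold that mapping directly. Any other file
    is read as text with one tab-separated line per gene: the gene symbol
    first, then its synonyms. Both layouts are assumptions for this sketch.
    """
    path = Path(path)
    if path.suffix.lower() == ".json":
        with path.open() as handle:
            return json.load(handle)
    synonyms = {}
    with path.open() as handle:
        for line in handle:
            fields = line.rstrip("\n").split("\t")
            if fields[0]:  # skip blank lines
                synonyms[fields[0]] = [f for f in fields[1:] if f]
    return synonyms


def output_path(stem, fmt="svg"):
    """Build the figure file name, rejecting anything other than svg or png."""
    fmt = fmt.lower()
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"fmt must be one of {sorted(ALLOWED_FORMATS)}, got {fmt!r}")
    return f"{stem}.{fmt}"
```

Validating the format up front means a typo in the output type fails before any analysis is run.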

Metabolic sensitivity heatmaps

ISSUE WITH SOME METABOLIC REACTIONS
Some metabolic reactions that are expected to output a flux value return 0 in the metabolic_sensitivity.m code.

To Reproduce
Steps to reproduce the behavior:

Go to line 145 of run_all.m
Run the metabolic_sensitivity function
It returns errors for the Excel sheets, and the resulting heatmap does not contain the metabolic fluxes that we expect.

Fix RECON1 RxnGeneMat

The RxnGeneMat does not accurately map genes and reactions, probably because old identifiers are used. Need to fix this now.

- [ ] Add other acylation reactions

Examples of other acylation reactions

Butyrylation
Hydroxybutyrylation

Adding individual histone writer and eraser reactions to metabolic model

Fill in the metabolic map and write code to add reactions to RECON1 using the metabolic map as an xlsx or txt file.

Scott: Get both acetylation and methylation cases

H3K4
H3K9

Lauren: methylation cases are a must, acetylation is optional

H3K27

To pick up later:

H3K56
H3K79
Acetylation cases

Create a normoxic and hypoxic metabolic model

Make module for seashore-ludlow analysis

Steps for drug sensitivity analysis:

Make the code from Shen et al. into a MATLAB function
Put the AUC data in the right format in a MATLAB variable
Ensure that H3 relative values are formatted correctly
Run with bulk methylation genome scale metabolic model unconstrained (without inhibitor gene expression)
Analyze the results

Validation steps:

Use the Le Roy et al. proteomics data instead of the H3 relative values from the CCLE database
- Create a MATLAB variable containing the methylation data

Next steps:

Add individual methylation reactions for specific metabolic markers onto the genome-scale model and save as a new metabolic model
Run same analysis to see if we can predict drug sensitivity for specific histone markers rather than bulk methylation.
