Comments (4)
Hi,
Thanks for your interests in our method. I just updated the wiki page with some runtime information.
https://github.com/ma-compbio/Higashi/wiki/Higashi-Usage#runtime-of-higashi
A more detailed discussion on the runtime can be found in our published version of the manuscript. For your machine, I think the bottleneck could be the memory. The consumption of memory depends on the number of cells, sequencing depths of the dataset and the resolution. 32GB should be enough for most datasets at 1Mb resolution.
from higashi.
Yes, training with CPU is feasible.
The formula makes sense. But you do not need to go over the whole training dataset once. Our test showed that, about 45 ~ 60 epochs are enough for most of the dataset.
from higashi.
I'm new in deep learning and i have access to a node only with CPUs (maybe 20 cores and 256 GB). Is training only with CPU feasible? THX!!!
from higashi.
Sorry for asking too many questions in an issue. I don't understand the measuring of runtime from wiki page.
A scHiC dataset (e.g. Nagano et al. dataset) with 1,171 single cells and 56,800 median contacts per cell has about 7e7 observed positive triplets. When training with cd-GNN (k=4
, fast mode) as wiki page shows, it takes 7e7/(192*1000)epoch * 109.6s/epoch = 40000s = 11h
to go over the whole training dataset once. Does the formula make sense?
from higashi.
Related Issues (20)
- error when running scTAD.py HOT 1
- Error running simulated data
- The main_cell.py is so slow HOT 5
- Problem running Higashi on Ramani et al. HOT 5
- What are the configure options mean?
- Stop with OSError when run "higashi_model.train_for_imputation_nbr_0()" HOT 6
- Error in fh_model.prep_dataset() "Pack from sparse mtx to tensors" HOT 2
- ERROE when run process.py: no config file HOT 1
- Predicting structures from embedding vector HOT 2
- wrapper.fast_process_data() - method does not exist HOT 2
- ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (15361,) + inhomogeneous part. HOT 3
- RuntimeError: received 0 items of ancdata
- Higashi stuck on training at higashi_model.train_for_imputation_nbr_0() on SLURM system HOT 7
- ValueError: setting an array element with a sequence. HOT 1
- RuntimeError: CUDA out of memory.
- The Dip-C data processing keeps encountering errors. HOT 3
- how the cell_name in data.txt corresponds to the cell_type in label_info.pickle? HOT 3
- config problem HOT 1
- key error HOT 1
- resuming interrupted training and imputation using Higashi+FastHigashi protocol HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from higashi.