When setting n_jobs=-1 on M

Thanks for raising this <a class="user-mention notranslate" data-hovercard-type="user"

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

I'm attempting to reimplement this paper: <a href="https://www.resea

RuntimeError: Cannot clone object LassoCV(...), as the constructor either does not set or modifies parameter precompute about celer HOT 6 CLOSED

chrism2671 commented on August 23, 2024

RuntimeError: Cannot clone object LassoCV(...), as the constructor either does not set or modifies parameter precompute

from celer.

Comments (6)

mathurinm commented on August 23, 2024

Thanks for raising this @chrism2671, it should be fixed by #295 that I'll merge as soon as the CI is green.

If you use celer in research work, we kindly ask that you cite our papers (see the README) ; if it's in an industrial context, we'd love to hear more about your usecase. You can also have a look at our skglm package that implements many more models and penalties, in particular non convex ones that have better sparsifying properties.

from celer.

chrism2671 commented on August 23, 2024

Wow, that was fast! And here I was struggling to install the deps to do it myself! Thank you so much! :D

from celer.

mathurinm commented on August 23, 2024

@chrism2671 this would really help us:

If you use celer in research work, we kindly ask that you cite our papers (see the README) ; if it's in an industrial context, we'd love to hear more about your usecase. You can also have a look at our skglm package that implements many more models and penalties, in particular non convex ones that have better sparsifying properties.

from celer.

chrism2671 commented on August 23, 2024

I'm attempting to reimplement this paper:

https://www.researchgate.net/publication/314533287_Sparse_Signals_in_the_Cross-Section_of_Returns

On my significantly reduced dataset (1 year of data, 100 columns), this takes my laptop approximately 2-3 days using sklearn, and seems to be about 10 hours using Celer. The paper using R/glmnet, and thanks a supercomputer center in its notes.

I've worked hard to try to accelerate this (even writing my own lasso in KDB/q), and experimenting with Cuda Rapids, but Celer is the fastest by far. In my case, because of the MultiOutputRegressor, I'm doing many millions of small regressions. I do wonder if Python's inefficient multiprocessing is part of the bottleneck.

from celer.

mathurinm commented on August 23, 2024

How many rows do you have in your dataset ?
For such "large n_samples, small n_features" datasets, you should have a look at our GramCD solver in skglm which is super fast : scikit-learn-contrib/skglm#229

from celer.

chrism2671 commented on August 23, 2024

It's the other way unfortunately, (n_samples=30, n_features=300)

from celer.

Recommend Projects

RuntimeError: Cannot clone object LassoCV(...), as the constructor either does not set or modifies parameter precompute about celer HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent