
Comments (7)

piaoxijun commented on May 13, 2024

Same question.
It seems SparseLR in this project also fetches all feature values when it performs a getRow op.

I wonder how many rows a PSModel can support.
How about implementing it with one row per feature, like:
val model = PSModel[DenseDoubleVector](FTRL_WEIGHT, feaNum, 1)
and
val featureValues = model.getRows(indexArr)
Will it be efficient?

Also, what is the difference between rows and columns in the matrix storage? What is their capacity?


facaiy commented on May 13, 2024

@piaoxijun Great idea. It sounds reasonable to use multiple rows to fetch parameters flexibly, if only the row operator is supported.


TAAAN commented on May 13, 2024

@facaiy thanks for your issue.

  1. Dense data:
    For gradient descent LR, we use DenseDoubleVector to represent the data and the model. Each vector is a double array, so the LR model is just one long array; even a super-large model doesn't cost much memory, which is why dense LR can support data of super-large dimension.
    Besides, in practice many elements of the LR model are non-zero. We have also implemented a DenseFloat data type; you can change the LR data type to DenseFloat to save even more memory.

  2. Sparse data:
    For the sparse data types, we use a map to store keys and values, so they cost more memory when the data and model are not very sparse, and they take even more time to compute.

  3. Optimizer:
    For LR, we implemented two optimizers: gradient descent (as you have seen) and ADMM:

    SparseLogisticRegression Line 65

    val (history, z) = ADMM.runADMM(train,
       ...

    In the ADMM optimizer we use LBFGS. Note, however, that LBFGS is not a memory-friendly optimization algorithm.

    We also implemented LR on Spark on Angel, with gradient descent, OWLQN, LBFGS and ADMM.

  4. LR and SparseLR:
    In LR we use a dense data type with gradient descent; in SparseLR we use a sparse data type with ADMM. The separation between the small and the huge model is just an implementation choice. We have implemented DenseDouble / DenseInt / DenseFloat / SparseDouble / SparseFloat ... data types, so you can choose the data type and operation type that fit your data; you can edit LR's code to change its data type to sparse (see the sketch after this list).
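
A minimal sketch of the data-type choice described in points 1, 2 and 4, reusing the PSModel constructor shape already shown in this thread; "lr_weight" and FEA_NUM are placeholder values, and the exact vector class names (DenseFloatVector, SparseDoubleVector) should be checked against your Angel version:

    // Dense double: one contiguous double array; cheap per element, so it
    // scales to very large dimensions (point 1).
    val denseWeight  = PSModel[DenseDoubleVector]("lr_weight", 1, FEA_NUM)

    // Dense float: same layout with half the memory of double (point 1).
    val floatWeight  = PSModel[DenseFloatVector]("lr_weight", 1, FEA_NUM)

    // Sparse double: key -> value map; only worthwhile when the model itself
    // is very sparse, otherwise it costs more memory and time (point 2).
    val sparseWeight = PSModel[SparseDoubleVector]("lr_weight", 1, FEA_NUM)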


TAAAN commented on May 13, 2024

@facaiy @piaoxijun

  1. You can define your PSModel with a sparse double data type, so a pull only fetches the non-zero elements (a rough sketch of this, and of point 2, follows this list).
  2. You can use a psFunc to customize which elements you pull.
  3. @piaoxijun's idea is a great one, but not efficient: since we store the matrix row by row, defining too many rows in a PSModel costs a huge amount of memory.
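
A rough sketch of points 1 and 2, assuming a SparseDoubleVector row type and the getRow call mentioned earlier in this thread; treat the class and method names as assumptions to verify against your Angel version:

    // Define the weight row as sparse, so pulling it only transfers the
    // stored (non-zero) entries rather than a full dense array (point 1).
    val weight = PSModel[SparseDoubleVector]("lr_weight", 1, FEA_NUM)

    // getRow(0) pulls the single sparse weight row from the PS.
    val w = weight.getRow(0)

    // Point 2: a custom psFunc (a user-defined get function that runs on the
    // PS) could restrict the pull even further, e.g. to an explicit index set,
    // instead of fetching the whole row.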


facaiy commented on May 13, 2024

@TAAAN Thanks for your quick reply.

  1. Perhaps you misunderstood my question. I agree that the parameters in LR might be dense, while the training data is not: in practice, high-dimensional data is almost always very sparse. Hence, for the non-zero entries of the data, it is more efficient to fetch only the corresponding entries of lrModel.weight (about 10~15% of the total, or less), then calculate the gradient and update the parameters (sketched after this list).

  2. I agree that multiple rows might not be efficient; it depends on the partition strategy. If, by default, PS parameters are always partitioned by column, then all the one-element rows of such a model would end up on a single machine, which is the worst case.
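
A sketch of the access pattern in point 1; pullWeights is a hypothetical helper here (it could be a custom psFunc or an index-based getRows), not an existing Angel call, and the accessors on the sparse samples are assumptions as well:

    // Collect the distinct non-zero feature indices of the current mini-batch.
    val activeIndices: Array[Int] =
      batch.flatMap(sample => sample.getX.getIndices).distinct

    // Fetch only those entries of lrModel.weight (roughly 10~15% of the
    // dimension, or less), compute the gradient on them, and push back a
    // sparse update.
    val activeWeights = pullWeights(lrModel.weight, activeIndices)  // hypothetical helper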


TAAAN commented on May 13, 2024

@facaiy

  • For the model, you can set it to be dense or sparse.
  • For the training data, it's sparse:
     val batchGD = GradientDescent.miniBatchGD(trainData, lrModel.weight, lr, logLoss,
       batchSize, batchNum)

The feature size of the LabeledData on the Worker (the training data) has no direct relationship with the model on the PSServer; a small sketch follows.
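
A small sketch of that separation, with placeholder names; the LabeledData and vector constructors are written the way they are commonly used in Angel's ml package, but their exact signatures here are assumptions:

    // Worker side: a training sample stays sparse regardless of the model's row type.
    val x = new SparseDoubleVector(feaNum)     // feature vector of one sample
    x.set(42, 1.0)                             // a single non-zero feature
    val sample = new LabeledData(x, 1.0)       // label = 1.0

    // PS side: the weight row can independently be dense (or sparse).
    val weight = PSModel[DenseDoubleVector]("lr_weight", 1, feaNum)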


facaiy commented on May 13, 2024

@TAAAN Thanks for the detailed explanation. However, my point is really not about the model or the data, but about the cost of communication between the worker and the PS. Perhaps we can discuss this further in the future. Anyway, thank you all the same, TAAAN.

