
Comments (8)

cjlin1 commented on July 18, 2024

The main issue is that you have too little data. If -b 1 is used, which enables probabilistic outputs, we internally run a cross-validation process, so there is some randomness. To get deterministic results, either fix the seed or, if probability outputs are not needed, remove -b 1.

from libsvm.

giulio-datamind commented on July 18, 2024

@cjlin1 thank you very much for your reply.

In my case I need probabilistic results, so option -b 1 is mandatory.

Yes, I could change the seed to make this particular example work, but in general I cannot consider this a solution, since other input data could run into the same low-accuracy problem.

Why do you think that constraining the sigmoid function to be always increasing (given that we expect high probabilities for samples labeled +1 and low probabilities for those labeled -1) could not be a solution? Are there any drawbacks in somehow constraining probA < 0 inside the sigmoid_train function?

cjlin1 commented on July 18, 2024

I think we do have that the sigmoid is always increasing, so I don't understand your question.

giulio-datamind commented on July 18, 2024

I'm sorry: I probably confused myself with the sign of probA (I have edited the messages above to fix them). Let me try to explain better in other words.

Consider the input data attached to my first message. Working on this data, I found that by setting (at startup) the random generator seed to some integer, the resulting trained probabilistic model normally (i.e., for about 96% of these seeds) has 100% accuracy. However, for some seeds (only about 3.5%; an example is srand(42) on my machine) the trained model has an accuracy of 0%.

I noticed that the 0%-accuracy models have a positive value of probA, while the 100%-accuracy models have a negative one. The sigmoid function is defined as SF(x) = 1/(1+exp(probA*x+probB)), where x is the decision value. I stated that 0%-accuracy models are associated with a decreasing sigmoid function because, as x tends to +infinity, SF(x) tends to 1 if probA < 0 and to 0 if probA > 0.

Considering that we expect high probabilities (i.e., high values of SF(x)) for samples labeled +1 and low probabilities for those labeled -1, I suspect there is room for improvement if we constrain probA to always be lower than 0.

I attach the adaptation of svm-train.c that I used to make the experiments, hoping it can help.

cjlin1 commented on July 18, 2024

giulio-datamind commented on July 18, 2024

Yes, you are correct.

By simply replicating the input data 10 times, the input file becomes like this; with this input, all of the 1000 seeds I tried lead to a 100%-accuracy model.

I think, however, that there is no reason not to try to directly improve the algorithm so that it also works better on lower-cardinality datasets, as is often the case in practice.

giulio-datamind commented on July 18, 2024

I tried to impose the constraint probA < 0 by adding the line

newA = newA > -eps ? -2 * eps - newA : newA;

immediately after the

newA = A + stepsize * dA;

in the backtracking loop of the sigmoid_train function. Furthermore, I set the initial value to A = 1 instead of A = 0.

In practice, I implemented the constraint by reflecting, at every iteration, the point (A, B) of the parameter search space around the line A = -eps.

With these changes, even if the random seed choice is unfortunate, the accuracy of the trained models never falls below 50%. This happens because, in the worst case, the samples are all classified into the same class with a very flat sigmoid (when A is near 0); but, unlike before, the classification can no longer be opposite to the labeling.

Are there any disadvantages I didn't foresee in these modifications?

cjlin1 commented on July 18, 2024
