Hello,thanks for your share. There will be an error that the value o

Hi, thanks for your attention. <a class="user-mention notranslate" data-hovercard-type

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

question about loss about mar HOT 4 CLOSED

kovenyu commented on August 23, 2024

question about loss

from mar.

Comments (4)

KovenYu commented on August 23, 2024

Hi, thanks for your attention. @ShenXianwen

did you set a small batch size? what was the number, exactly? when was the nan appearing, during training after several epochs or right in the first epoch?

sorry I could not try by myself since I have no access to the servers until next week.

from mar.

ShenXianwen commented on August 23, 2024

hello,thanks for your reply.I set the batch size to 64.The nan appearing during training after 2 epochs.I didn't use your prepared data(MSMT17.mat and Market.mat). I followed your steps to run the construct_dataset_Market.m and construct_dataset_MSMT17.m in MATLAB. But I used the prepared_weight.pth.

from mar.

KovenYu commented on August 23, 2024

ok. let me try it next week when I have the access to servers.

from mar.

KovenYu commented on August 23, 2024

Hi @ShenXianwen, it turns out that the nan comes out because the default learning rate is too large for a small batch size like 64. A small batch size indicates a stronger and sharper gradient (large batch size would average over more samples, thus smooth gradient), so we need to turn down the lr. I did not try much, but dividing the lr by 10 would enable you to get rid of this problem.

However we should note that the performance would probably drop, since the distribution estimation is less precise due to small batch size.

from mar.

Recommend Projects

question about loss about mar HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent