Hello, I run the code "train_dise_gta2city.py" following the procedu

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Unable to reproduce the values in the paper about dise-domain-invariant-structure-extraction HOT 6 OPEN

a514514772 commented on September 7, 2024

Unable to reproduce the values in the paper

from dise-domain-invariant-structure-extraction.

Comments (6)

a514514772 commented on September 7, 2024

Hi,

Unfortunately, the training under the pytorch framework is non-deterministic. A relevant issue is here: https://discuss.pytorch.org/t/random-seed-initialization/7854/18 Even though we re-run the code, there still exists some fluctuation, but it is not difficult to get a value higher than 44% (with batch size = 2).

To your guess,

During the final testing, we use 1024x2048. As you tested, we think it doesn't affect too much.
Our apologies. That's a typo. Following the setting of Tsai et al., we use exactly the same initial weights as Deeplabv2 used, so the code is correct. A relevant issue is here: wasidennis/AdaptSegNet#5

The potential reason could be the version of libraries. I am re-running my code with batch size = 1. I hope I can come up with some good results and random seed to help you reproduce the performance.

--- edit ---
In total, the model will be trained for 250000 steps. The smaller batch size you use, the less samples the model sees (i.e. you only see half of the number of data as compared to mine). It could be helpful to train for more steps and adjust the learning rate moderately.

Cheers,
Hui-Po

from dise-domain-invariant-structure-extraction.

subeeshvasu commented on September 7, 2024

Thank you for all the suggestions. I will try them out and see if I can improve the values.Please let me know if you are able to get to the value of 44% with batch size = 1.

from dise-domain-invariant-structure-extraction.

crazygirl1992 commented on September 7, 2024

hello,i run the code,but my device is two 1080GPU,i want to know that what are your device? and how long you cost to train the model? @subeeshvasu

from dise-domain-invariant-structure-extraction.

subeeshvasu commented on September 7, 2024

@crazygirl1992 I was using a single GTX TitanX (12 GB). With this settings, for batch size = 1, training cost was approximately: 2hrs, 20 minutes per 1000 iterations.

from dise-domain-invariant-structure-extraction.

crazygirl1992 commented on September 7, 2024

thank you very much,and can you achive the paper's result now? the training almost 250000 iterations in his paper,and 20*250mins,not 2hrs

from dise-domain-invariant-structure-extraction.

subeeshvasu commented on September 7, 2024

I couldn't get those values. With batch size = 4, one could reproduce the values I guess!.

from dise-domain-invariant-structure-extraction.

Unable to reproduce the values in the paper about dise-domain-invariant-structure-extraction HOT 6 OPEN

Comments (6)

Related Issues (12)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent