Interesting but unusable. Too much false positive. about dirl-inharmonious-region-localization HOT 7 CLOSED

bcmi commented on June 22, 2024

Interesting but unusable. Too much false positive.

from dirl-inharmonious-region-localization.

Comments (7)

jimleungjing commented on June 22, 2024

I am try DIRL with my own little test dataset. In 48 total here is only 2 success and around 45 false positive alarms.

For example: (fake, mask, DIRL result)

Our model is trained on iHarmony4 dataset. It is better to finetune our model on your own dataset.

from dirl-inharmonious-region-localization.

alishan2040 commented on June 22, 2024

@Vadim2S yes fine-tuning is a good idea.
@jimleungjing Did you train your network with different backbone (resnet50 instead of resnet34)? There is an option in code to change backbone but the code did not work when we do so. Is it possible if you can review the problem with changing backbone from resnet34 to resnet50?
ERROR:
weight of size [256, 256, 3, 3], expected input[8, 512, 226, 226] to have 256 channels, but got 512 channels instead.

I understand this is an issue with input dimensions but with the given code, it should work because dimensions of resnet50 are same in both encoder and decoder part.

Thanks!!!

from dirl-inharmonious-region-localization.

jimleungjing commented on June 22, 2024

@Vadim2S yes fine-tuning is a good idea. @jimleungjing Did you train your network with different backbone (resnet50 instead of resnet34)? There is an option in code to change backbone but the code did not work when we do so. Is it possible if you can review the problem with changing backbone from resnet34 to resnet50? ERROR: weight of size [256, 256, 3, 3], expected input[8, 512, 226, 226] to have 256 channels, but got 512 channels instead.

I understand this is an issue with input dimensions but with the given code, it should work because dimensions of resnet50 are same in both encoder and decoder part.

Thanks!!!

I just fixed this bug. However, I suggest that you should reduce the dimension of encoder outputs since the output dimension of resnet50 will lead to super heavy model and take more GPU memory to train. For example, you can apply an extra conv1x1 to reduce the encoder tensor dimension.

from dirl-inharmonious-region-localization.

alishan2040 commented on June 22, 2024

@Vadim2S yes fine-tuning is a good idea. @jimleungjing Did you train your network with different backbone (resnet50 instead of resnet34)? There is an option in code to change backbone but the code did not work when we do so. Is it possible if you can review the problem with changing backbone from resnet34 to resnet50? ERROR: weight of size [256, 256, 3, 3], expected input[8, 512, 226, 226] to have 256 channels, but got 512 channels instead.
I understand this is an issue with input dimensions but with the given code, it should work because dimensions of resnet50 are same in both encoder and decoder part.

Thanks!!!

I just fixed this bug. However, I suggest that you should reduce the dimension of encoder outputs since the output dimension of resnet50 will lead to super heavy model and take more GPU memory to train. For example, you can apply an extra conv1x1 to reduce the encoder tensor dimension.

@jimleungjing Thanks for fixing it. Yes you are right, changing the backbone to ResNet50 results in more memory cost. Could you please guide if I need to add conv1x1 after each encoder layer (output). Wouldn't that change the dimensions in the decoder too? What changes do I have to make for code to work? Also if you've pretrained weights with backbone 50 on iHormany4 datasets, please share?

Thanks again!

from dirl-inharmonious-region-localization.

jimleungjing commented on June 22, 2024

Well, you can add conv1x1 in the decoder to reduce tensor dimension, and rewrite self.dims according to the output dimension of conv1x1, other code should remain the same. You can check out the tensor dimension in debug mode.

from dirl-inharmonious-region-localization.

alishan2040 commented on June 22, 2024

Okay. As I noticed fine-tuning works better than training from scratch. Did you train your model on iHarmony4 dataset with backbone Resnet50?

from dirl-inharmonious-region-localization.

jimleungjing commented on June 22, 2024

Okay. As I noticed fine-tuning works better than training from scratch. Did you train your model on iHarmony4 dataset with backbone Resnet50?

No, you can try it.

from dirl-inharmonious-region-localization.

Interesting but unusable. Too much false positive. about dirl-inharmonious-region-localization HOT 7 CLOSED

Comments (7)

Related Issues (7)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent