Comments (7)

jimleungjing commented on June 22, 2024

I tried DIRL on my own small test dataset. Out of 48 samples in total, there are only 2 successes and around 45 false positive alarms.

For example: (fake, mask, DIRL result) military118_1_1

Our model is trained on the iHarmony4 dataset. It is better to fine-tune it on your own dataset.

from dirl-inharmonious-region-localization.

alishan2040 commented on June 22, 2024

@Vadim2S Yes, fine-tuning is a good idea.
@jimleungjing Did you train your network with a different backbone (resnet50 instead of resnet34)? There is an option in the code to change the backbone, but the code fails when we do so. Could you look into the problem with changing the backbone from resnet34 to resnet50?
ERROR:
weight of size [256, 256, 3, 3], expected input[8, 512, 226, 226] to have 256 channels, but got 512 channels instead.

I understand this is an issue with input dimensions, but with the given code it should work, because the resnet50 dimensions are the same in both the encoder and decoder parts.
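
For reference, the same kind of message can be reproduced in isolation. This is only a minimal sketch, not the DIRL code: it assumes that some 3x3 convolution was built for 256 input channels and then receives a 512-channel feature map. The helper name `channel_mismatch_message` and the small tensor sizes are made up for illustration.

```python
import torch
import torch.nn as nn


def channel_mismatch_message(expected_in=256, actual_in=512):
    """Feed a conv built for `expected_in` channels a tensor with `actual_in` channels."""
    conv = nn.Conv2d(expected_in, 256, kernel_size=3, padding=1)
    x = torch.randn(1, actual_in, 16, 16)  # small spatial size keeps the demo cheap
    try:
        conv(x)
    except RuntimeError as err:
        return str(err)  # PyTorch reports the expected vs. received channel count
    return None  # no error: the channel counts agree


print(channel_mismatch_message())          # message mentions 256 and 512 channels
print(channel_mismatch_message(256, 256))  # None
```

Whichever layer raises the error in the real model, the cause would be of this form: a layer whose weights were sized for one channel count is fed a tensor with another.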

Thanks!!!

jimleungjing commented on June 22, 2024

I just fixed this bug. However, I suggest reducing the dimension of the encoder outputs, since the output dimensions of resnet50 make the model very heavy and require more GPU memory to train. For example, you can apply an extra conv1x1 to reduce the encoder tensor dimension.
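
To put rough numbers on that, here is a back-of-the-envelope sketch. The per-stage channel widths are the standard ones for the two backbones (resnet50's bottleneck blocks output four times the width of resnet34's basic blocks). The decoder, with a single 3x3 convolution per stage, is a hypothetical stand-in and not the actual DIRL decoder.

```python
# Output channels of the four residual stages (layer1..layer4).
RESNET34_DIMS = [64, 128, 256, 512]
RESNET50_DIMS = [256, 512, 1024, 2048]


def conv_params(in_ch, out_ch, kernel):
    """Weight count of a bias-free 2D convolution."""
    return in_ch * out_ch * kernel * kernel


def decoder_params(dims):
    """Hypothetical decoder: one 3x3 conv per stage that keeps the channel count."""
    return sum(conv_params(d, d, 3) for d in dims)


def reduced_decoder_params(encoder_dims, reduced_dims):
    """Same decoder, preceded by a 1x1 conv per stage that shrinks the channels."""
    reducers = sum(conv_params(i, o, 1) for i, o in zip(encoder_dims, reduced_dims))
    return reducers + decoder_params(reduced_dims)


print(decoder_params(RESNET34_DIMS))                         # 3133440
print(decoder_params(RESNET50_DIMS))                         # 50135040
print(reduced_decoder_params(RESNET50_DIMS, RESNET34_DIMS))  # 4526080
```

In this toy setting, the 1x1 reducers bring the weight count down to about 1.4 times the resnet34 figure, instead of 16 times without them.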

alishan2040 commented on June 22, 2024

@jimleungjing Thanks for fixing it. Yes, you are right: changing the backbone to ResNet50 increases the memory cost. Could you advise whether I need to add a conv1x1 after the output of each encoder layer? Wouldn't that change the dimensions in the decoder too? What changes do I have to make for the code to work? Also, if you have pretrained weights with the ResNet50 backbone on the iHarmony4 dataset, could you please share them?

Thanks again!

jimleungjing commented on June 22, 2024

Well, you can add a conv1x1 in the decoder to reduce the tensor dimension, and rewrite self.dims according to the output dimension of the conv1x1. The rest of the code should remain the same. You can check the tensor dimensions in debug mode.
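
A minimal sketch of that suggestion, under stated assumptions: `ToyDecoder`, `record_shapes`, and the reduced widths are hypothetical, and `self.dims` here only imitates the attribute mentioned above, since the real decoder is not shown in this thread. Forward hooks stand in for inspecting the shapes in a debugger.

```python
import torch
import torch.nn as nn


class ToyDecoder(nn.Module):
    """Hypothetical decoder: 1x1 reduction followed by a 3x3 conv per stage."""

    def __init__(self, encoder_dims, reduced_dims):
        super().__init__()
        self.reduce = nn.ModuleList(
            [nn.Conv2d(i, o, kernel_size=1) for i, o in zip(encoder_dims, reduced_dims)]
        )
        # Rewritten to match the conv1x1 outputs; later layers are sized from it.
        self.dims = list(reduced_dims)
        self.refine = nn.ModuleList(
            [nn.Conv2d(d, d, kernel_size=3, padding=1) for d in self.dims]
        )

    def forward(self, feats):
        return [ref(red(f)) for red, ref, f in zip(self.reduce, self.refine, feats)]


def record_shapes(model):
    """Attach forward hooks that store every Conv2d output shape by module name."""
    shapes, handles = {}, []
    for name, module in model.named_modules():
        if isinstance(module, nn.Conv2d):
            def hook(mod, inputs, output, name=name):
                shapes[name] = tuple(output.shape)
            handles.append(module.register_forward_hook(hook))
    return shapes, handles


encoder_dims = [256, 512, 1024, 2048]  # resnet50 stage outputs
reduced_dims = [64, 128, 256, 512]     # assumed targets (resnet34 widths)
decoder = ToyDecoder(encoder_dims, reduced_dims)
shapes, handles = record_shapes(decoder)

# Dummy feature maps with halving resolution, standing in for encoder outputs.
feats = [torch.randn(1, c, 32 // 2 ** i, 32 // 2 ** i) for i, c in enumerate(encoder_dims)]
with torch.no_grad():
    outs = decoder(feats)

for name, shape in shapes.items():
    print(name, shape)
for handle in handles:
    handle.remove()  # detach the hooks once the shapes are recorded
```

Only the reducers know about the encoder widths; everything after them is sized from `self.dims`, which is why the rest of the code can stay unchanged.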

alishan2040 commented on June 22, 2024

Okay. As I noticed, fine-tuning works better than training from scratch. Did you train your model on the iHarmony4 dataset with a ResNet50 backbone?

jimleungjing commented on June 22, 2024

No, you can try it.
