Git Product home page Git Product logo

Comments (5)

mihaidusmanu avatar mihaidusmanu commented on June 2, 2024

Dear Rolf. It is true that the off-the-shelf version of MobileNetV2 might not be directly adequate for the D2 approach: as for ResNets, you would need to go to ~1/16th of input resolution to get a similar performance to the VGG counterpart. However, there are a few things that you could try:

  1. As we did for VGG at test time, modifying the network to use dilated convolutions and no stride for the last block before the cut. This would get you to 1/8th of the resolution without any need for re-training.

  2. To further improve the resolution, MobileNetV2 can be modified to obtain a lower stride. One could do this by either moving the strided convolutions at the end of each block (instead of the beginning) or by simply removing the stride of the first block. However, this would require re-training on ImageNet since the receptive fields of all layers would be completely different.

from d2-net.

Lakaemper avatar Lakaemper commented on June 2, 2024

Hello Mihai, thanks for the fast answer and the suggestions. I will try #1, and also test convolutional transpose. I'm shying back a bit from fully retraining MobileNet.
Have a nice day!

from d2-net.

lovekittynine avatar lovekittynine commented on June 2, 2024

Hi, @mihaidusmanu ,in the suplementary material it set K=4 for mining harder samples because the receptive field of conv_3 is 65x65 for VGG.However, doesn't it is 92x92 for conv4_3? so the K may be set 8?

from d2-net.

ZhouYi0824 avatar ZhouYi0824 commented on June 2, 2024

@Lakaemper Hi Lakaemper, I am working about trying to get D2 faster too. So I am wondering about werther it works with MobileNetV2. Thank you!

from d2-net.

mihaidusmanu avatar mihaidusmanu commented on June 2, 2024

I will close this issue since there are no recent updates. Feel free to re-open a new one if you run into any issues with the provided code.

from d2-net.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.