Git Product home page Git Product logo

Comments (6)

guangkaixu avatar guangkaixu commented on August 17, 2024

Hi @erzhu222 . Thank you for your following!

  1. To my understand, you want to know about the "pre-trained model" loaded before training. The pre-trained model uses the imagenet pre-trained model, as other works did. The models we provide in README.md (res50 and resnext101) are used for inference only, which have been trained following our paper.
  2. Thanks for reminding me. The DIML dataset can be downloaded in the DiverseDepth project, which can be used directly. We can not find the 3D Ken Burns dataset, but it is not so important during training.
  3. You can generate the optical flow between stereo images, and take the horizontal optical flow as the disparity value after filtering out the large portrait optical flow(e.g. bigger than 2 pixels). Note that the generated disparity value is up to scale and may contain noise, so it belongs to the low-quality dataset, and only the ranking loss is used during training.

from adelaidepth.

erzhu222 avatar erzhu222 commented on August 17, 2024

Thanks for your reply!

  1. For the first question, I mean the weights you provided (not the weights of backbone), what dataset are they(resnet50 and resnext101) trained on ?
  2. I see the DIML dataset in the DiverseDepth project, thanks for your sharing.
  3. I have calculated optical flow with Flownet2 and get the horizontal optical flow, how can I filter the large portrait optical flow(e.g. bigger than 2 pixels)
    thanks again!

from adelaidepth.

guangkaixu avatar guangkaixu commented on August 17, 2024

For Q1, we trained the models on Taskonomy(part of it), DIML, 3D Ken Burns, Holopix50K and HRWSI, which have been released except for the 3D Ken Burns.

For Q3, you can generate a valid_mask with portrait optical flow smaller than 2 pixels. Then, set the disparity value of invalid mask (~valid_mask) to 0, during which "0" stands for invalid values or regions.

from adelaidepth.

erzhu222 avatar erzhu222 commented on August 17, 2024

Thanks,for Q1, DiverseDepth was not used for train?

from adelaidepth.

guangkaixu avatar guangkaixu commented on August 17, 2024

I double-checked the paper, and DiverseDepth is not employed for training. But more datasets can bring more accuracy and robustness. Just train it with more data as much as possible.

By the way, if you would like to train on large diverse datasets, you may be interested in our BoostingDepth, whose code and data will be released after accepted.

from adelaidepth.

erzhu222 avatar erzhu222 commented on August 17, 2024

I double-checked the paper, and DiverseDepth is not employed for training. But more datasets can bring more accuracy and robustness. Just train it with more data as much as possible.

By the way, if you would like to train on large diverse datasets, you may be interested in our BoostingDepth, whose code and data will be released after accepted.

OK,thanks for your confirmation,I have downloaded the 3D Ken Burns dataset, could you please provide the annotation file when you traind. Thanks again!

from adelaidepth.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.