Thanks for your great work, I have some questions about the train dataset: <ol dir

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Thanks for your reply！ For the first question, I mean the wei

Some questions about train data about adelaidepth HOT 6 CLOSED

aim-uofa commented on August 17, 2024

Some questions about train data

from adelaidepth.

Comments (6)

guangkaixu commented on August 17, 2024

Hi @erzhu222 . Thank you for your following!

To my understand, you want to know about the "pre-trained model" loaded before training. The pre-trained model uses the imagenet pre-trained model, as other works did. The models we provide in README.md (res50 and resnext101) are used for inference only, which have been trained following our paper.
Thanks for reminding me. The DIML dataset can be downloaded in the DiverseDepth project, which can be used directly. We can not find the 3D Ken Burns dataset, but it is not so important during training.
You can generate the optical flow between stereo images, and take the horizontal optical flow as the disparity value after filtering out the large portrait optical flow(e.g. bigger than 2 pixels). Note that the generated disparity value is up to scale and may contain noise, so it belongs to the low-quality dataset, and only the ranking loss is used during training.

from adelaidepth.

erzhu222 commented on August 17, 2024

Thanks for your reply！

For the first question, I mean the weights you provided (not the weights of backbone), what dataset are they(resnet50 and resnext101) trained on ?
I see the DIML dataset in the DiverseDepth project, thanks for your sharing.
I have calculated optical flow with Flownet2 and get the horizontal optical flow, how can I filter the large portrait optical flow(e.g. bigger than 2 pixels)
thanks again!

from adelaidepth.

guangkaixu commented on August 17, 2024

For Q1, we trained the models on Taskonomy(part of it), DIML, 3D Ken Burns, Holopix50K and HRWSI, which have been released except for the 3D Ken Burns.

For Q3, you can generate a valid_mask with portrait optical flow smaller than 2 pixels. Then, set the disparity value of invalid mask (~valid_mask) to 0, during which "0" stands for invalid values or regions.

from adelaidepth.

erzhu222 commented on August 17, 2024

Thanks，for Q1, DiverseDepth was not used for train？

from adelaidepth.

guangkaixu commented on August 17, 2024

I double-checked the paper, and DiverseDepth is not employed for training. But more datasets can bring more accuracy and robustness. Just train it with more data as much as possible.

By the way, if you would like to train on large diverse datasets, you may be interested in our BoostingDepth, whose code and data will be released after accepted.

from adelaidepth.

erzhu222 commented on August 17, 2024

I double-checked the paper, and DiverseDepth is not employed for training. But more datasets can bring more accuracy and robustness. Just train it with more data as much as possible.

By the way, if you would like to train on large diverse datasets, you may be interested in our BoostingDepth, whose code and data will be released after accepted.

OK，thanks for your confirmation，I have downloaded the 3D Ken Burns dataset, could you please provide the annotation file when you traind. Thanks again!

from adelaidepth.

Some questions about train data about adelaidepth HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent