Hi, Thanks for sharing the inference code. When the model infers the depth for the

Output depth data format about mannequinchallenge HOT 5 OPEN

google commented on July 27, 2024

Output depth data format

from mannequinchallenge.

Comments (5)

fcole commented on July 27, 2024

Yes, the output is a floating-point value. Each output map is scaled by an unknown factor relative to the ground truth (i.e., it's not in units of meters or anything like that).
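Because the output is floating-point, converting it to an 8-bit image discards precision and is only suitable for viewing. The following is a minimal sketch, not code from this repository: `to_uint8_visualization` and the `pred_depth` array are hypothetical stand-ins for a real prediction.

```python
import numpy as np

def to_uint8_visualization(depth):
    """Min-max normalize a float depth map to 0..255, for viewing only."""
    depth = np.asarray(depth, dtype=np.float32)
    lo, hi = float(depth.min()), float(depth.max())
    if hi - lo < 1e-12:
        # Constant map: nothing to stretch, return all zeros.
        return np.zeros(depth.shape, dtype=np.uint8)
    scaled = (depth - lo) / (hi - lo)
    return np.round(scaled * 255.0).astype(np.uint8)

# Hypothetical stand-in for a predicted depth map (not real model output).
pred_depth = np.linspace(0.5, 4.0, 12, dtype=np.float32).reshape(3, 4)
vis = to_uint8_visualization(pred_depth)
```

To keep the full floating-point values for later processing, the array can be written with `np.save` instead of being quantized.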


fcole commented on July 27, 2024

The model estimates depth up to an unknown scale parameter, so the units themselves are not that meaningful. The error metrics we use for evaluation measure the accuracy of the depth map up to scale. This is a consequence of the training data (multi-view stereo) also having a scale ambiguity.
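One way to picture "accuracy up to scale" is the sketch below. It is an illustration under stated assumptions, not the evaluation code used for this model: the function names are hypothetical, median alignment is just one common way to pick the scale, and the log-domain error shown is a generic scale-invariant RMSE.

```python
import numpy as np

def align_scale_median(pred, gt):
    """Scale pred so its median matches gt's median over valid pixels."""
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    mask = (pred > 0) & (gt > 0)  # assumption: non-positive depth is invalid
    scale = np.median(gt[mask]) / np.median(pred[mask])
    return scale * pred, scale

def scale_invariant_rmse_log(pred, gt):
    """RMSE of log-depth differences after removing their mean.

    Multiplying pred by any positive constant shifts every log difference
    by the same amount, so the result does not depend on the global scale.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    mask = (pred > 0) & (gt > 0)
    d = np.log(pred[mask]) - np.log(gt[mask])
    d = d - d.mean()
    return float(np.sqrt(np.mean(d ** 2)))

# Hypothetical data: a ground truth and a prediction that is off by a
# constant factor only.
rng = np.random.default_rng(0)
gt_depth = rng.uniform(1.0, 10.0, size=(4, 5))
pred_depth = gt_depth / 3.7

aligned, scale = align_scale_median(pred_depth, gt_depth)
err = scale_invariant_rmse_log(pred_depth, gt_depth)
```

In this toy case the prediction differs from the ground truth only by a constant factor, so the recovered `scale` is about 3.7 and `err` is essentially zero even though the raw values disagree everywhere.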


astro-fits commented on July 27, 2024

Hi fcole,
Do you mean that a depth map predicted by the pre-trained model is scaled by an unknown factor relative to the "depth ground truth"?


jasjuang commented on July 27, 2024

Hi, is the depth image predicted by the network a 32-bit continuous floating-point image, or is it just an 8-bit image?


astro-fits commented on July 27, 2024

Thanks for your reply. When I train a model, I find that this scaling factor is correlated with how the depth ground truth is normalized (e.g., to a range of 1 to 3 meters or 1 to 10 meters). The factor also grows as the number of training epochs increases.
