Git Product home page Git Product logo

Comments (4)

Nik-V9 avatar Nik-V9 commented on July 29, 2024

Hi, yes, it is possible to extend our current RGB-D implementation to work with a Stereo setup. Support for two images as input will require significant restructuring of the code. Hence, this is currently not on our radar.

from splatam.

darshanAtSwaayatt avatar darshanAtSwaayatt commented on July 29, 2024

Hi @Nik-V9 ,

I have a stereo data and used "https://wiki.ros.org/stereo_image_proc" ROS package to generate depth images.
I gave this data as input to the Splatam code.
I updated the code with necessary scripts like the data.yaml, config script, dataset class, etc. but the output received is not proper.
For the below scene (data is captured using the stereo setup mounted on a car. So the camera motion is in the forward direction only)
left_4881

Corresponding depth image
disp_4881

the output generated was
FRONT VIEW
image

SIDE VIEW
image

TOP VIEW
image

But using Iphone 12 pro data, the output was proper.

The config files updated by me for above data are in below zip:
Splatam_updated_files.zip

Also wanted to understand how I can find "png_depth_scale" parameter for my data?
As far as my understanding I can use below formula:
image

where

  • f is the focal length
  • B is the baseline distance between the stereo cameras,
  • D_image : is the pixel value in the depth image.
  • Z_meters : Real world depth in meters.

Z_meters will be the png_depth_scale for my data. But this will not be constant, as Z_meters is inversely proportional to D_image.

Can you tell what I am doing wrong?
Any suggestions to fix my problem.

Thanks and Regards,
@darshanAtSwaayatt

from splatam.

darshanAtSwaayatt avatar darshanAtSwaayatt commented on July 29, 2024

Hi @Nik-V9 ,

I have a stereo data and used "https://wiki.ros.org/stereo_image_proc" ROS package to generate depth images. I gave this data as input to the Splatam code. I updated the code with necessary scripts like the data.yaml, config script, dataset class, etc. but the output received is not proper. For the below scene (data is captured using the stereo setup mounted on a car. So the camera motion is in the forward direction only) left_4881

Corresponding depth image disp_4881

the output generated was FRONT VIEW image

SIDE VIEW image

TOP VIEW image

But using Iphone 12 pro data, the output was proper.

The config files updated by me for above data are in below zip: Splatam_updated_files.zip

Also wanted to understand how I can find "png_depth_scale" parameter for my data? As far as my understanding I can use below formula: image

where

  • f is the focal length
  • B is the baseline distance between the stereo cameras,
  • D_image : is the pixel value in the depth image.
  • Z_meters : Real world depth in meters.

Z_meters will be the png_depth_scale for my data. But this will not be constant, as Z_meters is inversely proportional to D_image.

Can you tell what I am doing wrong? Any suggestions to fix my problem.

Thanks and Regards, @darshanAtSwaayatt

Hi @Nik-V9 , Can you please help.

from splatam.

Nik-V9 avatar Nik-V9 commented on July 29, 2024

Hi, if the depth image you are saving has the pixel values in meters, the png_depth_scale will be 1.

Essentially png_depth_scale is a parameter to convert pixel values of a depth image to metric depth. In your case, after computing the metric depth if you are saving the image as is, the png_depth_scale will be 1.

Also based on the stereo depth image shared by you, there seems to be a lot of holes in the depth map. Without good multi-view coverage, I'm unsure how well the current version of SplaTAM will perform on the data.

from splatam.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.