Git Product home page Git Product logo

Comments (2)

zhizdev avatar zhizdev commented on June 24, 2024 1

Hi David, thanks for the response! I really appreciate it!

It took a while for me to look at the example_co3d_challenge_submission.py and see how the query image is rendered and pasted onto the original image.

Some notes in case it may be helpful for future readers:

  1. When we are loading pytorch3d cameras and cropped images with the training data loader, the cameras intrinsic parameters are adjusted according to the object bounding box detected from the foreground mask.
  2. When we load the fewview_dev/fewvew_test sets, the query image is not given; thus, we are given unadjusted intrinsic parameters. This may result in different focal length than we are used to seeing for views with box crop.

from co3d.

davnov134 avatar davnov134 commented on June 24, 2024

Hi, the focal lengths within a scene are different because we let COLMAP to extract an image-specific (as opposed to a camera-specific) focal length. The main reason is that the consumer smartphones used to capture the videos often automatically change focus during the capture, which effectively leads to a varying focal length of the video frames.

In terms of example_co3d_challenge_submission.py, we did several unit tests that ensure that the focal length is correctly adjusted when cropping the image around a segmentation mask. The algorithm does not handle samples from different sets (fewview_dev/fewvew_test) differently so there should not be an issue.

Please note that, for the test subset (i.e. fewvew_test/manyview_test), all depth maps and all test images are redacted (i.e. these are blank black images). Hence, example_co3d_challenge_submission.py produces non-sensical results for fewvew_test/manyview_test since the DBIR renderer generates an invalid point cloud from the redacted depth maps of the source views.

from co3d.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.