nianticlabs / mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Home Page: https://nianticlabs.github.io/mickey/
License: Other
I'm a beginner with ML models for perception, so apologies if this question is basic.
In demo_inference.py, images are read and resized to a default size of (540, 720). There is also a read_intrinsics function with a default parameter resize.
My question is: if the input images are not of size (540, 720), do the intrinsics need to be rescaled accordingly? If not, when would one use that parameter?
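For context on what such a rescaling usually involves: when an image is resized, the focal lengths and principal point of a pinhole intrinsics matrix scale with the width and height factors. This is a general sketch, not MicKey's actual read_intrinsics implementation:

```python
import numpy as np

def rescale_intrinsics(K, orig_hw, new_hw):
    """Scale a 3x3 pinhole intrinsics matrix for a resized image.

    fx and cx scale with the width ratio; fy and cy with the height ratio.
    orig_hw and new_hw are (height, width) tuples.
    """
    sy = new_hw[0] / orig_hw[0]
    sx = new_hw[1] / orig_hw[1]
    K_scaled = K.astype(float).copy()
    K_scaled[0, 0] *= sx  # fx
    K_scaled[0, 2] *= sx  # cx
    K_scaled[1, 1] *= sy  # fy
    K_scaled[1, 2] *= sy  # cy
    return K_scaled

# Example: a 1080x1920 image resized to the default (540, 720)
K = np.array([[1000.0, 0.0, 960.0],
              [0.0, 1000.0, 540.0],
              [0.0, 0.0, 1.0]])
K_resized = rescale_intrinsics(K, (1080, 1920), (540, 720))
```

So if the matcher consumes resized images, the intrinsics it is given generally need to match that resolution.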
I am new to MVS research, so pardon me if the question is naive.
How do you ensure that the predicted depth is correct metric depth? In other words, how do you ensure the depth predictions have the correct scale? AFAIK you do not supervise the depth maps directly.
Thank you for your work. If the input is an image pair, with one as a reference image and the other as a source image, can the Mickey model convert the source image to the perspective of the reference image, i.e., solve for the transformation matrix between the two? Could you point out how to do this in the code? The demo seems oriented toward outputting the depth map and the confidence score map, but it does not give me the transformation result. Looking forward to your reply!
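For what it's worth, the demo's output dict does include 'R' and 't' keys, which look like the relative pose. Assuming they are a 3x3 rotation and a 3-vector translation (an assumption, not confirmed by the docs), one could compose them into a homogeneous transform and warp individual pixels using the predicted metric depth, roughly like this sketch:

```python
import numpy as np

def relative_transform(R, t):
    """Compose a 4x4 homogeneous transform from a 3x3 rotation and translation.

    Assumes R, t map points from the source camera frame to the reference frame
    (the actual convention used by the model's outputs should be verified).
    """
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = np.asarray(t).reshape(3)
    return T

def warp_pixel(uv, depth, K_src, K_ref, T):
    """Back-project a source pixel at its metric depth, apply T, reproject
    into the reference image. Returns the (u, v) location in the reference view."""
    u, v = uv
    p_src = depth * np.linalg.inv(K_src) @ np.array([u, v, 1.0])  # 3D point, source frame
    p_ref = T[:3, :3] @ p_src + T[:3, 3]                          # 3D point, reference frame
    uv_ref = K_ref @ p_ref
    return uv_ref[:2] / uv_ref[2]
```

Warping the whole image this way requires a dense depth map and handling of occlusions, so this is only the per-pixel geometry, not a full view-synthesis pipeline.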
Hello, could you provide the visualization code used to generate Fig. 6 in the paper?
Hi, thanks for the interesting work. We'd like to use this in the context of detecting loop closures for visual SLAM.
As the dataset we are using does not really line up well with what MicKey was trained on (indoor scenes of partially constructed buildings), I don't expect MicKey to perform well out of the box. However, I'd like to somehow quantify "how bad is the domain gap".
On the website, you show a confidence metric between two images.
Is it possible to obtain this number from the pose matcher? The output of the model from demo_inference.py is:
Data keys: dict_keys(['image0', 'image1', 'K_color0', 'K_color1', 'kps0_shape',
'kps1_shape', 'depth0_map', 'depth1_map', 'down_factor', 'kps0', 'depth_kp0',
'scr0', 'kps1', 'depth_kp1', 'scr1', 'scores', 'dsc0', 'dsc1', 'kp_scores', 'final_scores',
'R', 't', 'inliers', 'inliers_list'])
Is the confidence in one of these output variables?
(a more general question: What do all of these mean?)
Thanks for your work!
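In case it helps frame the question: one plausible proxy for such a confidence is the fraction of proposed correspondences that survive as pose-solver inliers. The sketch below assumes the 'inliers' entry is an inlier count and 'kps0' holds the image-0 keypoints; whether this matches the number shown on the website is exactly what I'm asking:

```python
def match_confidence(num_inliers, num_keypoints):
    """Hypothetical confidence proxy: inlier ratio of the estimated pose.

    Both arguments would come from the demo's output dict ('inliers' and the
    keypoint count from 'kps0'); their exact semantics are an assumption here.
    """
    return num_inliers / max(num_keypoints, 1)

# e.g. 120 inliers out of 400 detected keypoints -> 0.3
```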
I would like to know whether this work can do cross-domain image matching, for example between visible-light images and infrared images?