kunkun0w0 / clean-torch-nerfs Goto Github PK

View Code? Open in Web Editor NEW

41.0 41.0 6.0 7.88 MB

Clean NeRFs Reproduction using Tiny Lego

Python 100.00%

clean-torch-nerfs's People

Contributors

Stargazers

Watchers

Forkers

qhfan william2ai imliekai zhou0531 bjy12 jidedaka

clean-torch-nerfs's Issues

How to understand the process of projecting sampling points onto feature maps?

ref_pos = torch.einsum("kij,kbsj->kbsi", R_t, pos - camera_pos)
uv_pos = ref_pos[..., :-1] / ref_pos[..., -1:] / scale

Can you briefly explain how this code implements projection? It's so hard to understand

About feature_matching function

Hi, thanks for your excellent and brief implementation!

def feature_matching(self, pos):
        n_rays, n_samples, _ = pos.shape
        pos = pos.unsqueeze(dim=0).expand([self.n, n_rays, n_samples, 3])
        camera_pos = self.camera_pos[:, None, None, :]
        camera_pos = camera_pos.expand_as(pos)
        ref_pos = torch.einsum("kij,kbsj->kbsi", self.R_t, pos-camera_pos)
        uv_pos = ref_pos[..., :-1] / ref_pos[..., -1:] / self.scale
        uv_pos[..., 1] *= -1.0
        return F.grid_sample(self.reference, uv_pos, align_corners=True, padding_mode="border")

In main/PixelNeRF/Dataset.py line 89

 uv_pos = ref_pos[..., :-1] / ref_pos[..., -1:] / self.scale

This operation aims to transform 3D points from camera coordinate system to pixel coordinate system and scale to range [-1,1] .
But here the camera coordinate system is the OpenGL form (i.e. x->right, y->upper, z->inner) and most of the 3D points will have negative z-axis coordinate values. So I think we should use the absolute value of ref_pos[..., -1:] to do the transformation, or we will get a upside-down reprojection result as I observed in experiment.

About the sample_rays_np

感谢您的复现。

def sample_rays_np(H, W, f, c2w):
    # image coordinate (i,j)
    # c2w:[:3,:3] --> [R,t]
    i, j = np.meshgrid(np.arange(W, dtype=np.float32), np.arange(H, dtype=np.float32), indexing='xy')
    dirs = np.stack([(i - W * .5 + 0.5) / f, -(j - H * .5 + 0.5) / f, -np.ones_like(i)], -1)
    # print(dirs[..., None, :].shape)
    # print(c2w[:3,:3].shape)
    rays_d = np.sum(dirs[..., None, :] * c2w[:3, :3], -1)
    rays_o = np.broadcast_to(c2w[:3, -1], np.shape(rays_d))
    return rays_o, rays_d

你好，我问一下，这个c2w读取的是pose, pose不是w2c吗。
另外这个地方最后为什么要加0.5呢？

dirs = np.stack([(i - W * .5 + 0.5) / f, -(j - H * .5 + 0.5) / f, -np.ones_like(i)], -1)

kunkun0w0 / clean-torch-nerfs Goto Github PK

clean-torch-nerfs's People

Contributors

Stargazers

Watchers

Forkers

clean-torch-nerfs's Issues

How to understand the process of projecting sampling points onto feature maps?

About feature_matching function

About the sample_rays_np

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent