Comments (11)
Hi,
Most of the preprocessing was done using internal tools at google, but as a rule of thumb, it usually works well to use mask-rcnn for segmenting the foreground object, and then run colmap with the segmented masks.
For the Davis example, most the calibration is done using the camera calibration tool in Nuke, with some mannual selection of detected keypoints.
from dynamic-video-depth.
Thank you for your reply. I noticed that you mentioned using ORBSLAM2 and COLMAP to produce camera pose estimates in your paper. I wonder whether the above method can be successful without mask or manual selection for only a small number of foreground moving objects.
from dynamic-video-depth.
orbslam typically works with reasonable dynamic scenes but it does require camera intrinsics. I think you can assume a reasonable focal length, and pass the keypoints and camera calibrations to colmap for further optimizations.
from dynamic-video-depth.
Thank you very much😁
from dynamic-video-depth.
I want to confirm the pose information required for the preprocessing process. Is the coordinate system of pose consistent with colmap? It is the world to camera coordinate system Tcw.
from dynamic-video-depth.
Hi, we assume a x right, y down Image coordinate system where the origin is top left, and the pose matrices in the npz files are camera2world transformations.
from dynamic-video-depth.
Add an additional discovery, does the text of the output video confuse refine and initial? Looks like the first one is initial, and second one is refine.
from dynamic-video-depth.
I think the order is correct; the initial depth will be flickering due to inconsistency.
The refined depth might suffer some detail loss due to flow inaccuracies on fast moving/ thin structures
from dynamic-video-depth.
Thank you for your reply. When I tested some videos of road scenes, I found that it would blur a lot of distant details, but I found that there were still a lot of details in the initial estimation of the network. I want to know if there are any suggestions on network parameter adjustment. I think it is mainly due to the influence of flow information. Can you give me some suggestions?
from dynamic-video-depth.
Hi, since our method takes optical flow and camera poses as geometric cues, objects further in the scene need more accurate flows and larger baselines. Single image depth maps are trained in a supervised way, therefore, are agnostic to such issues, but not temporally consistent.
from dynamic-video-depth.
Thank you for your reply.
from dynamic-video-depth.
Related Issues (14)
- Can not reproduce training result
- Question about triangulation results file
- Question about the colmap parameter setting and image resize need to convert the camera pose HOT 2
- How to get the triangulation files for customized videos? HOT 3
- Parameter finetuning vs Output finetuning
- Is there any plan to release model? HOT 3
- Script for rendering the teaser video (fix-view-video)
- Davis example dataset download link missing
- Question about scene flow sign
- Questions about the preprocessing code HOT 2
- Setting `--batch_size` to 8 results in RuntimeError HOT 1
- SyntaxError: invalid syntax HOT 1
- DAVIS datafiles uncomplete? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dynamic-video-depth.