Comments (5)

geopavlakos commented on August 20, 2024

In this case, the original pred_cam[:,0] value corresponds to s, the scaling factor of the weak perspective projection, which approximates f/Z. So the depth of the human is Z = f/s. Then, we also divide by the factor bbox_size/2, so that we project the human to [-0.5,0.5].
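
As a rough illustration of the relation above, here is a minimal sketch in plain Python. The function names and numeric values are hypothetical and not taken from the repository; the projection is assumed to be a simple pinhole model with the principal point at the origin and the focal length divided by the box size, as in the snippet quoted later in this thread. Under those assumptions, a point at lateral offset X projects to s * X / 2, which matches the [-0.5, 0.5] range when s * X lies in [-1, 1].

```python
def depth_from_scale(s, focal_length, box_size, eps=1e-9):
    """Depth implied by the weak-perspective scale s.

    Z = f / s, further divided by box_size / 2, i.e. Z = 2 f / (box_size * s).
    eps guards against division by zero (an assumption of this sketch).
    """
    return 2.0 * focal_length / (box_size * s + eps)


def project_x(x, z, focal_length, box_size):
    """Pinhole projection of one coordinate with a box-normalized focal length."""
    return (focal_length / box_size) * x / z


# Hypothetical values chosen only for illustration.
s = 0.8                # weak-perspective scale, pred_cam[:, 0]
focal_length = 5000.0  # assumed focal length in pixels
box_size = 256.0       # assumed crop size in pixels
x = 0.5                # lateral offset of a 3D point

z = depth_from_scale(s, focal_length, box_size)
u = project_x(x, z, focal_length, box_size)

# The full-perspective projection reproduces the weak-perspective one, halved.
print(z, u, s * x / 2.0)
```

Note that the focal length cancels in the projection, which is why its exact value does not affect the projected 2D coordinates in this setup.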

from 4d-humans.

dengyang11 commented on August 20, 2024

Thanks again. In addition, why does the focal length change with the image size? Thanks

  1. scaled_focal_length = model_cfg.EXTRA.FOCAL_LENGTH / model_cfg.MODEL.IMAGE_SIZE * img_size.max()
  2. pred_keypoints_2d = perspective_projection(pred_keypoints_3d,
    translation=pred_cam_t,
    focal_length=focal_length / self.cfg.MODEL.IMAGE_SIZE)

geopavlakos commented on August 20, 2024

You can use an arbitrary focal length value when you use the above equation. We adopt the design decisions of ProHMR. Note that self.cfg.MODEL.IMAGE_SIZE is constant (set to 256).
For the demo code, this is just a design choice to visualize the results with larger focal length values in general. You could experiment with other values too.
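
One way to read the demo-code scaling, sketched below in plain Python, is that multiplying the focal length by the ratio of image size to model input size keeps the field of view the same at any resolution. This is a geometric observation under a pinhole model, not a statement of the authors' reasoning. The base focal length of 5000 and the 1920x1080 image are hypothetical values; only the constant 256 comes from the comment above.

```python
import math


def scaled_focal_length(base_focal, model_image_size, img_w, img_h):
    """Mirror of the demo formula: FOCAL_LENGTH / IMAGE_SIZE * img_size.max()."""
    return base_focal / model_image_size * max(img_w, img_h)


def field_of_view(focal, extent):
    """Field of view (radians) of a pinhole camera over `extent` pixels."""
    return 2.0 * math.atan(extent / (2.0 * focal))


BASE_FOCAL = 5000.0     # hypothetical stand-in for model_cfg.EXTRA.FOCAL_LENGTH
MODEL_IMAGE_SIZE = 256  # constant, as stated above

# Field of view of the 256-pixel model input at the base focal length.
fov_crop = field_of_view(BASE_FOCAL, MODEL_IMAGE_SIZE)

# Field of view of a hypothetical 1920x1080 image at the scaled focal length.
f_full = scaled_focal_length(BASE_FOCAL, MODEL_IMAGE_SIZE, 1920, 1080)
fov_full = field_of_view(f_full, 1920)

print(f_full, math.degrees(fov_crop), math.degrees(fov_full))
```

Both angles come out equal, so the rendered result looks the same regardless of the input resolution; a different base value would change the angle but not this invariance.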

dengyang11 commented on August 20, 2024

Thanks again

nnop commented on August 20, 2024

"Then, we also divide by the factor bbox_size/2, so that we project the human to [-0.5,0.5]."

Do you mean normalize to [-1, 1]? @geopavlakos
Also, I think it would be more appropriate to normalize by bbox_size rather than the image size, since it is the bounding box that gets resized to MODEL.IMAGE_SIZE.
