Git Product home page Git Product logo

image_shape_align's Introduction

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% License and attribution:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

This package provides software modules for image and shape analysis. The code is released under the MIT license and can be used for any purpose with proper attribution. The code accompanies the following paper, which should be cited in publications that use the provided modules:

Single-View Reconstruction via Joint Analysis of Image and Shape Collections
Qixing Huang, Hai Wang, and Vladlen Koltun
ACM Transactions on Graphics 34(4), 2015

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the parameters
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  load('data\parameters.mat');

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Attention: The rendering setting in Para (i.e., Para.rHeights,
% Para.rWidhts) is tested on Windows machines. If you are using
% Linux or Mac, please call the following function to update
% these two vectors:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Para = update_para(Para);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the images and shapes
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  load('data\car.mat');
  Note that the shapes are assumed to be consistently oriented in
  a world coordinate system

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module I: Pose estimation
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  The pose estimation module predicts the camera pose for the underlying object with respect to the world coordinate system of the shapes. This is done by sampling the camera poses, rendering the shapes from the camera poses, and comparing real images to rendered images. The most time-consuming part is image rendering, which is done by precomputation.

  Cameras = cam_camera_sampling(Para); % camera simulation
  hogRender = cam_shape_hog_dess(...   %
                Shapes,...  % The aligned input shapes
                Cameras,... % The sampled camera poses
                Para,...    % Please refer to the function body
                verbose);   % verbose = 1 if you want to print it
                              out


  Camera estimation can be done in two ways: for a single image
  in isolation or for a collection of images

  Camera_init = cam_pose_est_single(...
                  Image,...     % The input image object
                  Shapes,...    % The aligned input shapes
                  Cameras,...   % The sampled camera poses
                  hogRender,... % The rendered hog descriptors
                  Para,...  % Please refer to the function body
                  verbose); % verbose = 1 if you want to print it
                              out

  Cameras_init = cam_pose_est_joint(...
                  inputImages,...  % The input images
                  Shapes,...       % The aligned input shapes
                  cameraSamples,...% The sampled camera poses
                  hogRender,...    % The rendered hog descriptors
                  Para,...   % Please refer to the function body
                  verbose)   % verbose = 1 if you want to print
                               it out

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module II: Pairwise image-shape alignment
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  This module performs image-shape alignment for one image. It selects the closest shape and deforms it to fit the image object. Note that the pose estimation module is packed in the following main function:

  [Camera_opt, Shape_opt] = demo_i2s_align(...
    Image,...  % Input image
    Shapes,... % Input shapes
    Para);     % Parameters used in alignment. Please refer to the
                function body for details

 Output argments:
  'Shape_opt' : The optimized shape, which aligns with the input
                image object
  'Camera_opt': The associated optimized camera configuration

Demo:
   >load('data\chair.mat');
   >load('data\parameters.mat');
   >[Camera_opt, Shape_opt] = demo_i2s_align(Images{1}, Shapes, Para);


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module III: Assembly-based reconstruction
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Assembly-based reconstruction aims to recover the underlying 3D model of an image object by assembling parts from a small collection of relevant shapes. The input consists of one image and a few shapes, each of which comprises a number of components. We assume that the shapes are consistently aligned in a world coordinate system. We assume that the pose of the image object is given. The main function is

  [Shape_opt, Camera_opt] = demo_ibm_mix_and_match(...
    Image,...             % The input image
    Camera_init,...       % The initial camera configuration for
                            all input shapes
    Shapes,...            % The input shapes
    Para);                % Please refer to the functional body
                            for details

  Output argments:
    'Shape_opt' : The reconstructed shape in the world coordinate
                  system
    'Camera_opt': The associated optimized camera configuration

Demo:
   >load('data\mix_and_match.mat');
   >load('data\parameters.mat');
   >[Shape_opt, Camera_opt] = demo_ibm_mix_and_match(Image, Camera_init, Shapes, Para);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module IV: Joint image-shape correspondence
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Joint image-shape correspondence module estimates dense pixelwise correspondences among a collection of images and a collection of shapes. We assume that (i) the camera poses of the images are pre-computed (e.g., using Module I), and (ii) the shapes are consistently aligned in a world coordinate system. The main function is

  PairMatches = demo_joint_i2s_corres_main(...
    Images,...       % Input images
    ImageCameras,... % The camera configurations associated with
                       each input image
    Shapes,...       % The input shapes (aligned in a
                       world coordinate system)
    Top,...          % (#Images x #Shapes) is a sparse matrix
                       that specifies image-shape pairs
                       for matching.
    Para);           %

  Warning: If you use >100 images and/or >100 shapes,
           the computation may take a very long time.

  Demo:
    >load('data\joint_matching.mat');
    >load('data\parameters.mat');
    >Corres_rs = demo_joint_i2s_corres_main(Images(1:10), ImageCameras(1:10), Shapes(1:5), ones(10,5), Para);


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% External libraries
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  The software package uses the following external libraries:

  Piotr's Image & Video Matlab Toolbox:
  http://vision.ucsd.edu/~pdollar/toolbox/doc/
  http://research.microsoft.com/en-us/downloads/389109f6-b4e8-404c-84bf-239f7cbf4e3d/

  The SIFT flow package:
  http://people.csail.mit.edu/celiu/SIFTflow/

  The OPENGM2 structure predication package:
  http://hci.iwr.uni-heidelberg.de/opengm2/?l0=library

  HOG descriptor: functions 'imResample', 'hog'

  Edge map: function 'edgesDetect'

  OPENGM2: the trws algorithm

References:

Navneet Dalal, Bill Triggs: Histograms of Oriented Gradients for Human Detection. CVPR 2005

Piotr Dollár, C. Lawrence Zitnick: Fast Edge Detection Using Structured Forests. IEEE Trans. Pattern Anal. Mach. Intell. 37(8): 1558-1570 (2015)

Ce Liu, Jenny Yuen, Antonio Torralba: SIFT Flow: Dense Correspondence across Scenes and Its Applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5): 978-994 (2011)

Andres, B. and Beier T. and Kappes, J.H. : OpenGM: A C++ Library for Discrete Graphical Models. http://arxiv.org/abs/1206.0111

image_shape_align's People

Contributors

huangqx avatar tuanfeng avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.