A resource repository for scene understanding, inspired by 3D-Machine-Learning.
Please feel free to open pull requests to add papers.
- Survey
- Dataset
- Holistic Scene Understanding
- Room Layout Estimation
- Floorplan
- Primitive Detection
- Object Reconstruction
- RGBD Datasets: Past, Present and Future (CVPRW'16) [Project] [Paper]
- Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey (IEEE Access'19) [Paper]
- [NYUv2] Indoor Segmentation and Support Inference from RGBD Images (ECCV'12) [Project] [Paper]
- SUN3D: A Database of Big Spaces Reconstructed using SfM and Object Labels (ICCV'13) [Project] [Paper]
- SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite (CVPR'15) [Project] [Paper]
- SceneNN: a Scene Meshes Dataset with aNNotations (3DV'16) [Project] [Paper]
- ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes (CVPR'17) [Project] [Paper]
- [2D-3D-S] Joint 2D-3D-Semantic Data for Indoor Scene Understanding (CoRR'17) [Project] [Paper]
- Matterport3D: Learning from RGB-D Data in Indoor Environments (3DV'17) [Project] [Paper] [Code]
- The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes (CVPR'16) [Project] [Paper]
- SceneNet: Understanding Real World Indoor Scenes With Synthetic Data (CVPR'16) [Project] [Paper]
- [SUNCG] Semantic Scene Completion from a Single Depth Image (CVPR'17) [Project] [Paper]
- SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-training on Indoor Segmentation? (ICCV'17) [Project] [Paper]
- InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset (BMVC'18) [Project] [Paper]
- Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry (ECCV'10) [Paper]
- Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces (NeurIPS'10) [Paper]
- Efficient Structured Prediction for 3D Indoor Scene Understanding (CVPR'12) [Paper]
- Efficient Exact Inference for 3D Indoor Scene Understanding (ECCV'12) [Paper]
- Recovering Free Space of Indoor Scenes from a Single Image (CVPR'12) [Paper]
- Understanding Indoor Scenes using 3D Geometric Phrases (CVPR'13) [Paper]
- Scene Parsing by Integrating Function, Geometry and Appearance Models (CVPR'13) [Project] [Paper]
- DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding (ICCV'17) [Project] [Paper]
- Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene (CVPR'18) [Project] [Paper] [Code]
- Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image (ECCV'18) [Project] [Paper] [Code]
- Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation (NeurIPS'18) [Project] [Paper] [Code]
- Complete 3D Scene Parsing from an RGBD Image (IJCV'18) [Paper]
- PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding (ECCV'14) [Project] [Paper]
- Pano2CAD: Room Layout From A Single Panorama Image (WACV'17) [Paper]
- Automatic 3D Indoor Scene Modeling from Single Panorama (CVPR'18) [Paper]
- Recovering the Spatial Layout of Cluttered Rooms (ICCV'09) [Paper]
- Estimating the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors (ICCV'13) [Project] [Paper]
- Box In the Box: Joint 3D Layout and Object Reasoning from Single Images (CVPR'13) [Paper]
- Rent3D: Floor-Plan Priors for Monocular Layout Estimation (CVPR'15) [Project] [Paper]
- Learning Informative Edge Maps for Indoor Scene Layout Prediction (ICCV'15) [Homepage] [Paper]
- DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes (CVPR'16) [Paper]
- A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method (ACCV'16) [Paper]
- Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation (CVPR'17) [Project] [Paper]
- RoomNet: End-to-End Room Layout Estimation (ICCV'17) [Paper]
- Thinking Outside the Box: Generation of Unconstrained 3D Room Layouts (ACCV'18)
- Efficient 3D Room Shape Recovery From a Single Panorama (CVPR'16) [Project] [Paper] [Code]
- LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image (CVPR'18) [Paper] [Code]
- Layouts from Panoramic Images with Geometry and Deep Learning (IROS'18) [Paper] [Code]
- HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation (CVPR'19) [Paper] [Code]
- DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama (CVPR'19) [Project] [Paper]
- Corners for Layout: End-to-End Layout Recovery from 360 Images (CoRR'19) [Project] [Paper] [Code]
- Raster-to-Vector: Revisiting Floorplan Transformation (ICCV'17) [Project] [Paper] [Code]
- FloorNet: A unified framework for floorplan reconstruction from 3D scans (ECCV'18) [Project] [Paper] [Code]
- CubiCasa5K: A Dataset and an Improved Multi-Task Model for Floorplan Image Analysis (CoRR'19) [Paper] [Code]
- DeepPerimeter: Indoor Boundary Estimation from Posed Monocular Sequences (CoRR'19) [Paper]
- Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes (CVPR'13) [Paper]
- LSD: A Fast Line Segment Detector with a False Detection Control (TPAMI'10) [Paper]
- Lifting 3D Manhattan Lines from a Single Image (ICCV'15) [Paper]
- MCMLSD: A Dynamic Programming Approach to Line Segment Detection (CVPR'17) [Paper]
- A Novel Linelet-Based Representation for Line Segment Detection (TPAMI'18) [Paper]
- Novel Single View Constraints for Manhattan 3D Line Reconstruction (3DV'18) [Paper]
- Learning Attraction Field Representation for Robust Line Segment Detection (CVPR'19) [Paper] [Code]
- Learning to Parse Wireframes in Images of Man-Made Environments (CVPR'18) [Paper] [Code]
- PPGNet: Learning Point-Pair Graph for Line Segment Detection (CVPR'19) [Paper] [Code]
- PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image (CVPR'18) [Project] [Paper] [Code]
- Recovering 3D Planes from a Single Image via Convolutional Neural Networks (ECCV'18) [Paper] [Code]
- PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image (CVPR'19) [Paper]
- Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding (CVPR'19) [Paper] [Code]
- Bottom-Up/Top-Down Image Parsing with Attribute Grammar (TPAMI'09) [Paper]
- Deep Cuboid Detection: Beyond 2D Bounding Boxes (CoRR'16) [Paper]
- 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction (ECCV'16) [Project] [Paper] [Code]
- A Point Set Generation Network for 3D Object Reconstruction from a Single Image (CVPR'17) [Paper] [Code]
- A Papier-Mâché Approach to Learning 3D Surface Generation (CVPR'18) [Project] [Paper] [Code]
- Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images (ECCV'18) [Paper] [Code]
- GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects (ICML'19) [Paper] [Code]
- A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images (CVPR'19) [Paper]
- GRASS: Generative Recursive Autoencoders for Shape Structures (SIGGRAPH'17) [Project] [Paper] [Code]
- Learning Shape Abstractions by Assembling Volumetric Primitives (CVPR'17) [Project] [Paper] [Code]
- 3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks (ICCV'17) [Paper] [Code]
- Im2Struct: Recovering 3D Shape Structure from a Single RGB Image (CVPR'18) [Paper] [Code]
- Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids (CVPR'19) [Paper] [Code]