Awesome Scene Understanding

A resource repository for scene understanding, inspired by 3D-Machine-Learning.

Contributing

Please feel free to pull requests to add papers.

Survey
Dataset
- Realistic Datasets
- Synthetic Datasets
Holistic Scene Understanding
- Perspective Image
- Panoramic Image
Room Layout Estimation
- Perspective Image
- Panoramic Image
Floorplan
Primitive Detection
- Junction
- Line Segment
- Wireframe
- Plane
- Rectangle
- Cuboid
Object Reconstruction
- Voxel
- Point Cloud
- Mesh
- Primitive

Survey

RGBD Datasets: Past, Present and Future (CVPRW'16) [Project] [Paper]
Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey (IEEE Access'19) [Paper]

Dataset

Realistic Dataset

[NYUv2] Indoor Segmentation and Support Inference from RGBD Images (ECCV'12) [Project] [Paper]
SUN3D: A Database of Big Spaces Reconstructed using SfM and Object Labels (ICCV'13) [Project] [Paper]
SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite (CVPR'15) [Project] [Paper]
SceneNN: a Scene Meshes Dataset with aNNotations (3DV'16) [Project] [Paper]
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes (CVPR'17) [Project] [Paper]
[2D-3D-S] Joint 2D-3D-Semantic Data for Indoor Scene Understanding (CoRR'17) [Project] [Paper]
Matterport3D: Learning from RGB-D Data in Indoor Environments (3DV'17) [Project] [Paper] [Code]

Synthetic Dataset

The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes (CVPR'16) [Project] [Paper]
SceneNet: Understanding Real World Indoor Scenes With Synthetic Data (CVPR'16) [Project] [Paper]
[SUNCG] Semantic Scene Completion from a Single Depth Image (CVPR'17) [Project] [Paper]
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-training on Indoor Segmentation? (ICCV'17) [Project] [Paper]
InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset (BMVC'18) [Project] [Paper]

Holistic Scene Understanding

Perspective Image

Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry (ECCV'10) [Paper]
Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces (NeurIPS'10) [Paper]
Efficient Structured Prediction for 3D Indoor Scene Understanding (CVPR'12) [Paper]
Efficient Exact Inference for 3D Indoor Scene Understanding (ECCV'12) [Paper]
Recovering Free Space of Indoor Scenes from a Single Image (CVPR'12) [Paper]
Understanding Indoor Scenes using 3D Geometric Phrases (CVPR'13) [Paper]
Scene Parsing by Integrating Function, Geometry and Appearance Models (CVPR'13) [Project] [Paper]
Im2CAD (CVPR'18) [Project] [Paper]
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding (ICCV'17) [Project] [Paper]
Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene (CVPR'18) [Project] [Paper] [Code]
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image (ECCV'18) [Project] [Paper] [Code]
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation (NeurIPS'18) [Project] [Paper] [Code]
Complete 3D Scene Parsing from an RGBD Image (IJCV'18) [Paper]

Panoramic Image

PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding (ECCV'14) [Project] [Paper]
Pano2CAD: Room Layout From A Single Panorama Image (WACV'17) [Paper]
Automatic 3D Indoor Scene Modeling from Single Panorama (CVPR'18) [Paper]

Room Layout Estimation

Perspective Image

Recovering the Spatial Layout of Cluttered Rooms (ICCV'09) [Paper]
Estimating the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors (ICCV'13) [Project] [Paper]
Box In the Box: Joint 3D Layout and Object Reasoning from Single Images (CVPR'13) [Paper]
Rent3D: Floor-Plan Priors for Monocular Layout Estimation (CVPR'15) [Project] [Paper]
Learning Informative Edge Maps for Indoor Scene Layout Prediction (ICCV'15) [Homepage] [Paper]
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes (CVPR'16) [Paper]
A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method (ACCV'16) [Paper]
Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation (CVPR'17) [Project] [Paper]
RoomNet: End-to-End Room Layout Estimation (ICCV'17) [Paper]
Thinking Outside the Box: Generation of Unconstrained 3D Room Layouts (ACCV'18)

Panoramic Image

Efficient 3D Room Shape Recovery From a Single Panorama (CVPR'16) [Project] [Paper] [Code]
LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image (CVPR'18) [Paper] [Code]
Layouts from Panoramic Images with Geometry and Deep Learning (IROS'18) [Paper] [Code]
HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation (CVPR'19) [Paper] [Code]
DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama (CVPR'19) [Project] [Paper]
Corners for Layout: End-to-End Layout Recovery from 360 Images (CoRR'19) [Project] [Paper] [Code]

Floorplan

Raster-to-Vector: Revisiting Floorplan Transformation (ICCV'17) [Project] [Paper] [Code]
FloorNet: A unified framework for floorplan reconstruction from 3D scans (ECCV'18) [Project] [Paper] [Code]
CubiCasa5K: A Dataset and an Improved Multi-Task Model for Floorplan Image Analysis (CoRR'19) [Paper] [Code]
DeepPerimeter: Indoor Boundary Estimation from Posed Monocular Sequences (CoRR'19) [Paper]

Primitive Detection

Junction

Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes (CVPR'13) [Paper]

Line Segment

LSD: A Fast Line Segment Detector with a False Detection Control (TPAMI'10) [Paper]
Lifting 3D Manhattan Lines from a Single Image (ICCV'15) [Paper]
MCMLSD: A Dynamic Programming Approach to Line Segment Detection (CVPR'17) [Paper]
A Novel Linelet-Based Representation for Line Segment Detection (TPAMI'18) [Paper]
Novel Single View Constraints for Manhattan 3D Line Reconstruction (3DV'18) [Paper]
Learning Attraction Field Representation for Robust Line Segment Detection (CVPR'19) [Paper] [Code]

Wireframe

Learning to Parse Wireframes in Images of Man-Made Environments (CVPR'18) [Paper] [Code]
PPGNet: Learning Point-Pair Graph for Line Segment Detection (CVPR'19) [Paper] [Code]
End-to-End Wireframe Parsing (CoRR'19) [Paper] [Code]

Plane

PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image (CVPR'18) [Project] [Paper] [Code]
Recovering 3D Planes from a Single Image via Convolutional Neural Networks (ECCV'18) [Paper] [Code]
PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image (CVPR'19) [Paper]
Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding (CVPR'19) [Paper] [Code]

Rectangle

Bottom-Up/Top-Down Image Parsing with Attribute Grammar (TPAMI'09) [Paper]

Cuboid

Deep Cuboid Detection: Beyond 2D Bounding Boxes (CoRR'16) [Paper]

Object Reconstruction

Voxel-based

3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction (ECCV'16) [Project] [Paper] [Code]

Point Cloud-based

A Point Set Generation Network for 3D Object Reconstruction from a Single Image (CVPR'17) [Paper] [Code]

Mesh-based

Neural 3D Mesh Renderer (CVPR'18) [Project] [Paper] [Code]
A Papier-Mâché Approach to Learning 3D Surface Generation (CVPR'18) [Project] [Paper] [Code]
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images (ECCV'18) [Paper] [Code]
GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects (ICML'19) [Paper] [Code]
A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images (CVPR'19) [Paper]

Primitive-based

GRASS: Generative Recursive Autoencoders for Shape Structures (SIGGRAPH'17) [Project] [Paper] [Code]
Learning Shape Abstractions by Assembling Volumetric Primitives (CVPR'17) [Project] [Paper] [Code]
3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks (ICCV'17) [Paper] [Code]
Im2Struct: Recovering 3D Shape Structure from a Single RGB Image（CVPR'18) [Paper] [Code]
Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids (CVPR'19) [Paper] [Code]

mmlph / awesome-scene-understanding Goto Github PK

awesome-scene-understanding's Introduction

Awesome Scene Understanding

Contributing

Table of Contents

Survey

Dataset

Realistic Dataset

Synthetic Dataset

Holistic Scene Understanding

Perspective Image

Panoramic Image

Room Layout Estimation

Perspective Image

Panoramic Image

Floorplan

Primitive Detection

Junction

Line Segment

Wireframe

Plane

Rectangle

Cuboid

Object Reconstruction

Voxel-based

Point Cloud-based

Mesh-based

Primitive-based

awesome-scene-understanding's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org