Git Product home page Git Product logo

awesome-scene-understanding's Introduction

Awesome Scene Understanding

A resource repository for scene understanding, inspired by 3D-Machine-Learning.

Contributing

Please feel free to pull requests to add papers.

Table of Contents

  • RGBD Datasets: Past, Present and Future (CVPRW'16) [Project] [Paper]

  • Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey (IEEE Access'19) [Paper]

  • [NYUv2] Indoor Segmentation and Support Inference from RGBD Images (ECCV'12) [Project] [Paper]

  • SUN3D: A Database of Big Spaces Reconstructed using SfM and Object Labels (ICCV'13) [Project] [Paper]

  • SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite (CVPR'15) [Project] [Paper]

  • SceneNN: a Scene Meshes Dataset with aNNotations (3DV'16) [Project] [Paper]

  • ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes (CVPR'17) [Project] [Paper]

  • [2D-3D-S] Joint 2D-3D-Semantic Data for Indoor Scene Understanding (CoRR'17) [Project] [Paper]

  • Matterport3D: Learning from RGB-D Data in Indoor Environments (3DV'17) [Project] [Paper] [Code]

  • The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes (CVPR'16) [Project] [Paper]

  • SceneNet: Understanding Real World Indoor Scenes With Synthetic Data (CVPR'16) [Project] [Paper]

  • [SUNCG] Semantic Scene Completion from a Single Depth Image (CVPR'17) [Project] [Paper]

  • SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-training on Indoor Segmentation? (ICCV'17) [Project] [Paper]

  • InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset (BMVC'18) [Project] [Paper]

  • Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry (ECCV'10) [Paper]

  • Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces (NeurIPS'10) [Paper]

  • Efficient Structured Prediction for 3D Indoor Scene Understanding (CVPR'12) [Paper]

  • Efficient Exact Inference for 3D Indoor Scene Understanding (ECCV'12) [Paper]

  • Recovering Free Space of Indoor Scenes from a Single Image (CVPR'12) [Paper]

  • Understanding Indoor Scenes using 3D Geometric Phrases (CVPR'13) [Paper]

  • Scene Parsing by Integrating Function, Geometry and Appearance Models (CVPR'13) [Project] [Paper]

  • Im2CAD (CVPR'18) [Project] [Paper]

  • DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding (ICCV'17) [Project] [Paper]

  • Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene (CVPR'18) [Project] [Paper] [Code]

  • Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image (ECCV'18) [Project] [Paper] [Code]

  • Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation (NeurIPS'18) [Project] [Paper] [Code]

  • Complete 3D Scene Parsing from an RGBD Image (IJCV'18) [Paper]

  • PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding (ECCV'14) [Project] [Paper]

  • Pano2CAD: Room Layout From A Single Panorama Image (WACV'17) [Paper]

  • Automatic 3D Indoor Scene Modeling from Single Panorama (CVPR'18) [Paper]

  • Recovering the Spatial Layout of Cluttered Rooms (ICCV'09) [Paper]

  • Estimating the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors (ICCV'13) [Project] [Paper]

  • Box In the Box: Joint 3D Layout and Object Reasoning from Single Images (CVPR'13) [Paper]

  • Rent3D: Floor-Plan Priors for Monocular Layout Estimation (CVPR'15) [Project] [Paper]

  • Learning Informative Edge Maps for Indoor Scene Layout Prediction (ICCV'15) [Homepage] [Paper]

  • DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes (CVPR'16) [Paper]

  • A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method (ACCV'16) [Paper]

  • Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation (CVPR'17) [Project] [Paper]

  • RoomNet: End-to-End Room Layout Estimation (ICCV'17) [Paper]

  • Thinking Outside the Box: Generation of Unconstrained 3D Room Layouts (ACCV'18)

  • Efficient 3D Room Shape Recovery From a Single Panorama (CVPR'16) [Project] [Paper] [Code]

  • LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image (CVPR'18) [Paper] [Code]

  • Layouts from Panoramic Images with Geometry and Deep Learning (IROS'18) [Paper] [Code]

  • HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation (CVPR'19) [Paper] [Code]

  • DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama (CVPR'19) [Project] [Paper]

  • Corners for Layout: End-to-End Layout Recovery from 360 Images (CoRR'19) [Project] [Paper] [Code]

  • Raster-to-Vector: Revisiting Floorplan Transformation (ICCV'17) [Project] [Paper] [Code]

  • FloorNet: A unified framework for floorplan reconstruction from 3D scans (ECCV'18) [Project] [Paper] [Code]

  • CubiCasa5K: A Dataset and an Improved Multi-Task Model for Floorplan Image Analysis (CoRR'19) [Paper] [Code]

  • DeepPerimeter: Indoor Boundary Estimation from Posed Monocular Sequences (CoRR'19) [Paper]

  • Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes (CVPR'13) [Paper]
  • LSD: A Fast Line Segment Detector with a False Detection Control (TPAMI'10) [Paper]

  • Lifting 3D Manhattan Lines from a Single Image (ICCV'15) [Paper]

  • MCMLSD: A Dynamic Programming Approach to Line Segment Detection (CVPR'17) [Paper]

  • A Novel Linelet-Based Representation for Line Segment Detection (TPAMI'18) [Paper]

  • Novel Single View Constraints for Manhattan 3D Line Reconstruction (3DV'18) [Paper]

  • Learning Attraction Field Representation for Robust Line Segment Detection (CVPR'19) [Paper] [Code]

  • Learning to Parse Wireframes in Images of Man-Made Environments (CVPR'18) [Paper] [Code]

  • PPGNet: Learning Point-Pair Graph for Line Segment Detection (CVPR'19) [Paper] [Code]

  • End-to-End Wireframe Parsing (CoRR'19) [Paper] [Code]

  • PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image (CVPR'18) [Project] [Paper] [Code]

  • Recovering 3D Planes from a Single Image via Convolutional Neural Networks (ECCV'18) [Paper] [Code]

  • PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image (CVPR'19) [Paper]

  • Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding (CVPR'19) [Paper] [Code]

  • Bottom-Up/Top-Down Image Parsing with Attribute Grammar (TPAMI'09) [Paper]
  • Deep Cuboid Detection: Beyond 2D Bounding Boxes (CoRR'16) [Paper]
  • 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction (ECCV'16) [Project] [Paper] [Code]
  • A Point Set Generation Network for 3D Object Reconstruction from a Single Image (CVPR'17) [Paper] [Code]
  • Neural 3D Mesh Renderer (CVPR'18) [Project] [Paper] [Code]

  • A Papier-Mâché Approach to Learning 3D Surface Generation (CVPR'18) [Project] [Paper] [Code]

  • Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images (ECCV'18) [Paper] [Code]

  • GEOMetrics: Exploiting Geometric Structure for Graph-Encoded Objects (ICML'19) [Paper] [Code]

  • A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images (CVPR'19) [Paper]

  • GRASS: Generative Recursive Autoencoders for Shape Structures (SIGGRAPH'17) [Project] [Paper] [Code]

  • Learning Shape Abstractions by Assembling Volumetric Primitives (CVPR'17) [Project] [Paper] [Code]

  • 3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks (ICCV'17) [Paper] [Code]

  • Im2Struct: Recovering 3D Shape Structure from a Single RGB Image(CVPR'18) [Paper] [Code]

  • Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids (CVPR'19) [Paper] [Code]

awesome-scene-understanding's People

Contributors

bertjiazheng avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.