Git Product home page Git Product logo

rauchlukas / article-pcss_for_construction-a_survey Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 0.0 8.24 MB

Datahub to the Applied Science Paper: Semantic Point Cloud Segmentation with Deep-Learning-Based Approaches for the Construction Industry: A Survey by Lukas Rauch et al.

Home Page: https://rauchlukas.github.io/

applied-science benchmark construction dataset datasets deep-learning industrial machine-learning paper pointcloud

article-pcss_for_construction-a_survey's Introduction

Figure 13: Reported results for semantic segmentation task on the large-scale indoor S3DIS benchmark.

Semantic segmentation benchmark results on the S3DIS dataset

Authors: Lukas Rauch, Thomas Braml
Correspondence: [email protected]
DOI: https://doi.org/10.3390/app13169146

Abstract

Point cloud learning has recently gained strong attention due to its applications in various fields, like computer vision, robotics, and autonomous driving. Point cloud semantic segmentation (PCSS) enables the automatic extraction of semantic information from 3D point cloud data, which makes it a desirable task for construction-related applications as well. Yet, only a limited number of publications have applied deep-learning-based methods to address point cloud understanding for civil engineering problems, and there is still a lack of comprehensive reviews and evaluations of PCSS methods tailored to such use cases. This paper aims to address this gap by providing a survey of recent advances in deep-learning-based PCSS methods and relating them to the challenges of the construction industry. We introduce its significance for the industry and provide a comprehensive look-up table of publicly available datasets for point cloud understanding, with evaluations based on data scene type, sensors, and point features. We address the problem of class imbalance in 3D data for machine learning, provide a compendium of commonly used evaluation metrics for PCSS, and summarize the most significant deep learning methods developed for PCSS. Finally, we discuss the advantages and disadvantages of the methods for specific industry challenges. Our contribution, to the best of our knowledge, is the first survey paper that comprehensively covers deep-learning-based methods for semantic segmentation tasks tailored to construction applications. This paper serves as a useful reference for prospective research and practitioners seeking to develop more accurate and efficient PCSS methods.

Keywords: point cloud; semantic segmentation; deep learning; machine learning; construction; automation; open source; dataset; survey

Bibtext

@article{rauch:2023,
  title={Semantic Point Cloud Segmentation with Deep-Learning-Based Approaches for the Construction Industry: A Survey},
  author={Rauch Lukas, Braml Thomas},
  journal={Applied Science},
  year={2023},
  publisher={MDPI}
}

Figure 2 : A tree structure to summarize the variety of common dataset configurations for 3D scene understanding tasks. Semantic segmentation benchmark results on the S3DIS dataset

Supplementary Material

TABLE I. List of publicly available datasets for 3D-scene understanding, categories by data acquisition method, the content of the dataset, used hardware, data representation, and extent of available annotation classes.

Declaration of data type real-world (R), synthetic (S).

Nr. Year Name Resource Data type Objects Indoor sites Urban (S) Urban (D) Industrial Infrastructure / Rural Panoramic cameras Stereo camera RGB-D TLS MLS ALS Aerial photogrammetry IMU GPS RGB sequence Depth sequence Point cloud 3D model RGB Intensity Mesh Normals # Sem. classes Object detection Pose estimation Shape classfication Object tracking Semantic segmentation Instance sem. segmentation PC registration Scene reconstruction Surface reconstruction Volume reconstruction SLAM # Points # Frames # Scenes # Scans
1 2009 Oakland 3-D link R 1 1 1 5 1 1,6M
2 2011 Ford Campus Vision and Lidar Data Set link R 1 1 1 1 1 1 1 1 1 1 1 1 2
3 2012 KITTI stereo evaluation 2012 link R 1 1 1 1 1 1 1 1 8 1 1 1 1 1 1 1,5K 22
4 2013 NYUv2 link R 1 1 1 1 14 1 407,0K 464
5 2013 SUN3D link R 1 1 1 1 1 1 1 254 415
6 2013 Sydney Urban Objects link R 1 1 1 14 1 613
7 2014 Paris-rue-Madame database link R 1 1 1 1 17 1 1 2,0M 1 2

Reported results for semantic segmentation task on the large-scale indoor S3DIS benchmark (including all 6 areas, 6-fold cross validation). Ranked in descending order based on mIoU performance.

Declaration: C---convolution-based, G---graph-based, H---hybrid, P---pooling-based, R---RNN-based, T---Transformer-based, V---voxel-based.

Rank Year Model Name Link Method mIoU mAcc oAcc
1 2022 WindowNorm+StratifiedTransformer link T 77.60 85.8
2 2022 PointMetaBase-XXL link MLP 77.00 -
3 2022 PointNeXt-XL link MLP 74.90 83.0
4 2022 DeepViewAgg link H 74.70 83.8
5 2022 RepSurf-U link MLP 74.30 82.6
6 2022 WindowNorm+PointTransformer link T 74.10 82.5
7 2022 PointNeXt-L link MLP 73.90 82.2
8 2020 PointTransformer link T 73.50 81.9

article-pcss_for_construction-a_survey's People

Contributors

rauchlukas avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.