A Bottom-up Framework for Construction of Structured Semantic 3D Scene Graph

This work is based on our paper (IROS 2020, accepted). We proposed a new framework to construct 3D Structured Semantic Scene Graph. Our work is based on YOLO (V3), Apriltag and ORB-SLAM2, implemented in ROS.

Author: Bangguo Yu, Chongyu Chen, Fengyu Zhou, Fang Wan, Wenmi Zhuang and Yang Zhao

Affiliation: Shandong University and DarkMatter AI Research

3D Structured Semantic Scene Graph Framework

The input of the architecture includes the sequence of RGB-D images and scene priors of an improved S-AOG structure. In the first step, we use yolov-v3 and apriltag to recognize object information. Limited by the recognition algorithm for big objects like desk or room, we adopted the apriltag to detect them. After object recognition, the 3D bounding box is estimated by the height and width of 2D bounding box and depth information roughly. Specifically, we use ORB-SLAM2 to estimate the position. The relation extraction module is designed to obtain the relations between objects using 3D position and 3D bounding box. The attribute extraction module is designed to focus on the 3D position, size and color. Then using scene priors, a parse graph pg can be calculated for each image by maximizing a posteriori estimation (MAP), which is the best explanation for extracted objects, attributes and relations. The output of MAP is added to keyframe group filter (KGF). KGF module is designed to remove repeated and fake detections. After KGF optimization, parse graph is added to local scene graphs and displayed in real-time. Globe scene graph can be obtained by updating and merging the local scene graphs

Requirements

Ubuntu 16.04+
Python 2.7
YOLO (V3)
Apriltag
ORB-SLAM2
Graphviz
ROS kinetic

Installation

The code has been tested only with Python 2.7, CUDA 9.0 on Ubuntu 16.04.

Build a catkin_workspace

mkdir -p ~/catkin_ws/src
cd ~/catkin_ws/src/

Install YOLO (V3), Apriltag and ORB-SLAM2 repository in this workspace.

Download and build 3D Structured Semantic Scene Graph repository:

git clone https://github.com/ybgdgh/parse_graph.git
catkin_init_workspace
cd ..
catkin_make -DCMAKE_BUILD_TYPE=Release
echo "source ~/catkin_ws/devel/setup.bash" >> ~/.bashrc
source ~/.bashrc

Download dataset

The scene dataset:

rosbag

The scene prior:

scene priors

Result

Demo Video

video

ybgdgh / parse_graph Goto Github PK

parse_graph's Introduction

A Bottom-up Framework for Construction of Structured Semantic 3D Scene Graph

3D Structured Semantic Scene Graph Framework

Requirements

Installation

Download dataset

Result

Demo Video

parse_graph's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent