Git Product home page Git Product logo

rgbt-ped-detection's Introduction

Important notice, Our server is under maintenance. (Feb 07 14:30 KST ~ Feb 08 17:00 KST). We are sorry for your inconvenience.

KAIST Multispectral Pedestrian Detection Benchmark

By Soonmin Hwang, Jaesik Park, Namil Kim, Yukyung Choi, In So Kweon at RCV Lab. (KAIST) [Website] teaserImage

We developed imaging hardware consisting of a color camera, a thermal camera and a beam splitter to capture the aligned multispectral (RGB color + Thermal) images. With this hardware, we captured various regular traffic scenes at day and night time to consider changes in light conditions.

The KAIST Multispectral Pedestrian Dataset consists of 95k color-thermal pairs (640x480, 20Hz) taken from a vehicle. All the pairs are manually annotated (person, people, cyclist) for the total of 103,128 dense annotations and 1,182 unique pedestrians. The annotation includes temporal correspondence between bounding boxes like Caltech Pedestrian Dataset. More infomation can be found in our CVPR 2015 [paper] [Ext. Abstract].

 

Usage

  1. Clone this repository.
git clone --recursive https://github.com/soonminhwang/rgbt-ped-detection
  1. First, download multispectral dataset.
cd rgbt-ped-detection/data/scripts
chmod +x fetch_dataset_kaist_cvpr15.py & ./fetch_dataset_kaist_cvpr15.py & cd ../../

or

Run fetch_dataset_kaist_cvpr15.m in MATLAB.

  1. Altenatively, you can get direct links for the dataset here.

  2. Then just run acfDemoKAIST.m in MATLAB

Preview

Set00, Day-Campus Set04, Night-Road Set05, Night-Downtown

Toolbox

This repository includes an extension of Piotr's Computer Vision Matlab Toolbox. We modify some codes to deal with 4-ch RGB+T images, e.g. ${PIOTR_TOOLBOX}/channels/chnsCompute.m. All the modifications are in libs/.

 

Experimental results

Many researchers struggle to improve pedestrian detection performance on our benchmark. If you are interested, please see these works.

Also, another researches to employ multi-modality are presented.

  • Image-to-image translation [Arxiv '17]
  • Calibrations

Related benchmarks

Related

The horizontal lines divide the image types of the dataset (color, thermal and color-thermal). Note that our dataset is largest color-thermal dataset providing occlusion labels and temporal correspondences captured in a non-static traffic scenes.

Please see our Place Recognition Benchmark. [Link]

Citation

If you use our extended toolbox or dataset in your research, please consider citing:

@inproceedings{hwang2015multispectral,
	Author = {Soonmin Hwang and Jaesik Park and Namil Kim and Yukyung Choi and In So Kweon},
	Title = {Multispectral Pedestrian Detection: Benchmark Dataset and Baselines},
	Booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
	Year = {2015}
}

rgbt-ped-detection's People

Contributors

soonminhwang avatar

Stargazers

depeng_cust avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.