Git Product home page Git Product logo

liwanli1202 / driver-distraction-detection Goto Github PK

View Code? Open in Web Editor NEW

This project forked from snehil1703/driver-distraction-detection

0.0 2.0 0.0 949.38 MB

Categorizing actions of driver and coining them safe or unsafe - Using Convolutional Neural Networks - 1) Caffe with pre-trained model of AlexNet on ImageNet Dataset; 2) Fine-Tuned Caffe Model

License: Other

C++ 80.58% CMake 1.52% Makefile 0.39% Shell 0.57% Python 4.63% Cuda 3.13% MATLAB 0.49% M 0.01% JavaScript 4.65% Ruby 0.02% CSS 3.12% PowerShell 0.01% Protocol Buffer 0.89%

driver-distraction-detection's Introduction

Vehicle's Driver Detection

In this project we are developing a system to detect driver’s motion in an image taken from inside the car using camera on the dashboard. This project uses Convolutional Neural Networks to train the model to classify the images into 10 different categories :
c0 - safe driving
c1 - texting (right)
c2 - talking on the phone (right)
c3 - texting (left)
c4 - talking on the phone (left)
c5 - operating the radio
c6 - drinking
c7 - reaching behind
c8 - hair and makeup
c9 - talking to passenger

Dataset

https://www.kaggle.com/c/state-farm-distracted-driver-detection/data

2D dashboard camera images where the driver’s actions are captured from passanger seat’s POV.

Training set - 15698 images Validation set -3363 images Test set - 3364 images

CNN Model

Software Packages used - Caffe Pre-trained model - Alexnet trained on ImageNet Dataset Trained the model after fine-tuning on our train-set and validation-set for 100,000 iterations Final model - 90000 iteration

Observations

The accuracy of the model increased from 56% to 99.43% when weights from pretrained AlexNet model were used. By testing the images covered with patches it was observed the features learned were accurate. Images that in which irrelevant portion was covered, were classified correctly even after adding a patch. Whereas one of the images shows the classification wrong because a crucial part of the image was hidden. This image when tested without patch was classified correctly. Also, it was observed that the dataset was over-fitted to the angle at which the images were taken and maybe to the 10 drivers that constituted the train-set. Since the original test set and train set have similar drivers the accuracy is very high but it dropped drastically when new driver images are tested. To reduce the over-fitting problem model should be fine- tuned on a much more diverse dataset.

Caffe

Build Status License

Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by the Berkeley Vision and Learning Center (BVLC) and community contributors.

Check out the project site for all the details like

and step-by-step examples.

Join the chat at https://gitter.im/BVLC/caffe

Please join the caffe-users group or gitter chat to ask questions and talk about methods and models. Framework development discussions and thorough bug reports are collected on Issues.

Happy brewing!

License and Citation

Caffe is released under the BSD 2-Clause license. The BVLC reference models are released for unrestricted use.

Please cite Caffe in your publications if it helps your research:

@article{jia2014caffe,
  Author = {Jia, Yangqing and Shelhamer, Evan and Donahue, Jeff and Karayev, Sergey and Long, Jonathan and Girshick, Ross and Guadarrama, Sergio and Darrell, Trevor},
  Journal = {arXiv preprint arXiv:1408.5093},
  Title = {Caffe: Convolutional Architecture for Fast Feature Embedding},
  Year = {2014}
}

driver-distraction-detection's People

Contributors

snehil1703 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.