Git Product home page Git Product logo

caesar's Introduction

Caesar: Cross-Camera Complex Activity Detection

Alt Text

Code Structure

The workflow is shown in the image below. You will run one of the scripts to turn the device into a mobile/tracker/action/web node.

Workflow of Caesar Demo

  • mobile/ src code for object detection
  • tracker/ src code for tracking and ReID
  • server/ src code for action detection
  • web/ src code for web interface
  • network/ src code for I/O and RPC
  • config/ config file for the system setup
  • main_xxx.py the main script to run the node
  • checkpoints/ folder that contains the model files
  • data/ a test video is here
  • misc/ scripts for debugging/testing

Requirements

  • Python 3.X
  • OpenCV 3.X
  • TensorFlow 1.12+
  • Install other py packages using pip install -r requirements.txt

Run the Pipeline

  1. Clone the repo with
git clone https://github.com/USC-NSL/Caesar.git
  1. Prepare models (download the models to the checkpoints/ folder)
  • Object Detection (Option-1: MobileNet-V2, Option-2: MaskRCNN). The MaskRCNN model is slower but detects better for people in drone/surveillance camera. It requires additional setup. Please check its repo for details and dependency.
  • ReID (DeepSort)
  • Action Detection (ACAM). For ACAM-related setup, please look at its repo and make sure your environment can run its demo code.
  1. Update config files to your settings All the files you need to change are in config/ folder.
  • Modify label_mapping.txt to parse your object detector's output (key is the integer class id, value is the string object label). The default one is for SSD-MobileNetV2.
  • Modify camera_topology.txt to indicate the camera connectivity (see the inline comments in the file for detail)
  • Modify act_def.txt to define your complex activity using the syntax (see the inline comments in the file for detail)
  • Modify const_xxx.py to specify the model path, threshold, machine ip, etc., for this node (mobile, tracker, action, web). Read the inline comments for detail
  1. Run the node
  • For a mobile node, run python main_mobile.py (need GPU). If you are using a mobile GPU like Nvidia TX2, please set the board to the best-performance mode for max FPS. Here is a tutorial for TX2.
  • On a server: Run one of python main_tracker.py / python main_act.py / python main_web.py (the tracker and the act node need GPU).
  1. About the runtime
  • All the node will periodically ping their next hop for reconnection, so it doesn't matter what order you start/end these nodes.
  • Each node will save logs into a debug.log text file in the home dir
  • If you want to end one node, press Ctrl-C to exit, some node may need Ctrl-C second time to be fully ended.

Supported Action Vocabulary

You can add more DNN-generated single-tube actions, check FAQ for how to do.

  • Cross-tube actions: close, near, far, approach, leave, cross
  • Single-tube actions: start, end, move, stop, use_phone, carry, use_computer, give, talk, sit, with_bike, with_bag

Run Caesar Node by Node (Offline)

Instead of running all the nodes online, you may want to just run part of the workflow and check the intermediate results (like detection and tracknig). Therefore, you will need to follow these:

  • First, turn on the SAVE_DATA in xx_const.py so a node could save its results to an npy file under res/[main_node_name]
  • Then, you can run main_gt.py. This script will act as node that reads the raw videos and the npy data. It could render the data in frames (e.g. detections, track ids, actions) so you can see them without running the webserver. Moreover, it upload the data to the next running node. Parameters for main_gt.py can be changed in config/const_gt.py.

Example:

  • Run main_mobile.py only on several videos and get the data saved. Then you can config the npy data source to be the mobile's output for main_gt.py (speicify this in its config file config/const_gt.py). Moreover, you should make the tracker node's IP as the server address for main_gt.py.
  • Then, run main_tracker.py and main_gt.py. Now main_gt.py displays the mobile's detection result, and also uploads the data to the tracker node for tracking.
  • When finished, you can see the tracker's npy file in its own result folder. Set this file to be the source data path in main_gt.py and change its server address to main_act.py.
  • Now you can run main_gt.py again but as a tracker node, and use it to input to main_act.py for action results. You can repeat the process for debugging main_web.py in the last step.

FAQ

Q: Object detection on Nvidia TX2 is slow, how to make it faster?

  • A: Instead of using the default MobileNet model, you should use its optimized TensorRT version for TX2. Nvidia has a chain to squeeze the model so it runs much faster. Here is the tutorial for generating the TensorRT model. You can download ours here. The useage of TensorRT model is the same as regular model. With TensorRT, we increase the detection FPS from 4 to 10.5.

Q: I want to enable more DNN-detected single tube action, how to do that?

  • A: Go to config/const_action.py, add the action name and its detection threshold to the NN_ACT_THRES. Make sure that the action name is already in ACTION_STRINGS.

Q: How to run two mobile nodes on the same machine?

  • A: Duplicate config/const_mobile.py with a different name (e.g. const_mobile2.py) and edit it to match the settings for the second camera. Then make a copy of main_mobile.py to be main_mobile2.py. In the main file line ??, change the from config.const_mobile to be from config.const_mobile2.

Q: The end-to-end delay is long, how to optimize?

  • A: (1) Disable the web node and directly show the visualized result at act node. We will add this function later. (2) Initiate more action DNN instances. We will add this as an option so you can easily config.

If you have any other issues, please leave your questions in "issues"

caesar's People

Contributors

hyperchris avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.