star_wars_object_detection's Introduction

Star Wars Object Detection

This repository contains the dataset and some of the configuration files used to train an object detector for R2-D2 and BB-8 from Star Wars with TensorFlow's Object Detection API.

The final model was fine-tuned from one of the models pre-trained on the COCO dataset (transfer learning).

You can read more here.

Some results

Here is a video showing the performance of the model.

The outcome seems acceptable considering the small size and low variability of the dataset used.

Trying the dataset

To try the dataset, first follow the installation instructions on the TensorFlow page. You can then train your own model with commands analogous to the ones used for the pet detector.

File description

Folders

  • annotations: XML files with the bounding boxes of each image, obtained using labelImg, and a text file assigning each image to one of the two available classes (R2-D2 and BB-8). The format is analogous to that of the Oxford-IIIT Pets Dataset.
  • graph_images: plots of the Total Loss of the trained model.
  • images: 203 JPG training images, 102 of R2-D2 and 101 of BB-8.
  • object_detection/data: the configuration file of the trained network, the label map, and the TensorFlow record files (created with create_sw_tf_record.py).
  • results: some images/gifs showing the performance of the trained model.
  • test_images: some images not used to train the model.
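
To illustrate what the annotation files hold, here is a minimal sketch of reading one of them. It assumes the XML follows the Pascal VOC layout that labelImg writes by default (an `object` element with a `name` and a `bndbox`). The sample annotation, its file name, and the `read_boxes` helper are hypothetical and are not taken from this repository.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the Pascal VOC layout (assumed, not copied from the repo).
SAMPLE_XML = """<annotation>
  <filename>R2-D2_001.jpg</filename>
  <size><width>640</width><height>480</height><depth>3</depth></size>
  <object>
    <name>R2-D2</name>
    <bndbox><xmin>100</xmin><ymin>50</ymin><xmax>300</xmax><ymax>400</ymax></bndbox>
  </object>
</annotation>"""


def read_boxes(xml_text):
    """Return (filename, [(label, xmin, ymin, xmax, ymax), ...]) for one annotation."""
    root = ET.fromstring(xml_text)
    filename = root.findtext("filename")
    boxes = []
    for obj in root.iter("object"):
        box = obj.find("bndbox")
        # Coordinates are pixel values; go through float() in case they are stored as "100.0".
        coords = tuple(
            int(float(box.findtext(tag))) for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"),) + coords)
    return filename, boxes


filename, boxes = read_boxes(SAMPLE_XML)
print(filename, boxes)
```

To read a real file instead of the sample string, pass the file's contents to `read_boxes`.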

Scripts

  • check_duplicate_images.py: script that checks for duplicate images within one directory or across two. Useful when collecting the training and test images.
  • create_sw_tf_record.py: modified version of create_pet_tf_record.py, where the paths have been changed along with the regular expression in line 60 to match my image names.
  • export_inference_graph.py: an exact copy of export_inference_graph.py from TensorFlow's API.
  • object_detection_sw.ipynb: IPython notebook to test the trained model on images and videos. Adapted from object_detection_tutorial.ipynb, with some lines of code for testing videos taken from priya-dwivedi's GitHub (I don't know if that's the original source).
  • rename_images.py: script to rename all image files in one directory so that they share the same root name.
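
As an illustration of the duplicate check described above, here is a minimal sketch that compares files by content hash. It is not necessarily how check_duplicate_images.py works. The function names `file_digest` and `find_duplicates` are hypothetical, and the approach only catches byte-identical files, so a resized or re-encoded copy of the same picture would not be detected.

```python
import hashlib
import os
import tempfile
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file's bytes."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(*directories):
    """Group files with identical content found in any of the given directories."""
    by_digest = defaultdict(list)
    for directory in directories:
        for name in sorted(os.listdir(directory)):
            path = os.path.join(directory, name)
            if os.path.isfile(path):
                by_digest[file_digest(path)].append(path)
    # Keep only the groups that contain more than one file.
    return [paths for paths in by_digest.values() if len(paths) > 1]


# Small self-contained demo using temporary directories and fake image bytes.
with tempfile.TemporaryDirectory() as train_dir, tempfile.TemporaryDirectory() as test_dir:
    for directory, name, content in [
        (train_dir, "a.jpg", b"same bytes"),
        (train_dir, "b.jpg", b"other bytes"),
        (test_dir, "copy.jpg", b"same bytes"),
    ]:
        with open(os.path.join(directory, name), "wb") as handle:
            handle.write(content)
    for group in find_duplicates(train_dir, test_dir):
        print([os.path.basename(path) for path in group])
```

Passing a single directory checks for duplicates within it; passing two also catches a test image that accidentally ended up in the training set.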
