Git Product home page Git Product logo

object-detection's Introduction

Object Detection with YOLOS


Banner

Object detection is a computer vision technique that identifies and classifies a particular object in a particular setting. The main goal of object detection is to scan digital images or real-life scenarios to locate instances of every object, separate them, and analyze their necessary features for real-time predictions. Object detection algorithms typically leverage machine learning or deep learning to produce meaningful results. Object detection is used in various domains, such as image annotation, vehicle counting, activity recognition, face detection, face recognition, video object co-segmentation, etc.

  • In this project, I aim to demonstrate how to use YOLOS models for object detection on images using Gradio, a Python library that allows us to build and share web applications for our machine learning models or data science workflows.

  • I also used some predefined YOLOS models from the Hugging Face hub as our data sources. We use gr.components.Dropdown() to create a dropdown component that can let users choose from different YOLOS models available on the hub.

  • The models used are:

    • yolos-tiny
    • yolos-small
    • yolos-base
    • yolos-small-300
    • yolos-small-dwr
  • These models have different sizes and performance trade-offs. You can find more details about these models on their respective pages on the hub.


Installation

  • To run this project, you need to install some dependencies:

    • Python 3
    • PyTorch
    • Transformers
    • Gradio
    • OpenCV
    • Numpy
  • You can install these dependencies using pip:

    pip install -r requirements.txt


Usage

  • To run this project, you need to execute the app.py script: python main.py

Examples

  • Here are some examples of object detection using different YOLOS models and inputs:

Girl in a jacket


Girl in a jacket

The application can be accessed here: MyApp


Report

  • The project report is available here. It provides a detailed explanation of the methodologies used, results obtained, and conclusions drawn from this project.

Future Work

  • Adding more features and functionalities to our web application, such as downloading, sharing, or editing the results.
  • Evaluating the performance and quality of our web application using metrics and feedback.
  • Adding more YOLOS models from the Hugging Face hub or other sources.
  • Adding more input and output components from Gradio or other libraries.

License

  • This project is licensed under the MIT License. ๐Ÿ“

object-detection's People

Contributors

zaheer-10 avatar

Watchers

 avatar  avatar

Forkers

lighterbund

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.