

DataGym.ai

DataGym.ai is a modern, web-based workbench for labeling images and videos. It lets you manage your projects and datasets, label data, control quality, and build your own training data pipeline. With DataGym.ai's API and Python SDK you can integrate it into your toolchain.

DataGym.ai Workspace

📒 Resources

🧩 Features

  • Organize your data into different projects with tasks
    • Dashboard with useful statistics / overview
    • Task lifecycle with states (backlog, waiting, in progress, completed, skipped, reviewed)
    • Pagination, Filtering and Search
    • Integrated quality control / review process
  • Organize your media within datasets
    • Different storage types (direct upload, public URLs, AWS S3 cloud storage)
    • Supported MIME types: jpeg, png, mp4
    • Support of large high resolution images
  • Labeling features
    • Global classifications (image wide)
    • Image annotation
      • Variety of geometries: point, line, bounding box, polygons
      • Different classification types: text, checklists, option-box
      • Supports nested geometries (child-geometries)
    • Video annotation: Specialized editor for video labeling
      • Frame-by-frame navigation
      • Linear interpolation to track objects
      • Adjustable playback-speed
      • Analyze and extract video metadata (codec, frame rate, duration, ...)
    • Image segmentation
      • Bitmap export
  • Feature-rich Workspace
    • Temporary screen manipulations: contrast, brightness, saturation
    • Hide unused geometry-groups for more clarity
    • Shortcut support
    • Panning and zooming, multi-select, moving, duplication
    • Supports transformation of the same geometry type
    • Context menu for geometries
  • Powerful REST API to build your own workflows
    • Python SDK Package
  • Data export and import (JSON)
    • Export your labeled data as JSON (works for images and videos)
    • Import your labeled data to refine your ML model
    • Export and import your label configuration and use it in multiple projects
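
The "linear interpolation to track objects" feature listed under video annotation can be illustrated with a minimal sketch. This is not DataGym.ai's implementation; the `Box` type and the `interpolate_box` function are hypothetical names chosen for this example, and it assumes a bounding box is annotated on two keyframes and should be estimated for the frames in between.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in pixel coordinates (hypothetical type)."""
    x: float
    y: float
    width: float
    height: float


def interpolate_box(start_frame: int, start: Box,
                    end_frame: int, end: Box, frame: int) -> Box:
    """Linearly interpolate a box between two annotated keyframes."""
    if end_frame <= start_frame:
        raise ValueError("end_frame must be greater than start_frame")
    if not start_frame <= frame <= end_frame:
        raise ValueError("frame must lie between the two keyframes")

    # Fraction of the way from the first keyframe to the second (0.0 to 1.0).
    t = (frame - start_frame) / (end_frame - start_frame)

    def lerp(a: float, b: float) -> float:
        return a + (b - a) * t

    return Box(
        x=lerp(start.x, end.x),
        y=lerp(start.y, end.y),
        width=lerp(start.width, end.width),
        height=lerp(start.height, end.height),
    )
```

With keyframes at frames 0 and 10, calling `interpolate_box` for frame 5 yields a box halfway between the two annotations. Linear interpolation only fits objects that move at roughly constant speed between keyframes, so faster or curved motion needs more keyframes.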

🎯 Quickstart

Running with docker-compose

The simplest way to run DataGym.ai locally is by using docker-compose.

  1. Download the docker-compose.yml from the project's root directory
  2. Launch the containers using docker-compose up -d
  3. Wait until the initialization is done
  4. Navigate to localhost:8080
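
The "wait until the initialization is done" step can be automated with a small polling script. The sketch below uses only the Python standard library; the function names, the timeout, and the polling interval are assumptions for illustration, not part of DataGym.ai. It also assumes that an HTTP response on port 8080 means the application is up, which only shows that the web server is listening.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_probe(url: str) -> bool:
    """Return True if the URL answers with any HTTP response."""
    try:
        with urllib.request.urlopen(url, timeout=5):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, reset, or timed out: not ready yet.
        return False


def wait_until_ready(url: str,
                     timeout: float = 300.0,
                     interval: float = 5.0,
                     probe: Callable[[str], bool] = http_probe,
                     sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll the URL until the probe succeeds or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        if probe(url):
            return True
        if time.monotonic() >= deadline:
            return False
        sleep(interval)
```

After running docker-compose up -d, calling `wait_until_ready("http://localhost:8080")` returns `True` once the server responds, or `False` if nothing answers within the timeout. The `probe` and `sleep` parameters are injectable so the loop can be exercised without a running container.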

Local development, build manually

Build the whole project:

mvn clean install 

🗳️ Built with

  • Java / Spring Boot
  • Angular

👐 Contributing

We would love to receive contributions - please review our Contributing Guide for all relevant details.

📜 License

This project is licensed under the MIT License. See the LICENSE file for details.
