Git Product home page Git Product logo

data-labeling-platforms-comparison's Introduction

Data Labeling Platforms Comparison

Data labeling is a hot topic as a recent report from AI research and advisory firm Cognilytica found that over 80% of the time enterprises spend on AI projects goes toward preparing, cleaning and labeling data.

A new category of tools has emerged, called data labeling platforms, training data management platforms. Three years ago, this market did not exist. Today, the offer is plentiful and it is sometimes complicated to find one's way.

A good annotation tool must meet three needs:

  • annotate quickly (productivity of annotators)
  • annotate well (control the quality / consistency of annotations)
  • annotate the right data (source diversity in the dataset)

Here is an attempt to do a fair comparison of data labeling / training data platforms.

I tried to be as accurate as possible, based on my experience with those solutions and their documentation. If you see any mistake (this is likely), please create an issue / push a PR.

Table of Content

Tasks / Tools by Asset Type

Image

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Bounding Box
  • Bounding Box
  • Polygon
  • Polyline
  • Pose estimation
  • Point
  • Segmentation
  • Vector Segmentation
  • Image transcription / OCR
  • Object relation
  • Satellite imagery support
  • High-resolution support
  • DICOM support
Classif. Hierarchical classif. Bounding Box Polygon Polyline Pose estimation Point
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
Seg. Vector Seg. Image transcription / OCR Object relation Satellite High-resolution support DICOM support
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Text

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Named entities recognition
  • Relations extraction
Classification Hierarchical classification Named entities recognition Relations extraction
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

PDF

The criteria are as follows:

  • PDF document processing
  • Bounding Box on PDF
  • Object relation
PDF document processing Bounding Box on PDF Object relation
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Video

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Native Video/Frame
  • Classification
  • Object detection
  • Transcription
Classif. Hierarchical classif. Native Video/Frame Classification Object detection Transcription
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Audio

The criteria are as follows:

  • Voice transcription / Speech to text
  • Diarization
  • Ultrasound support
Voice transcription / Speech to text Diarization Ultrasound support
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Timeserie

The criteria are as follows:

  • Classification
  • Range detection
Classification Range detection
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Productivity Management Features

The criteria are as follows:

  • Interface Customization
  • Asset preloading
  • Keyboard shortcuts / Hotkeys
  • Workload distribution
  • Autosave
  • Labeler productivity dashboard
Interface Customization Asset preloading Keyboard shortcuts / Hotkeys Workload distribution Autosave Labeler productivity dashboard
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Quality Management Features

The criteria are as follows:

  • Issue management
  • Consensus Image
  • Consensus Video
  • Consensus Text
  • Consensus Audio
Issue management Consensus Image Consensus Video Consensus Text Consensus Audio
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Dataset Management Features

The criteria are as follows:

  • Data hub
  • Version control
  • Public project
  • ML export formats
Data hub Version control Public project ML export formats
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

API

How to programmatically interact with the tool. The criteria are as follows:

  • GraphQL API
  • Messaging bus API
  • CLI SDK
  • Python SDK
GraphQL API Messaging bus API CLI SDK Python SDK
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Deployment

The different options to deploy the tool. The criteria are as follows:

  • On-premise data
  • On-premise deployment
  • Air gap deployment
On-premise data On-premise deployment Air gap deployment
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Automation

The offering in terms of labeling automation. The criteria are as follows:

  • Model-assisted labeling
  • Model based preannotation
  • Queue prioritisation
  • Model Training
  • Model Inference
Model-assisted labeling Model based preannotation Queue prioritisation Model Training Model Inference
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Workforce

Availability of workforce to help labeling. The criteria are as follows:

  • Human in the loop
  • External workforce
  • On-premise external workforce
Human in the loop External workforce On-premise external workforce
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Security

Security certifications. The criteria are as follows:

  • HIPAA Compliance
  • SOC 2 Compliance
HIPAA Compliance SOC 2 Compliance
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

data-labeling-platforms-comparison's People

Contributors

edarchimbaud avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.