awesome-open-data-annotation  by zenml-io

Curated list of open-source data annotation/labeling tools

created 3 years ago
626 stars

Top 53.7% on sourcepulse

GitHubView on GitHub
Project Summary

This repository is a curated list of open-source data annotation and labeling tools, categorized by data modality (text, images, audio, video, time series, multi-modal). It aims to help machine learning practitioners discover and evaluate tools that fit their MLOps workflows, particularly for data-centric approaches.

How It Works

The project functions as a community-driven directory, compiling tools based on three core criteria: open-source license, active maintenance, and fitness for purpose. It provides a structured overview of available tools, facilitating discovery and comparison for users involved in data annotation and labeling.

Quick Start & Requirements

This is a curated list, not a software package. To use the tools, refer to their individual project pages.

Highlighted Details

  • Comprehensive coverage across multiple data types including text, images, audio, video, time series, and multi-modal data.
  • Tools range from simple Jupyter notebook widgets to full-fledged web platforms.
  • Licenses vary, including permissive (MIT, Apache-2, BSD) and copyleft (GPL, AGPL) options.
  • Includes tools with AI-assisted labeling capabilities.

Maintenance & Community

The list is maintained by ZenML and welcomes community contributions via Pull Requests. Users are encouraged to join the ZenML Slack for discussions and potential collaborations on MLOps integrations.

Licensing & Compatibility

The repository itself is not licensed as software. The tools listed have various licenses, including Apache-2, MIT, BSD, GPL-3, AGPL-3, ELv2, Custom, and Unknown. Compatibility for commercial use depends on the specific license of each tool.

Limitations & Caveats

The list's quality and completeness depend on community contributions. Some tools have "Unknown" or "N/A" licenses, and the "active maintenance" status may vary. The "Description" field is brief, requiring users to visit individual project pages for detailed functionality.

Health Check
Last commit

2 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
42 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.