OpenMetadata  by open-metadata

Unified semantic context platform for data and AI

Created 4 years ago
14,019 stars

Top 3.8% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

OpenMetadata provides a unified metadata knowledge graph to enrich AI and data users with context, semantics, and governance, moving beyond raw data access. It connects technical metadata, data quality, lineage, ownership, and business meaning, empowering AI assistants and humans to discover, understand, and trust enterprise data effectively.

How It Works

The platform integrates a Context Layer (technical metadata, quality, lineage, ownership) and a Semantics Layer (glossaries, metrics, classifications) into a unified knowledge graph. This graph connects data assets with people, policies, and semantics. AI assistants and agents interact via the MCP (Metadata-to-Context Protocol) server and Semantic Search, enabling natural language queries and automated actions. An AI SDK supports custom AI application development, providing AI with governed context and business meaning for safe data utilization.

Quick Start & Requirements

Explore via the sandbox or follow the installation guide. Key steps include ingesting metadata from 120+ connectors, building context (descriptions, ownership, quality, lineage), and adding semantics (glossaries, tags, metrics). Enable Semantic Search and connect an MCP client using the provided endpoint. Custom AI applications can be built using the AI SDK.

Highlighted Details

  • 120+ Connectors: Broad data source integration.
  • Column-Level Lineage: Granular lineage for precise AI reasoning.
  • MCP Server: Enables AI assistants/agents to query and act on the metadata graph via natural language.
  • Semantic Search: Facilitates discovery by meaning, surfacing conceptually related assets.
  • Open Metadata Standards: Foundation for interoperable metadata (schemas, RDF/OWL, SHACL).
  • AI SDK: Allows programmatic integration for custom AI applications.

Maintenance & Community

The project welcomes community contributions across schemas, connectors, ingestion, MCP tools, and documentation. A Slack community is available for engagement.

Licensing & Compatibility

Released under the Apache License, Version 2.0, permissive for commercial use and closed-source linking.

Limitations & Caveats

The README focuses on capabilities and AI integration. While mentioning managed enterprise features via "Collate," specific open-source limitations (e.g., self-management overhead) are not detailed.

Health Check
Last Commit

9 hours ago

Responsiveness

Inactive

Pull Requests (30d)
548
Issues (30d)
133
Star History
463 stars in the last 30 days

Explore Similar Projects

Starred by Chaoyu Yang Chaoyu Yang(Founder of Bento), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
3 more.

DB-GPT by eosphoros-ai

0.3%
19k
AI-native data app development framework with agentic workflow
Created 3 years ago
Updated 2 days ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), and
12 more.

minds-platform by mindsdb

0.1%
39k
AI query engine for federated data sources
Created 7 years ago
Updated 2 days ago
Feedback? Help us improve.