SDK for integrating LLMs and AI agents into Apache Airflow pipelines
This SDK integrates Large Language Models (LLMs) and AI Agents into Apache Airflow pipelines, enabling users to orchestrate complex AI workflows using familiar Airflow decorators. It targets Airflow users and data engineers looking to leverage LLMs for tasks like text summarization, structured data extraction, and agent-based research directly within their scheduled data pipelines.
How It Works
The SDK extends Airflow's @task decorator with specialized versions: @task.llm for direct LLM calls, @task.agent for orchestrating AI agents with tools, and @task.llm_branch for LLM-driven DAG control flow. It leverages Pydantic AI for model support and automatic output parsing via type hints, including Pydantic models, simplifying the integration of LLM outputs into downstream tasks.
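The idea of parsing model output according to a function's return type hint can be illustrated with a small, offline sketch. This is not the SDK's implementation or API: the llm_task decorator, the fake_model function, and the Ticket dataclass are hypothetical names invented for illustration, and a standard-library dataclass stands in for a Pydantic model so the example runs without Airflow or any provider installed.

```python
import json
from dataclasses import dataclass, fields, is_dataclass
from functools import wraps
from typing import Callable, get_type_hints


def llm_task(model: Callable[[str, str], str], system_prompt: str):
    """Hypothetical stand-in for an LLM task decorator (not the SDK's API)."""

    def decorator(func):
        # The return annotation decides how the raw model output is parsed.
        output_type = get_type_hints(func).get("return", str)

        @wraps(func)
        def wrapper(*args, **kwargs):
            # The decorated function only builds the user prompt.
            user_prompt = func(*args, **kwargs)
            raw = model(system_prompt, user_prompt)
            if is_dataclass(output_type):
                # Structured output: decode JSON and keep only declared fields.
                data = json.loads(raw)
                names = {f.name for f in fields(output_type)}
                return output_type(**{k: v for k, v in data.items() if k in names})
            # Plain output: coerce the raw text to the annotated type.
            return output_type(raw)

        return wrapper

    return decorator


@dataclass
class Ticket:
    """Hypothetical structured result extracted from free text."""
    product: str
    severity: int


def fake_model(system_prompt: str, user_prompt: str) -> str:
    """Canned response so the sketch runs offline; a real model call would go here."""
    return '{"product": "scheduler", "severity": 2}'


@llm_task(model=fake_model, system_prompt="Extract the product and severity.")
def extract_ticket(text: str) -> Ticket:
    return f"Ticket text: {text}"


ticket = extract_ticket("The scheduler drops runs after the upgrade.")
print(ticket)
```

Because the wrapper returns a typed object rather than raw text, a downstream task can read ticket.severity directly, which is the kind of simplification the paragraph above describes.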
Quick Start & Requirements
Install with pip install airflow-ai-sdk[<provider>] (e.g., airflow-ai-sdk[openai,duckduckgo]).
To try the examples, use the ai-sdk-examples repository and run astro dev start.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The SDK relies on external LLM providers, which may incur costs and have rate limits. The slim version requires manual installation of LLM provider dependencies.