towhee  by towhee-io

Framework for neural data processing pipelines

Created 4 years ago
3,423 stars

Top 14.1% on SourcePulse

GitHubView on GitHub
Project Summary

Towhee is a Python framework designed to simplify and accelerate neural data processing pipelines for unstructured data like text, images, audio, and video. It targets developers and researchers who need to extract insights from diverse data types, enabling the creation of LLM-based pipelines with state-of-the-art deep learning models and efficient deployment.

How It Works

Towhee utilizes a modular approach with "Operators" (pre-built models and functions) that are chained together to form "Pipelines." These pipelines are defined using a Pythonic, method-chaining API, allowing for intuitive prototyping and complex DAG construction. The framework supports multi-modal data and LLM orchestration, including local model hosting and prompt management. For high-performance execution, it leverages the Triton Inference Server, enabling efficient model serving on CPU/GPU via backends like TensorRT and ONNX.

Quick Start & Requirements

Highlighted Details

  • Supports over 140 SOTA models across CV, NLP, multimodal, audio, and medical domains.
  • Offers pre-built ETL pipelines for tasks like RAG, text-image search, and video deduplication.
  • Enables transformation of Python pipelines into high-performance Docker containers.
  • Integrates with vector databases like Milvus for embedding storage.

Maintenance & Community

  • Active community contributions are encouraged via issues, documentation, and code.
  • Special thanks to contributors listed in the README.

Licensing & Compatibility

  • The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project's license is not clearly stated in the README, which may pose a risk for commercial adoption or integration into closed-source projects.

Health Check
Last Commit

11 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
16 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Luis Capelo Luis Capelo(Cofounder of Lightning AI), and
3 more.

LitServe by Lightning-AI

0.3%
4k
AI inference pipeline framework
Created 1 year ago
Updated 1 day ago
Starred by Clement Delangue Clement Delangue(Cofounder of Hugging Face), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
26 more.

datasets by huggingface

0.1%
21k
Access and process large AI datasets efficiently
Created 5 years ago
Updated 1 day ago
Feedback? Help us improve.