Discover and explore top open-source AI tools and projects—updated daily.
Funnel any file into LLM context
Top 93.6% on SourcePulse
This library provides a unified interface for processing various file types into a format suitable for Large Language Models (LLMs), extracting both text and images. It aims to simplify the integration of diverse data sources into LLM workflows for developers and researchers.
How It Works
The core of the library is the Attachments
class, which acts as a central "funnel." It accepts file paths or URLs and employs a pipeline of loaders, modifiers, presenters, refiners, and adapters. This modular design allows for extensibility, enabling users to contribute custom processing steps for new file formats or LLM APIs. The library supports a Domain Specific Language (DSL) for advanced customization of the processing pipeline.
Quick Start & Requirements
pip install attachments
pip install attachments[office]
for Microsoft Office formats, pip install attachments[browser]
for advanced web scraping with visual highlighting.Highlighted Details
Maintenance & Community
The project is actively developed, with an alpha version available for testing new features. Users are encouraged to contribute via GitHub Issues or Pull Requests.
Licensing & Compatibility
The library is released under the MIT License, permitting commercial use and integration with closed-source projects.
Limitations & Caveats
The project is currently in alpha, indicating potential for bugs and breaking changes. While extensive, the list of supported formats and adapters is still growing, with plans for audio, video, and more cloud service integrations.
3 days ago
Inactive