toolfront by kruskal-labs

AI agents for big data

created 1 month ago
284 stars

Top 93.1% on sourcepulse

Project Summary

ToolFront provides an end-to-end solution for connecting AI agents to diverse data sources, including databases, APIs, and documents, enabling them to query and understand large datasets. It targets developers and researchers seeking to enhance AI accuracy and relevance by allowing models to discover and learn from data on the fly, offering a more scalable and cost-effective alternative to fine-tuning.

How It Works

ToolFront uses the Model Context Protocol (MCP) to bridge AI models and external data. It dynamically discovers and indexes data sources and exposes them to AI agents through a set of predefined tools. Its core innovation is the continuous evaluation and learning (CE/CL) API, which uses in-context learning to augment the LLM's context with relevant data samples, so agents can improve over time without traditional retraining.
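The in-context learning idea described above can be illustrated with a minimal sketch: pick the stored samples most similar to a new question and prepend them to the prompt. This is not ToolFront's implementation. The `Sample` type, the word-overlap ranking, and the prompt layout are all assumptions made for illustration; a real system would likely use a stronger similarity measure.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """A past question paired with the query that answered it (hypothetical type)."""
    question: str
    query: str


def tokenize(text: str) -> set[str]:
    # Lowercased word set; a simple stand-in for a real similarity measure.
    return set(text.lower().split())


def select_samples(question: str, pool: list[Sample], k: int = 2) -> list[Sample]:
    # Rank stored samples by word overlap with the new question, keep the top k.
    q = tokenize(question)
    ranked = sorted(pool, key=lambda s: len(q & tokenize(s.question)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, pool: list[Sample], k: int = 2) -> str:
    # Prepend the selected samples so the model sees them as worked examples.
    parts = [f"Q: {s.question}\nSQL: {s.query}\n" for s in select_samples(question, pool, k)]
    parts.append(f"Q: {question}\nSQL:")
    return "\n".join(parts)


# Illustrative sample pool; table and column names are made up.
POOL = [
    Sample("total revenue by month", "SELECT month, SUM(amount) FROM orders GROUP BY month"),
    Sample("list active users", "SELECT id FROM users WHERE active"),
    Sample("total revenue by region", "SELECT region, SUM(amount) FROM orders GROUP BY region"),
]

prompt = build_prompt("total revenue by country", POOL)
print(prompt)
```

The model's weights never change in this scheme; only the prompt grows, which is why no retraining step is involved.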

Quick Start & Requirements

  • Installation: Run via uvx toolfront[all] or docker run antidmg/toolfront.
  • Prerequisites: uv or Docker. Requires URLs for databases and APIs. An optional API key is needed for the CE/CL API.
  • Setup: Connects to various databases (PostgreSQL, MySQL, BigQuery, Snowflake, etc.), APIs with OpenAPI/Swagger specs, and local document directories.
  • Documentation: Quick Install, Databases, APIs, Documents.
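As a rough sketch of how the `uvx` command above might be registered with an MCP client, the snippet below builds a server entry and prints it as JSON. The `mcpServers` layout follows a convention used by several MCP clients. Passing the data source URLs as positional arguments is an assumption, and both URLs are placeholders, so check the project's documentation for the exact form before relying on it.

```python
import json


def mcp_server_entry(source_urls: list[str]) -> dict:
    # Assumption: data source URLs follow the package spec as positional
    # arguments. The project's docs are the authority on the real form.
    if not source_urls:
        raise ValueError("at least one database or API URL is required")
    return {"command": "uvx", "args": ["toolfront[all]", *source_urls]}


config = {
    "mcpServers": {
        "toolfront": mcp_server_entry(
            [
                "postgresql://user:pass@localhost:5432/analytics",  # placeholder URL
                "https://api.example.com/openapi.json",  # placeholder URL
            ]
        )
    }
}

print(json.dumps(config, indent=2))
```

Keeping the entry in a small helper makes it easy to generate one configuration per environment without hand-editing JSON.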

Highlighted Details

  • Supports multi-database and multi-API integration within a single interface.
  • Features a "local-first" architecture, keeping data and operations on the user's machine for privacy and cost efficiency.
  • Offers a CE/CL API for AI agents to self-improve through in-context learning.
  • Provides tools for discovering, searching, inspecting, and querying data sources.
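To make the discover, search, inspect, and query tools listed above more concrete, here is a minimal sketch of what such tools could look like over a local SQLite database. The function names and behavior are hypothetical and are not ToolFront's actual tool set; only Python's standard `sqlite3` module is used.

```python
import sqlite3


def discover(conn: sqlite3.Connection) -> list[str]:
    # List the tables in the database.
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [r[0] for r in rows]


def search(conn: sqlite3.Connection, pattern: str) -> list[str]:
    # Find tables whose name contains the pattern (case-insensitive).
    return [t for t in discover(conn) if pattern.lower() in t.lower()]


def inspect(conn: sqlite3.Connection, table: str) -> list[tuple[str, str]]:
    # Return (column name, declared type) pairs for a known table.
    if table not in discover(conn):
        raise ValueError(f"unknown table: {table}")
    return [(r[1], r[2]) for r in conn.execute(f'PRAGMA table_info("{table}")')]


def query(conn: sqlite3.Connection, sql: str) -> list[tuple]:
    # Crude guard: accept only statements that start with SELECT.
    if not sql.lstrip().lower().startswith("select"):
        raise ValueError("only SELECT statements are allowed")
    return conn.execute(sql).fetchall()


# Illustrative in-memory database; schema and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.execute("CREATE TABLE users (id INTEGER, name TEXT)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 10.0), (2, 20.0)])

print(discover(conn))
print(inspect(conn, "orders"))
print(query(conn, "SELECT SUM(amount) FROM orders"))
```

An agent would typically call these in order: discover or search to find candidate tables, inspect to learn the schema, then query once it knows the column names.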

Maintenance & Community

  • Active community support via Discord.
  • Bug reporting and feature requests managed through GitHub Issues.
  • Links: Discord, X.

Licensing & Compatibility

  • Released under the MIT License, permitting commercial use and modification.

Limitations & Caveats

The CE/CL API requires an API key and sends queries and request syntax to an external service, although the project states that data and secrets remain local. The CE/CL feature is currently in beta.

Health Check

  • Last commit: 2 days ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 12
  • Issues (30d): 8

Star History

284 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Elie Bursztein (Cybersecurity Lead at Google DeepMind), and 7 more.

mindsdb by mindsdb

0.5%
35k
AI query engine for federated data sources
created 7 years ago
updated 1 day ago