agentlego  by InternLM

Tool API library for enhancing LLM agents

Created 2 years ago
400 stars

Top 72.3% on SourcePulse

GitHubView on GitHub
Project Summary

AgentLego is an open-source library designed to enhance Large Language Model (LLM) agents by providing a rich set of versatile tool APIs. It targets developers building LLM-powered agents, offering multimodal capabilities and simplifying tool integration. The library aims to extend agent functionality beyond text processing into areas like visual perception, image generation, and speech processing.

How It Works

AgentLego provides a flexible tool interface that allows users to easily extend custom tools with arbitrary argument and output types. It supports integration with popular LLM agent frameworks such as LangChain, Transformers Agents, and Lagent. The library also enables tool serving and remote access, which is particularly beneficial for tools requiring significant computational resources like large machine learning models or specific hardware environments (e.g., GPU, CUDA).

Quick Start & Requirements

Highlighted Details

  • Supports a wide range of tools including image description, OCR, VQA, object detection, segmentation, text-to-image generation, and speech processing.
  • Includes advanced capabilities like ControlNet for image generation based on edge maps, depth, or pose, and ImageBind for cross-modal generation.
  • Offers tools for image editing such as expansion, object removal, and replacement.
  • Facilitates direct tool usage via a simple Python API for testing and integration.

Maintenance & Community

The project is developed by InternLM. Further community and roadmap information is not explicitly detailed in the README.

Licensing & Compatibility

  • License: Apache 2.0.
  • Users must also comply with the licenses of the underlying models used by the tools.

Limitations & Caveats

Some tools require specific dependencies that need manual installation, potentially increasing setup complexity. The README does not detail specific hardware requirements beyond mentioning GPU/CUDA support for certain tools.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
4 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), Jeremy Howard Jeremy Howard(Cofounder of fast.ai), and
8 more.

Qwen-Agent by QwenLM

1.4%
12k
Agent framework for LLM application development
Created 2 years ago
Updated 1 month ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
20 more.

dify by langgenius

0.5%
114k
Open-source LLM app development platform
Created 2 years ago
Updated 14 hours ago
Feedback? Help us improve.