awslabs
Framework for testing generative AI virtual agents
Top 87.9% on SourcePulse
This framework provides a generative AI-powered system for testing virtual agents, specifically targeting those built with AWS services like Amazon Bedrock, Amazon Q Business, and Amazon SageMaker. It enables automated, multi-turn conversational testing and evaluation, aiming to expedite delivery and maintain agent stability within CI/CD pipelines.
How It Works
The core of the framework is an LLM-based evaluator agent that orchestrates conversations with a target agent. It evaluates responses during these multi-turn dialogues, offering built-in support for popular AWS AI services and allowing integration of custom agents. Hooks can be defined for additional tasks like integration testing.
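The evaluator-driven loop described above can be sketched roughly as follows. This is a minimal illustration, not the framework's actual API: `TargetAgent`, `CannedOrderAgent`, `EvalResult`, `run_test`, and `keyword_judge` are hypothetical names, and a plain keyword check stands in for the LLM-based evaluator so the example runs without any AWS credentials.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Protocol, Tuple

Transcript = List[Tuple[str, str]]  # (user message, agent reply) pairs


class TargetAgent(Protocol):
    """Hypothetical interface for the agent under test."""

    def invoke(self, prompt: str) -> str: ...


class CannedOrderAgent:
    """Stand-in custom agent; a real target would call an AWS service."""

    def invoke(self, prompt: str) -> str:
        if "order" in prompt.lower():
            return "Your order 123 has shipped."
        return "How can I help you?"


@dataclass
class EvalResult:
    passed: bool
    transcript: Transcript = field(default_factory=list)


def keyword_judge(transcript: Transcript, expected: str) -> bool:
    """Assumption: a keyword match replaces the LLM evaluator's judgment."""
    return expected.lower() in transcript[-1][1].lower()


def run_test(
    target: TargetAgent,
    steps: List[str],
    expected: str,
    judge: Callable[[Transcript, str], bool],
    before: Optional[Callable[[], None]] = None,
    after: Optional[Callable[[EvalResult], None]] = None,
) -> EvalResult:
    """Drive a multi-turn conversation, then evaluate the final reply."""
    if before is not None:
        before()  # hook: e.g. seed test data before the conversation
    transcript: Transcript = []
    for step in steps:
        reply = target.invoke(step)
        transcript.append((step, reply))
    result = EvalResult(passed=judge(transcript, expected), transcript=transcript)
    if after is not None:
        after(result)  # hook: e.g. integration checks or cleanup
    return result


if __name__ == "__main__":
    outcome = run_test(
        CannedOrderAgent(),
        steps=["Hi", "Where is my order?"],
        expected="shipped",
        judge=keyword_judge,
    )
    print(outcome.passed, len(outcome.transcript))
```

In this sketch, swapping in a different target only requires an object with an `invoke` method, which mirrors the idea of bringing a custom agent. The `before` and `after` callables show where hooks for tasks such as integration testing could attach.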
Quick Start & Requirements
pip install agent-evaluation
Highlighted Details
Maintenance & Community
CONTRIBUTING.md.
Licensing & Compatibility
Limitations & Caveats
The framework is designed primarily for agents built on AWS services. Custom agents can be integrated, but the core tooling is heavily oriented toward the AWS ecosystem.