Test-Agent by codefuse-ai

Test agent for LLM-powered software testing

Created 1 year ago
625 stars

Top 52.9% on SourcePulse

Project Summary

Test-Agent is an open-source project aiming to integrate Large Language Models (LLMs) into software testing workflows, targeting quality assurance engineers and developers. It provides specialized LLM capabilities for test case generation and assertion completion, enhancing test suite quality and efficiency.

How It Works

Test-Agent leverages a fine-tuned LLM, TestGPT-7B, built upon CodeLlama-7B. This model is specifically trained for tasks like multi-language test case generation (Java, Python, JavaScript) and automatic assertion completion for existing test cases. The project also includes an engineering framework for rapid local deployment and interaction via a ChatBot interface, prioritizing data privacy and local execution.
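To make the assertion-completion task concrete, here is a small hand-written illustration in Python. It is not output from TestGPT-7B, and the placeholder comment, the `clamp` function, and the test names are all hypothetical; the model's actual prompt format is not described here.

```python
import unittest


def clamp(value, low, high):
    """Limit value to the closed interval [low, high]."""
    return max(low, min(value, high))


# Hypothetical input: an existing test whose assertion is missing.
# The placeholder comment is an assumption made for illustration only.
INCOMPLETE_TEST = '''
def test_value_above_range(self):
    result = clamp(15, 0, 10)
    # <assertion to be completed>
'''


class ClampTestCompleted(unittest.TestCase):
    """What a completed version of the test could look like."""

    def test_value_above_range(self):
        result = clamp(15, 0, 10)
        # The completed assertion pins down the expected behavior.
        self.assertEqual(result, 10)


def run_completed_test():
    """Run the completed test case and report whether it passed."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ClampTestCompleted)
    return unittest.TextTestRunner(verbosity=0).run(suite).wasSuccessful()
```

Test case generation is the broader variant of the same idea: the input would be the function under test alone, and the expected output a whole test method rather than a single assertion.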

Quick Start & Requirements

  • Install: Clone the repository, cd into it, and run pip install -r requirements.txt.
  • Prerequisites: Python >= 3.8, Transformers == 4.33.2. Requires ~14GB VRAM for GPU acceleration. Supports various devices including Apple Silicon (mps), Intel XPU (xpu), Huawei NPU (npu), or CPU.
  • Launch: Start the controller (python3 -m chat.server.controller), then a model worker (python3 -m chat.server.model_worker --model-path models/TestGPT-7B --device mps; replace mps with the device you are using), then the web service (python3 -m chat.server.gradio_testgpt). Access the UI at http://0.0.0.0:7860.
  • Models: Available on ModelScope and Hugging Face.
  • Docs: ModelScope TestGPT-7B, Hugging Face.
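The three launch commands above could be wrapped in a small helper script. The following is a minimal sketch, not part of the project: the function names `build_commands` and `launch_all` and the fixed start-up delay are assumptions, and it uses the current interpreter (`sys.executable`) where the Quick Start writes `python3`. Only the module names and flags are taken from the Quick Start.

```python
import subprocess
import sys
import time


def build_commands(model_path="models/TestGPT-7B", device="mps"):
    """Return the three Quick Start commands in the order they are started."""
    py = sys.executable  # assumption: same interpreter that has the requirements installed
    return [
        [py, "-m", "chat.server.controller"],
        [py, "-m", "chat.server.model_worker",
         "--model-path", model_path, "--device", device],
        [py, "-m", "chat.server.gradio_testgpt"],
    ]


def launch_all(commands, delay_s=5.0):
    """Start each command as a child process, pausing between starts.

    The pause is a crude, assumed way to let each service come up before
    the next one starts; it is not a readiness check.
    """
    procs = []
    for cmd in commands:
        procs.append(subprocess.Popen(cmd))
        time.sleep(delay_s)
    return procs
```

Calling `launch_all(build_commands(device="cpu"))` from the repository root would start all three services on CPU. The returned process handles can be used to stop them with `terminate()`.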

Highlighted Details

  • TestGPT-7B shows leading performance in Java, Python, and JavaScript test case generation (e.g., 48.6% Java pass@1) compared to other open-source models.
  • Achieves 71.1% pass@1 on Java assertion completion, with a reported strong-validation rate of 100%.
  • Offers a local, private deployment option to prevent data leakage.
  • Future plans include support for Go/C++, domain knowledge Q&A, test copilot features, and larger model variants (13B, 34B).
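For readers unfamiliar with pass@1, the sketch below shows the commonly used unbiased pass@k estimator, where n samples are generated per problem and c of them pass. Whether the figures above were computed exactly this way, and with how many samples, is an assumption; the summary does not say.

```python
from math import comb


def pass_at_k(n, c, k):
    """Estimate the probability that at least one of k samples passes.

    n: samples generated for a problem
    c: samples among them that pass
    k: number of samples the metric is allowed to draw (k <= n)
    """
    if n - c < k:
        # Fewer than k failing samples exist, so any k drawn include a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results, k):
    """Average pass@k over problems; results is a list of (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With k = 1 the estimator reduces to c / n, so pass@1 is the fraction of single-shot generations that pass, averaged over problems.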

Maintenance & Community

  • The project is built upon FastChat.
  • Community collaboration is encouraged for developing testing tools and intelligent assistants.

Licensing & Compatibility

  • The README does not state a license, although the project is described as open source. Confirm the license before commercial use or before linking from closed-source code.

Limitations & Caveats

The README describes the project as under active development, with further language support and features planned, but the last commit is a year old and the repository is marked inactive. Reported metrics cover specific tasks and languages only, so verify suitability for your own testing needs.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 4 stars in the last 30 days
