aiges by iflytek

AI serving framework for deploying AI models/engines

created 5 years ago
252 stars

Top 99.7% on sourcepulse

Project Summary

AIGES is an AI serving framework loader designed to simplify the deployment and management of AI models and engines. It allows developers to quickly expose AI capabilities as RPC services, abstracting away infrastructure concerns and enabling seamless integration with the Athena Serving Framework for cloud-native deployment.

How It Works

AIGES acts as a universal wrapper for AI models and engines, supporting inference via C or Python code. It can expose these capabilities as both HTTP and gRPC services. The framework handles service registration, load balancing, and configuration management, allowing users to focus on model development rather than infrastructure. It supports both non-streaming and streaming inference requests.
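The wrapper idea described above can be pictured with a small Python sketch. This is not the AIGES API: the class `EchoWrapper` and its methods `init`, `infer`, `infer_stream`, and `fini` are hypothetical names chosen for illustration, and the "model" is a trivial uppercase transform. It only shows the general shape of a wrapper that is initialised once and then serves both non-streaming and streaming requests.

```python
from typing import Dict, Iterator


class EchoWrapper:
    """Hypothetical wrapper for illustration only; not the AIGES interface."""

    def __init__(self) -> None:
        self.ready = False
        self.prefix = ""

    def init(self, config: Dict[str, str]) -> None:
        # A real wrapper would load model weights here, once at startup.
        self.prefix = config.get("prefix", "")
        self.ready = True

    def infer(self, payload: str) -> str:
        # Non-streaming: one request in, one complete response out.
        if not self.ready:
            raise RuntimeError("wrapper not initialised")
        return self.prefix + payload.upper()

    def infer_stream(self, chunks: Iterator[str]) -> Iterator[str]:
        # Streaming: yield a partial result for each incoming chunk.
        if not self.ready:
            raise RuntimeError("wrapper not initialised")
        for chunk in chunks:
            yield self.prefix + chunk.upper()

    def fini(self) -> None:
        # Release resources when the service shuts down.
        self.ready = False


wrapper = EchoWrapper()
wrapper.init({"prefix": "> "})
result = wrapper.infer("hello")
parts = list(wrapper.infer_stream(iter(["a", "b"])))
wrapper.fini()
```

In a serving framework of this kind, the loader would own the lifecycle calls and the network layer (HTTP or gRPC), so the model author only fills in the inference methods.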

Quick Start & Requirements

  • Installation: pip3 install aiges==0.5.0
  • Prerequisites: Python 3.9+
  • Docker Image: docker run -itd --name mnist2 -p 1889:1888 public.ecr.aws/iflytek-open/aiges-gpu:10.1-1.17-3.9.13-ubuntu1804-v3.0-alpha11
  • Binary Download: Download from releases page.
  • Demo: ./AIservice -mnist (or manual download from aiges_demo repository).
  • Documentation: AI Service Engine Official Documentation
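Before installing, it may help to confirm the Python prerequisite and, after starting the container, to check that the published host port accepts connections. The sketch below uses only the standard library; the helper names `python_ok` and `port_open` are hypothetical, and the host port 1889 is taken from the `-p 1889:1888` mapping in the Docker command above. Whether the service inside the container actually listens on 1888 is an assumption based on that mapping.

```python
import socket
import sys


def python_ok(version=sys.version_info, minimum=(3, 9)) -> bool:
    # Compare only (major, minor) against the required minimum.
    return tuple(version[:2]) >= minimum


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    # Return True if a TCP connection to host:port succeeds within the timeout.
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("Python 3.9+ available:", python_ok())
    print("Host port 1889 reachable:", port_open("127.0.0.1", 1889))
```

A successful TCP connection only shows that something is listening on the port; it does not confirm that the service is healthy.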

Highlighted Details

  • Supports both Cgo and gRPC modes for Go and Python communication.
  • Provides configuration center and service discovery capabilities.
  • Offers load balancing configuration.
  • Includes built-in Swagger UI for API debugging.

Maintenance & Community

  • The project is developed by iFlytek; see the Health Check section for recent activity.
  • Contact information is available in the README for inquiries.

Licensing & Compatibility

  • The specific license is not explicitly stated in the README, but it appears to be an internal project with open-source components. Further clarification on licensing is recommended for commercial use.

Limitations & Caveats

  • The project is in an alpha state. According to the README, some features are incomplete or still under development, particularly HTTP interface integration and error handling for Python processes.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

  • 1 star in the last 90 days
