Discover and explore top open-source AI tools and projects—updated daily.
nndeployMulti-platform inference deployment framework
Top 31.6% on SourcePulse
This framework simplifies high-performance, multi-platform inference deployment for AI models. It targets developers needing to deploy models across diverse hardware and operating systems, offering a unified codebase and efficient execution.
How It Works
The core approach models deployment as a Directed Acyclic Graph (DAG), where preprocessing, inference, and postprocessing are distinct nodes. This DAG structure, with its "graph-in-graph" capability, allows for modularity and efficient composition of complex multi-model pipelines. The framework emphasizes performance through various parallel execution modes (serial, pipeline, task, and combined) and resource management via thread and memory pools.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively developing and welcomes contributions. A WeChat group is available for community discussion.
Licensing & Compatibility
The project's licensing is not explicitly stated in the README, which may pose a compatibility concern for commercial or closed-source integration.
Limitations & Caveats
The framework is described as being in its development stage. Some features, such as the memory pool and high-performance operators, are still under development. Support for certain models (e.g., Stable Diffusion, QWen, SAM) is also noted as "in progress."
9 hours ago
1 day
volcengine
ByteDance-Seed
b4rtaz
Lightning-AI
pytorch
mindspore-ai
mlc-ai