ncnn by Tencent

Mobile-first inference framework for neural networks

created 8 years ago
21,846 stars

Top 1.9% on sourcepulse

Project Summary

ncnn is a high-performance neural network inference framework optimized for mobile platforms. It targets developers deploying deep learning models in AI-powered mobile applications, and claims significant speed advantages over other open-source frameworks on mobile CPUs.

How It Works

ncnn is a pure C++ implementation with no third-party dependencies, prioritizing minimal footprint and maximum performance. It achieves this through ARM NEON assembly-level optimizations, sophisticated memory management, and multi-core parallel processing. The framework supports GPU acceleration via Vulkan and offers extensibility for custom layers and model quantization.

Quick Start & Requirements

  • Installation: Build from source or download pre-built binaries for various platforms. Detailed build instructions are available for Linux, Windows, macOS, Android, iOS, WebAssembly, and embedded systems.
  • Prerequisites: a C++ compiler and CMake. Specific builds may additionally require the Vulkan SDK, Xcode, or the Android NDK.
  • Resources: Minimal memory footprint. Build times vary by platform.
  • Links: How to build ncnn, Releases
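For orientation, a from-source build on Linux follows the standard CMake flow. Exact options vary by version (the `NCNN_VULKAN` flag shown here enables the Vulkan GPU backend), so treat this as a sketch and defer to the "How to build ncnn" docs linked above.

```shell
# Clone the repo and initialize submodules (needed for the Vulkan backend).
git clone https://github.com/Tencent/ncnn.git
cd ncnn
git submodule update --init

# Configure and build in Release mode. NCNN_VULKAN=ON requires the
# Vulkan SDK to be installed; omit it for a CPU-only build.
mkdir -p build && cd build
cmake -DCMAKE_BUILD_TYPE=Release -DNCNN_VULKAN=ON ..
make -j"$(nproc)"
```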

Highlighted Details

  • Claims to be faster than all known open-source frameworks on mobile CPUs.
  • Supports a wide range of CNN architectures including VGG, ResNet, MobileNet, YOLO (v2-v8), and more.
  • Offers GPU acceleration via Vulkan API.
  • Supports importing models from Caffe, PyTorch, ONNX, Darknet, Keras, and TensorFlow (via MLIR).
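As one illustration of the import path, an ONNX model is typically converted offline into ncnn's param/bin format using a converter tool built alongside ncnn. Tool names and flags can differ by version, so this is a hedged sketch rather than a canonical recipe.

```shell
# Simplify the ONNX graph first (optional, but commonly recommended
# so the converter sees fewer exotic ops), then convert to ncnn's
# text graph (.param) plus weights (.bin).
python -m onnxsim model.onnx model-sim.onnx
./onnx2ncnn model-sim.onnx model.param model.bin
```

For PyTorch models, the newer pnnx converter is generally the recommended route; see the project docs for the current tooling.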

Maintenance & Community

  • Actively used in Tencent applications (QQ, WeChat, etc.).
  • Community channels include QQ groups and a Discord channel.

Licensing & Compatibility

  • License: BSD 3-Clause.
  • Permissive license allows for commercial use and integration into closed-source applications.

Limitations & Caveats

The platform support matrix indicates that while many platforms are supported, performance ("speed") may not be optimal for all configurations, particularly for certain GPU types on macOS and Windows. Some ARM-specific platforms are marked as "shall work, not confirmed."

Health Check

  • Last commit: 1 day ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 55
  • Issues (30d): 32

Star History

  • 488 stars in the last 90 days

Explore Similar Projects

Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Georgios Konstantopoulos (CTO, General Partner at Paradigm), and 2 more.

gpu.cpp by AnswerDotAI

Top 0.2% on sourcepulse
4k stars
C++ library for portable GPU computation using WebGPU
created 1 year ago
updated 2 weeks ago
Starred by Bojan Tunguz (AI Scientist; Formerly at NVIDIA), Mckay Wrigley (Founder of Takeoff AI), and 8 more.

ggml by ggml-org

Top 0.3% on sourcepulse
13k stars
Tensor library for machine learning
created 2 years ago
updated 3 days ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Nat Friedman (Former CEO of GitHub), and 32 more.

llama.cpp by ggml-org

Top 0.4% on sourcepulse
84k stars
C/C++ library for local LLM inference
created 2 years ago
updated 17 hours ago