ONNX tool for parsing, editing, and profiling models
Top 68.4% on sourcepulse
This tool provides a comprehensive suite for parsing, editing, and profiling ONNX models, targeting AI researchers and developers working with large language models, diffusion models, and computer vision applications. It aims to simplify model optimization, memory compression, and performance analysis.
How It Works
The tool operates by leveraging a compute graph and shape engine to perform static analysis and transformations on ONNX models. It supports operations like constant folding and operator fusion for optimization. For profiling, it relies on shape inference to accurately calculate MACs and memory usage, enabling detailed performance analysis and memory compression techniques for activations and weights.
Quick Start & Requirements
pip install onnx-tool
or pip install --upgrade git+https://github.com/ThanatosShinji/onnx-tool.git
pip install onnx==1.8.1
first.benchmark/
and data/
directories.Highlighted Details
Maintenance & Community
No specific community channels (Discord/Slack) or notable contributors are listed in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The tool does not support ONNX Loop or Sequence types. Dynamic input shapes can affect MACs calculations, with specific input shapes for benchmark results provided in data/public/config.py
.
1 day ago
1 day