ML profiling and performance analysis tool
Top 72.8% on sourcepulse
This tool provides a profiling and performance analysis suite for JAX, TensorFlow, and PyTorch/XLA, targeting ML engineers and researchers. It helps users understand, debug, and optimize model performance across CPUs, GPUs, and TPUs through detailed visualizations and breakdowns.
How It Works
The profiler integrates as a TensorBoard plugin, offering several analysis tools. It visualizes execution timelines (Trace Viewer), aggregates performance metrics (Overview), monitors memory usage (Memory Profile Viewer), and displays HLO graph structures (Graph Viewer). This approach allows for a comprehensive, multi-faceted view of model performance within a familiar TensorBoard environment.
Quick Start & Requirements
pip install tbp-nightly
(for the latest version).tensorboard --logdir=profiler/demo
Highlighted Details
Maintenance & Community
The project follows TensorFlow's versioning scheme. Links to guides for JAX, TensorFlow, and Cloud TPU profiling are provided.
Licensing & Compatibility
The README does not explicitly state the license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
Offline usage may result in missing charts and tables. Multi-worker GPU profiling requires independent analysis of each worker. Cloud TPU profiling necessitates Google Cloud TPU access.
1 day ago
1+ week