Discover and explore top open-source AI tools and projects—updated daily.
adridaLLM classification routing for cost efficiency
Top 63.8% on SourcePulse
Summary
TRACER addresses the high cost of LLM-based classification by intelligently routing predictable inputs to lightweight, traditional ML models. It targets users with LLM classification pipelines, offering over 90% cost reduction and formal parity guarantees against the teacher LLM, with a self-improving routing policy.
How It Works
TRACER learns a decision boundary from LLM classification traces. It fits a fast, non-LLM surrogate model (e.g., logistic regression, LightGBM) on "easy" inputs. A calibrated "acceptor gate" estimates surrogate agreement with the LLM, deferring uncertain inputs. New traces from deferred calls feed back into subsequent refits, automatically increasing surrogate coverage and reducing LLM reliance. This enables sub-millisecond, CPU-bound inference for handled cases, drastically cutting costs while maintaining formal parity guarantees.
Quick Start & Requirements
pip install tracer-llm. Optional [embeddings] for sentence-transformers.tracer demo.Highlighted Details
Maintenance & Community
No specific details on contributors, sponsorships, or community channels were found in the provided README.
Licensing & Compatibility
MIT License. This license is permissive and generally compatible with commercial use and closed-source linking.
Limitations & Caveats
Effectiveness depends on initial trace quality and ongoing data generation for continual learning. Potential troubleshooting areas include selected_method=null and coverage drift. Parity gate calibration is critical for maintaining accuracy guarantees.
3 days ago
Inactive
KarelDO
txsun1997
tensorzero