ML toolkit for automated log parsing
Top 24.5% on sourcepulse
This repository provides a machine learning toolkit and benchmarks for automated log parsing, enabling users to extract event templates from unstructured logs and structure log analytics. It is targeted at researchers developing new log parsing methods and practitioners evaluating existing ones.
How It Works
The toolkit implements various log parsing algorithms, including Drain, Spell, and Logram, each with different approaches to identifying log message templates. These methods typically involve techniques like fixed-depth trees, n-gram dictionaries, or streaming parsing to cluster similar log messages and extract common templates with parameterized placeholders. This allows for efficient analysis of large volumes of log data.
Quick Start & Requirements
pip install logparser3
regex==2022.3.2
(recommended)deap
for MoLFI, torch
for NuLog, openai
for DivLog).logparser/Drain/demo.py
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is primarily geared towards research and benchmarking, with the current implementation noted as "far from ready for production use." Suggestions for production readiness include enhancing efficiency, scalability, failure recovery, and persistence, with Drain3 cited as a reference for practical enhancements.
1 month ago
1 week