Anomaly detection SDK using Robust Random Cut Forest algorithm
Top 61.7% on sourcepulse
This library provides a Python implementation of the Robust Random Cut Forest (RRCF) algorithm for anomaly detection in streaming data. It is designed for researchers and practitioners working with time-series data who need to identify outliers efficiently, even in high-dimensional or noisy datasets. The RRCF algorithm offers a statistically grounded anomaly score and handles data characteristics that often challenge other methods.
How It Works
The core of the library is the RCTree
class, which builds robust random cut trees. These trees are binary search trees that recursively partition data points. Anomaly detection is performed by calculating the "collusive displacement" (CoDisp) of a point, which measures how much the tree's structure changes when the point is added. Higher CoDisp indicates a higher likelihood of being an outlier. The library supports both batch and streaming anomaly detection by constructing an ensemble of these trees.
Quick Start & Requirements
pip install rrcf
numpy
(>= 1.15). Optional: pandas
, scipy
, scikit-learn
, matplotlib
for examples.Highlighted Details
Maintenance & Community
dev
branch.Licensing & Compatibility
Limitations & Caveats
The README does not explicitly state the license, making it difficult to assess commercial usability or derivative works. While it mentions contributing guidelines and testing, there's no clear indication of recent activity or active maintenance beyond the initial publication.
1 year ago
1 week