clopus-watcher  by kubeden

AI-powered Kubernetes on-call engineer

Created 1 month ago
279 stars

Top 93.4% on SourcePulse

GitHubView on GitHub
Project Summary

<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> Clopus Watcher is a Kubernetes-native tool acting as an autonomous, 24/7 on-call engineer. It monitors pods for errors, analyzes logs, and can directly apply hotfixes via kubectl exec, or generate reports. This offers a proactive solution for maintaining application stability in Kubernetes.

How It Works

<2-4 sentences on core approach / design (key algorithms, models, data flow, or architectural choices) and why this approach is advantageous or novel.> The project deploys as a Kubernetes CronJob scanning pods in a target namespace for degradation (e.g., CrashLoopBackOff, Error). Upon detection, it reads logs for diagnosis and can execute commands within the pod to apply hotfixes. Errors and fixes are recorded in an SQLite database. A separate Dashboard deployment provides a web UI for visualizing these events.

Quick Start & Requirements

  • Primary Install/Run: Deploy using ./scripts/deploy.sh after setting up prerequisites.
  • Non-default Prerequisites: Kubernetes cluster, Sealed Secrets, kubectl, local build tools (podman/docker), container registry access.
  • Configuration: Key environment variables include TARGET_NAMESPACE, AUTH_MODE (api-key/credentials), WATCHER_MODE (autonomous/watcher), ANTHROPIC_API_KEY (for api-key auth), and SQLITE_PATH.
  • Deployment Options: Supports authentication via Anthropic API key or credentials file, managed through Kubernetes secrets.

Highlighted Details

  • Autonomous on-call engineer for Kubernetes.
  • Automated detection of pod errors.
  • Direct hotfix application via kubectl exec.
  • Logging of errors and fixes to SQLite.
  • Web dashboard for monitoring incidents.

Maintenance & Community

The provided README lacks details on contributors, sponsorships, community channels, or a roadmap.

Licensing & Compatibility

The license type is not specified, precluding assessment of commercial use or closed-source integration compatibility.

Limitations & Caveats

<1-3 sentences on caveats: unsupported platforms, missing features, alpha status, known bugs, breaking changes, bus factor, deprecation, etc. Avoid vague non-statements and judgments.> Autonomous hotfix application carries inherent risks requiring validation and monitoring. Fix effectiveness depends on Claude Code's analysis accuracy and error nature. The README does not detail specific unsupported platforms or known bugs.

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
0
Star History
234 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.