FEATURE 02
ROI · WASTE · KILL
The Zombie Sentry
Stop paying for AI that isn't thinking. Save up to 20% on GPU bills by killing hung agents in real time.
Forgotten notebooks. Runaway loops. Agents stuck firing the same GPU kernel a thousand times a second while the bill clock runs. We watch the kernel-launch fingerprint of every PID through the eBPF ring buffer. When a process crosses the threshold (1000 identical launches in 2 seconds by default), we alert, and if policy says so, the Linux kernel delivers SIGKILL within milliseconds. Your CFO gets the GPU-hours back.
ROI
up to 20% reclaim on GPU spend
Default threshold
1000 identical launches / 2s
Latency
real-time · sub-second decision
Auto-kill
policy-driven · alert by default
// TECHNICAL SPECIFICATION
Signal
kernel-launch rate per (pid, fn_ptr, grid+block dims)
Heuristic
identical-launch sliding window · O(1) per event
Action
log alert · SIGKILL when policy = kill
Reporting
Prometheus counters + the turn-key Grafana panel
Override
allowlist + protected_pids · tuned per-host
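
// ILLUSTRATIVE SKETCH
The spec above can be read as a small decision loop. What follows is a minimal user-space sketch in Python, not the shipped implementation: the names SentryConfig, ZombieDetector, observe, and enforce are hypothetical, and so is the reading of the overrides, where allowlisted process names are skipped entirely and protected_pids may raise an alert but are never killed. It assumes launch events have already been read from the eBPF ring buffer and arrive as plain values with a timestamp in seconds. The os.kill call stands in for whatever mechanism actually delivers SIGKILL.

```python
import os
import signal
from collections import defaultdict, deque
from dataclasses import dataclass, field
from typing import Deque, Dict, Set, Tuple

Dims = Tuple[int, int, int]
# Launch fingerprint from the spec: (pid, fn_ptr, grid dims, block dims).
LaunchKey = Tuple[int, int, Dims, Dims]


@dataclass
class SentryConfig:
    threshold: int = 1000      # identical launches ...
    window_s: float = 2.0      # ... within this many seconds
    policy: str = "alert"      # "alert" (default) or "kill"
    allowlist: Set[str] = field(default_factory=set)       # assumed: process names
    protected_pids: Set[int] = field(default_factory=set)  # assumed: never killed


class ZombieDetector:
    """Sliding window over identical launches, O(1) work per event."""

    def __init__(self, cfg: SentryConfig) -> None:
        self.cfg = cfg
        # One bounded deque per fingerprint holding the last `threshold`
        # timestamps; appending to a full deque drops the oldest entry.
        self._recent: Dict[LaunchKey, Deque[float]] = defaultdict(
            lambda: deque(maxlen=cfg.threshold)
        )

    def observe(self, pid: int, fn_ptr: int, grid: Dims, block: Dims,
                ts: float, comm: str = "") -> str:
        """Record one launch and return 'ok', 'alert', or 'kill'."""
        if comm in self.cfg.allowlist:
            return "ok"
        recent = self._recent[(pid, fn_ptr, grid, block)]
        recent.append(ts)
        # Hot when the last `threshold` identical launches all fall
        # inside the window, i.e. the oldest kept one is recent enough.
        hot = (len(recent) == self.cfg.threshold
               and ts - recent[0] <= self.cfg.window_s)
        if not hot:
            return "ok"
        if self.cfg.policy == "kill" and pid not in self.cfg.protected_pids:
            return "kill"
        return "alert"


def enforce(pid: int, action: str) -> None:
    """User-space stand-in for the kill step (assumption, Unix only)."""
    if action == "kill":
        os.kill(pid, signal.SIGKILL)
```

Keeping only the last `threshold` timestamps per fingerprint bounds memory per key and avoids any scan on the hot path: each event costs one append and one comparison. A real deployment would also need to evict fingerprints for exited PIDs, which this sketch omits.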