Skip to content

Metrics Dashboard: HELIX 2026-Q2 (post-`v0.3.3`)

Source identity (from 06-iterate/metrics-dashboard.md):

ddx:
  id: metrics-dashboard
  depends_on:
    - metric-definition

Metrics Dashboard: HELIX 2026-Q2 (post-v0.3.3)

Illustrative example using representative measurements taken from the live HELIX repo around the v0.3.3 release window. Exact values reflect a single sampling period and will drift; the dashboard’s structure and decision logic are the canonical part.

Review Window: 2026-04-01 → 2026-04-30 Baseline: v0.3.2 release window (2026-03-01 → 2026-03-31) Status: Complete

Decision

The 2026-Q2 changes (CI stability fix, --review-threshold knob, plugin manifest cleanup) improved overall execution stability with no regression in cycle time or escalation frequency. The wrapper-CLI test suite is now reliably green; bead close rate and agent attempt success both moved in the right direction. No metric crossed a degradation threshold.

Summary

v0.3.3 is a stability release: the CI green rate moved from intermittent to reliable, and operator review-cadence noise dropped after the --review-threshold knob landed. Bead close rate is up modestly; cycle time is essentially flat (within noise). No alerting or backlog growth signals indicate degradation.

Metrics Table

MetricBaselineCurrentDirectionResultSource
Bead close rate (closed/week)~22~26higher is betterpassddx bead list --status closed --json filtered by updated_at
Agent attempt success rate (closed without retry)~78%~82%higher is betterpassevents[] array on closed beads
Median cycle time (claim → close)~38m~36mlower is betternoise (within ±10%)events[] timestamps; sampled across the 2026-Q2 window
Escalation frequency (NEXT_ACTION: WAIT per helix run)~0.18~0.15lower is betterpasshelix status snapshot history
Review-cycle noise (review passes per closed bead)~1.4~1.1lower is betterpass (driven by --review-threshold)ddx bead show <id> review events on closed beads
Wrapper-CLI test green rate (CI)~85%~99%higher is betterpassGitHub Actions runs of tests/helix-cli.sh

Trend Notes

  • The wrapper-CLI green-rate jump is directly attributable to the DDx update-notice filter landed in 92293fd and shipped in v0.3.3.
  • Review-cycle noise dropped because --review-threshold lets operators raise the periodic-alignment trigger when the queue is short.
  • Cycle time is flat — the release did not target latency, and no regression appeared.
  • Close rate moved up but is partially explained by the iterate cycle consuming a batch of artifact-restoration beads in parallel; this is expected to normalize once the AR-2026-05-01 follow-ups are exhausted.

Follow-Up

  • Continue draining the AR-2026-05-01 artifact-restoration backlog (tracked in improvement-backlog.md).
  • Re-measure cycle time after the FEAT-006 context-digest propagation beads close — those are expected to reduce mean cycle time.
  • File a tracker bead to add a deterministic close-rate metric script under scripts/ so this dashboard can be regenerated mechanically next iteration instead of sampled by hand.

Review Checklist

  • Baseline is explicit (v0.3.2 release window)
  • Each metric cites a source (the actual command or events array)
  • The summary states the decision implication (v0.3.3 improved stability; no regression)