FEAT-011: Slider Autonomy Control
Source identity (from
01-frame/features/FEAT-011-slider-autonomy.md):
ddx:
id: FEAT-011
status: superseded
superseded_by: helix.prdSUPERSEDED — This feature defined operator-facing autonomy controls (
low/medium/high) forhelix-runand the supervisory execution loop. The current PRD (helix.prd) removes the execution loop, runtime UX, and supervisory autopilot from HELIX’s scope entirely. Autonomy configuration is a runtime concern. If DDx or another runtime wants an autonomy slider, it must be authored in runtime-owned documentation with no implication that HELIX owns the execution model. This document is retained for historical context only and must not govern new HELIX work.
FEAT-011: Slider Autonomy Control
Status: Planning
Priority: P1
Created: 2026-04-07
Related ADRs: ADR-001 (supervisory control model)
Problem Statement
Current HELIX has 15+ skills that mirror CLI commands. This creates cognitive load and doesn’t match the actual user workflow, which is:
- Two primary modes: “take design input” vs “work the queue”
- Human input is sparse: “design a CRM”, “use postgres”, “make logout blue”, “here’s a bug”
- Agent must maintain artifact stack coherence: Vision → PRD → Specs → ADRs → Designs → Tests → Code
The current command-centric model doesn’t support:
- Natural language input interpretation
- Cross-cutting concern enforcement (e.g., “use postgres” should check existing SQLite ADR)
- Human-agentic continuum (sometimes human does work, sometimes agent runs away)
- Non-blocking conflict escalation
- Automated artifact graph traversal
Requirements
FR-01: Artifact Graph Metadata (Based on Existing Hierarchy)
Each artifact shall declare its connections using the existing HELIX cross-reference pattern:
# TD-036-list-mcp-servers.md
**User Story**: [[US-036-list-mcp-servers]]
**Parent Feature**: [[FEAT-001-mcp-server-management]]
**Solution Design**: [[SD-001-mcp-management]]Graph metadata includes:
- Upstream dependencies: Higher-authority artifacts (PRD, FEAT, SD) that govern this one
- Downstream dependents: Lower-authority artifacts (TD → TP → tests → code) derived from this one
- Peer relationships: Same-level artifacts (US-036 ↔ US-037 in same feature)
Storage: Cross-references embedded in artifact frontmatter/body using [[ID]] notation. No separate registry needed initially.
FR-02: Impact Detection via Search + Declared Links
The system shall identify impacted artifacts through two-activity detection:
- Declared links first: Follow existing
[[ID]]cross-references from changed artifact - Search-based fallback: Use
rgto find term matches (e.g., “Postgres” → ADRs about database choice)
Precedence: Declared links take precedence over search results to avoid false positives. Search only supplements when declared links are incomplete.
FR-03: Graph Traversal and Change Flow
Once impacted artifacts are identified, the system shall:
- Trace the graph: Follow upstream/downstream connections from impact point
- Order by authority: Process in canonical order (Vision → PRD → FEAT → US → SD → TD → TP → Tests → Code)
- Note: ADRs can appear at multiple levels; they govern both specs AND designs
- Flow changes: Propagate modifications through each layer as atomic diffs
- Create downstream work: Generate beads for artifacts that need updating
Cycle handling: Traversal uses visited set to prevent infinite loops on peer relationships.
FR-04: Physics-Level vs Resolvable Conflict Detection
The system shall distinguish between:
| Type | Definition | Example | Action |
|---|---|---|---|
| Resolvable | Different approaches to same goal | Postgres vs SQLite for database | Escalate via bead, continue with documented assumption |
| Physics-level | True contradictions that cannot coexist | “must be real-time” + “batch processing only” | Block execution, require human resolution |
High autonomy mode proceeds through resolvable conflicts; blocks only on physics-level.
DDx preserve outcomes from bounded execution attempts (for example failed required executions or ratchet regressions) are not physics-level conflicts. They terminate the current DDx-managed attempt and hand control back to HELIX for escalation, follow-on beads, or user input.
FR-05: Autonomy Slider Controls Flow Behavior
The system shall support configurable autonomy level (per session or project):
| Level | Behavior | Use Case |
|---|---|---|
| Low | Ask before each graph step and before creating each downstream artifact; do not make changes until the user approves | Learning mode, critical systems |
| Medium | Traverse graph autonomously, ask only on conflicts/ambiguity | Default for most work |
| High | Continue end-to-end autonomously; escalate resolvable conflicts as non-blocking beads; only physics-level conflicts stop the supervisory workflow, while DDx preserve outcomes end the current bounded attempt and hand control back to HELIX | Trusted contexts, rapid iteration |
FR-06: Minimal Execution Surface (Backward Compatible)
Execution entrypoints reduce to two core capabilities while maintaining backward compatibility:
- Input command:
helix input "<natural language>"- accepts sparse input, triggers graph traversal - Queue-drain command:
ddx agent execute-loop- drains execution-ready beads using DDx-managed claim, execute-bead, land/preserve, and close-with-evidence semantics
helix run remains as a compatibility wrapper and operator convenience surface
while DDx loop parity is validated. If execute-loop plus HELIX skills reach
the required parity, HELIX-owned execution wrappers (helix run, and
potentially helix build) become deprecation candidates while input,
planning, review, and spec-shaping surfaces remain HELIX-owned.
Planning actions such as alignment do not need their own permanent execution
substrate. If helix align remains as a skill or CLI surface, it must behave
as a thin entrypoint that creates or claims the governing alignment bead and
then invokes the stored alignment prompt.
FR-07: Bead-Centric Coordination
All work flows through beads (non-blocking for resolvable conflicts):
- User input → [Bead: Request]
- Agent refines → [Children: Questions/Decisions]
- Conflicts → [Bead: Escalation - non-blocking, label
kind:escalation] - Gaps → [Bead: Gap - speculative, label
kind:speculative]
Workers execute from the bead queue without blocking on resolvable conflicts. Physics-level conflicts block only the affected worker path.
FR-08: Verification Loop and Traceability
The system shall verify that artifact flow produces functional, measurable results:
Traceability Model (Based on Existing Pattern)
FEAT-001 → SD-001 → US-036 → TD-036 → TP-036 → tests → code
↓
US-037 → TD-037 → TP-037 → tests → codeEach layer references upstream via [[ID]] notation. Tests reference spec IDs in comments or metadata.
Failure Triage Algorithm
When verification fails:
- Classify failure type:
- Test bug (test wrong) → create bead for test fix
- Implementation bug (code wrong) → create bead for code fix
- Spec conflict (spec contradicts upstream) → trace back to spec, create escalation bead
- Trace back through graph using
[[ID]]references: failing test → TP → TD → US/FEAT → PRD - Identify adjustment point: nearest upstream artifact that directly specifies the failing behavior
- Create follow-on beads for resolution
Metrics and Non-Functional Requirements
For non-testable constraints (latency, security, compliance):
- Define named metrics in artifacts (e.g., “P95 < 100ms” in PRD)
- Automated checks collect metrics continuously
- Threshold violations trigger reflow via beads
DDx Substrate vs HELIX Workflow Ownership
This section summarizes the canonical boundary defined in CONTRACT-001. If this feature text and the contract diverge, the contract is authoritative.
DDx owns the substrate:
- graph primitives (
[[ID]]indexing, upstream/downstream traversal, reverse lookup) - graph-discovered execution docs and required validation execution
- managed bead execution via
ddx agent execute-bead <bead-id> [--from <rev>] [--no-merge] - single-project queue draining via
ddx agent execute-loop - runtime evidence capture, execution runs, metric projection, ratchet evaluation, and merge/preserve/close mechanics
HELIX owns the workflow semantics:
- autonomy behavior (
low/medium/high) - authority ordering and artifact-flow policy
- conflict classification and escalation behavior
- workflow routing (
helix input, when to delegate a queue toexecute-loop, follow-on bead creation) - deterministic bead acceptance and success-measurement authoring
- prompt design and prompt-engineering strategy
- stage-authored behavior stance for planning, execution, review, alignment, and supervisory steps
HELIX does not need a separate user-facing “personality profile” configuration surface to preserve this ownership. The intended contract is simpler:
- stage stance lives in HELIX action prompts, skill wording, and execution-doc conventions
- DDx still owns harness/model execution plus concrete model resolution
- HELIX may request stage-appropriate tier or harness constraints, but it must not turn stage stance into duplicated model-version policy
Default stage families:
- planning (
input,frame,design,evolve,triage,polish): exploratory, ambiguity-surfacing, artifact-authoring - managed execution (
build,measure): bounded, contract-following, anti-feature-creep - review: adversarial and risk-first
- alignment: top-down and drift-seeking
- supervisory (
check,report): concise, state-oriented, policy-applying
Handoff contract: HELIX decides scope, autonomy behavior, queue policy, and
workflow context; DDx executes either one bounded bead (execute-bead) or the
project queue (execute-loop) and returns runtime evidence plus merge/preserve
outcomes; HELIX interprets preserved, failed, or blocked results to continue,
escalate, or ask for input. Direct ddx agent run remains appropriate for
planning, review, alignment, and other non-managed prompts that should not
claim and close beads automatically; those direct prompts should reuse the same
HELIX-authored stage stance rather than inventing a separate personality lane.
Acceptance Criteria
AC-01: Artifact Graph Metadata
- Existing
[[ID]]cross-reference pattern works for graph traversal - Traversal can follow declared connections automatically (tested with fixture artifacts)
- Search-based impact detection supplements declared relationships when links incomplete
AC-02: Impact Detection and Traversal
- Given sample artifact graph, change in PRD produces exactly expected downstream beads
- Graph traversal follows canonical authority order with ADR exceptions handled correctly
- Cycle detection prevents infinite loops on peer relationships
AC-03: Conflict Classification
- System distinguishes resolvable vs physics-level conflicts (test cases defined)
- Resolvable conflicts create escalation beads and continue with documented assumptions
- Physics-level conflicts block execution and require human resolution
AC-04: Autonomy Slider Behavior
- Low autonomy: asks before each graph step and before creating each downstream artifact, human approves flow (verified via test scenarios)
- Medium autonomy: autonomous traversal, asks on conflicts/ambiguity (default behavior)
- High autonomy: full autonomous flow (“until blocked”), escalates resolvable conflicts via beads and blocks only physics-level conflicts at the supervisory level
AC-05: CLI Simplification (Backward Compatible)
-
helix input "<text>"command accepts natural language and triggers graph traversal -
ddx agent execute-loopis the primary documented queue-drain command for execution-ready beads -
helix runis either a thin compatibility wrapper over DDx-managed queue draining or is explicitly documented as transitional - Legacy commands (
design,build, etc.) route through the new model or have an explicit deprecation/retention decision recorded
AC-06: Verification Loop
- 100% of tests reference spec IDs (via comments or metadata)
- Failed verification traces back through graph to identify adjustment point (tested with fixtures)
- New constraints discovered during verification create follow-on beads for resolution
- Metrics thresholds defined and trigger reflow when violated
AC-07: Backward Compatibility
- Existing CLI commands continue to work during transition period
- Existing skills remain functional (may be deprecated but not broken)
-
helix runloop continues to function with new bead types (kind:escalation,kind:speculative)
AC-08: DDx Handoff Is Observable
- HELIX dispatches bounded implementation/verification work through
ddx agent execute-beadrather than an implicit internal execution path - HELIX documents
ddx agent execute-loopas the queue-drain substrate rather than a wrapper-owned claim/execute/close loop - Preserve-vs-merge outcomes returned by DDx are observable inputs to HELIX workflow behavior (continue, escalate, or ask)
- A DDx preserve outcome ends the current bounded attempt and returns control to HELIX without being misclassified as a physics-level conflict
- Runtime evidence returned by DDx is sufficient for HELIX to interpret required execution outcomes and ratchet results without building a parallel execution store
AC-09: Automation-Friendly Success Criteria
- Execution-ready beads carry deterministic acceptance criteria with explicit commands, checks, or observable repository states
- Success measurement criteria are specific enough that DDx-managed execution can close merged work with evidence instead of relying on manual interpretation
- Direct
ddx agent runis not required to understand whether a merged execution-ready bead succeeded; the bead contract itself is sufficient
Non-Requirements
- This feature does NOT replace the existing CLI/skill contract immediately (deprecation timeline defined)
- This feature does NOT eliminate activity enforcers (they become parameterized by slider)
- This feature is NOT production-ready until fully tested in sandbox branch
- This feature does NOT require new artifact storage format (uses existing
[[ID]]pattern)
Dependencies
- DDx tracker must support custom labels for escalation/gap/speculative beads (
kind:escalation,kind:speculative) - Artifact stack must have complete
[[ID]]cross-references for reliable traversal - Slider config schema needs to be defined and validated (
.helix/slider-config.yaml) - DDx must provide the substrate defined in
CONTRACT-001: DDx / HELIX Boundary Contract, including graph primitives, managed bead execution, and runtime evidence capture
Autonomy Scope Contract (Explicit)
low: ask-first — do not proceed without explicit user confirmation per step and per downstream artifact creationmedium: guided-autonomy — ask when ambiguity or conflict blocks deterministic progresshigh: high-autonomy — run to completion unless physics-level constraints stop progress; ordinary DDx preserve outcomes end the current bounded attempt and return control to HELIX, but are not themselves physics-level conflicts
helix input SHOULD expose this as --autonomy.
Risks
- Breaking existing workflows: If we replace skills too aggressively, automation breaks → Mitigation: deprecation timeline, backward compatibility
- Silent divergence: Speculative work might drift from intent without proper reconciliation → Mitigation: escalation beads with documented assumptions
- Cognitive load shift: Users need to understand slider settings and their implications → Mitigation: clear documentation, sensible defaults
- Graph traversal complexity: Following artifact relationships correctly is non-trivial → Mitigation: start with declared links only, add search as supplement
Success Metrics
- Reduced skill count (from 15+ to 2 core + optional specialized)
- Faster time from input to first artifact creation
- Higher quality conflict detection (fewer silent drifts)
- User satisfaction with autonomy level control
- Automated change flow through artifact graph works reliably (tested with fixtures)
- Verification loop closes successfully: tests pass, metrics met, failures trigger correct reflow
- Functional artifacts produced: each layer produces measurable, verifiable output
- High execute-loop closure rate: execution-ready beads land and auto-close from DDx-managed evidence without follow-up clarification on what “done” means
References
- Artifact Hierarchy - canonical authority order and naming
- ADR-001: Supervisory Control Model - helix-run as supervisor
- CONTRACT-001: DDx / HELIX Boundary Contract - platform/workflow ownership split
- CONTRACT-002: HELIX Execution-Document Conventions - how HELIX authors execution docs, metrics, and ratchet-backed validations for DDx discovery
- DDx BEAD Tracker - execution tracking conventions
This feature is in PLANNING mode. Do not implement until technical design document (TD-011) is approved and beads are created through proper HELIX workflow.