marfrit/aish - aish - marfrit's space

Author SHA1 Message Date

Author	SHA1	Message	Date
marfrit	cf4d79dd9d	docs/PHASE3: analyze + baseline — \C-n mechanics, LLM latency, module pre-state Analyze findings folded into the manifest: A1. \C-n binding can't toggle mid-prompt without rl_insert_text / rl_redisplay. Solution: bind those (one cdef + 2 wrappers in ffi/readline.lua) so \C-n inserts ":norris " at the cursor; user types goal + Enter. Routes through existing meta dispatch. A2. broker has no max_tokens passthrough. Add opts.max_tokens for the LLM second-opinion path (terminates at ~2 tokens; verified proxy honors it). A3. Phase 2 tool-sub-loop pattern IS the planner shape. safety.norris_step is the per-iteration extraction; driver loop in repl.lua. Module-changes table (§3) updated with the rl_insert_text and max_tokens rows. Baseline doc (PHASE3-baseline.md, 80 lines) captures: - LLM second-opinion latency: 425-1162ms per probe, all 5 test cases correct. Worst-case 16-step Norris = ~20s overhead; with static-pattern fast-path + session cache, ~5s realistic. - Module pre-state at commit `f26cbd9` (Phase 2 tip): LOC + state per file before Phase 3 edits. - Six static-pattern Lua-match sanity checks (all correct). - Carries: aish#15 (still open), aish#14, aish#32/#33. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 22:37:58 +00:00
marfrit	b58a842e49	docs/PHASE3: formulate — Norris autonomous mode + destructive-op gate Phase 3 formulate manifest. Three pillars per PHASE0.md §11 row 3: Chuck Norris autonomous mode (planning loop), destructive-op heuristic (static patterns + LLM second-opinion), and HALT/confirm protocol. Resolutions baked in via §2: Q2 iterative re-plan after each action (not top-down tree) Action sources CMD: lines AND MCP tool_calls — Phase 2 contract honored HALT trigger static-pattern hit OR LLM-second-opinion flag HALT shape 3-way: proceed / skip / abort Auto-approve under Norris honors Phase 2 auto_approve policy EXCEPT destructive-op heuristic always wins LLM second-opinion model the `fast` preset (cheapest) Norris prompt suffix appended to system prompt while active; "GOAL: complete" sentinel for done Key extensions: - safety.is_destructive: ~20 static shell-idiom patterns + LLM probe; runs on interactive CMD: extraction too (§9 — replaces bare confirm_cmd for known-destructive cases). Q24 worth challenging at analyze. - safety.norris_step: single-iteration of the planner. Driver loop in repl.lua. \C-n toggle (real binding, replaces Phase 1 placeholder); :norris <goal> explicit launch. - renderer.norris_begin/step/halt/end: visual parity with exec and tool_call frames. Prompt becomes [aish:fast ⚡]> per PHASE0.md §9. - context.to_messages dynamically appends NORRIS MODE suffix when norris_active. New open questions (Q23–Q30) tracked in §11: Q23 LLM second-opinion latency budget (caching mitigation) Q24 interactive CMD: also subject to is_destructive? (proposal: yes) Q25 GOAL: complete + pending actions in same response — dispatch first Q26 context preservation on abort/done/budget — all preserve Q27 :norris continue (resume after abort) — deferred to v2 Q28 side-effect MCP tools not in __shell/__write_file patterns Q29 goal-implies-authorization for destructive ops — no, always confirm Q30 :norris no-arg vs \C-n share goal-prompt path — yes, trivial Module-layout (PHASE0 §4) untouched — all changes are growth of existing files. 6 commits expected at implement. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 20:45:03 +00:00

cf4d79dd9d

docs/PHASE3: analyze + baseline — \C-n mechanics, LLM latency, module pre-state

Analyze findings folded into the manifest:

  A1. \C-n binding can't toggle mid-prompt without rl_insert_text /
      rl_redisplay. Solution: bind those (one cdef + 2 wrappers in
      ffi/readline.lua) so \C-n inserts ":norris " at the cursor; user
      types goal + Enter. Routes through existing meta dispatch.

  A2. broker has no max_tokens passthrough. Add opts.max_tokens for
      the LLM second-opinion path (terminates at ~2 tokens; verified
      proxy honors it).

  A3. Phase 2 tool-sub-loop pattern IS the planner shape. safety.norris_step
      is the per-iteration extraction; driver loop in repl.lua.

Module-changes table (§3) updated with the rl_insert_text and
max_tokens rows.

Baseline doc (PHASE3-baseline.md, 80 lines) captures:
  - LLM second-opinion latency: 425-1162ms per probe, all 5 test
    cases correct. Worst-case 16-step Norris = ~20s overhead; with
    static-pattern fast-path + session cache, ~5s realistic.
  - Module pre-state at commit f26cbd9 (Phase 2 tip): LOC + state
    per file before Phase 3 edits.
  - Six static-pattern Lua-match sanity checks (all correct).
  - Carries: aish#15 (still open), aish#14, aish#32/#33.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 22:37:58 +00:00

marfrit

b58a842e49

docs/PHASE3: formulate — Norris autonomous mode + destructive-op gate

Phase 3 formulate manifest. Three pillars per PHASE0.md §11 row 3:
Chuck Norris autonomous mode (planning loop), destructive-op heuristic
(static patterns + LLM second-opinion), and HALT/confirm protocol.

Resolutions baked in via §2:
  Q2  iterative re-plan after each action (not top-down tree)
  Action sources    CMD: lines AND MCP tool_calls — Phase 2 contract honored
  HALT trigger      static-pattern hit OR LLM-second-opinion flag
  HALT shape        3-way: proceed / skip / abort
  Auto-approve under Norris  honors Phase 2 auto_approve policy
                             EXCEPT destructive-op heuristic always wins
  LLM second-opinion model   the `fast` preset (cheapest)
  Norris prompt suffix       appended to system prompt while active;
                             "GOAL: complete" sentinel for done

Key extensions:
  - safety.is_destructive: ~20 static shell-idiom patterns + LLM probe;
    runs on interactive CMD: extraction too (§9 — replaces bare
    confirm_cmd for known-destructive cases). Q24 worth challenging
    at analyze.
  - safety.norris_step: single-iteration of the planner. Driver loop
    in repl.lua. \C-n toggle (real binding, replaces Phase 1
    placeholder); :norris <goal> explicit launch.
  - renderer.norris_begin/step/halt/end: visual parity with exec
    and tool_call frames. Prompt becomes [aish:fast ⚡]> per
    PHASE0.md §9.
  - context.to_messages dynamically appends NORRIS MODE suffix
    when norris_active.

New open questions (Q23–Q30) tracked in §11:
  Q23 LLM second-opinion latency budget (caching mitigation)
  Q24 interactive CMD: also subject to is_destructive? (proposal: yes)
  Q25 GOAL: complete + pending actions in same response — dispatch first
  Q26 context preservation on abort/done/budget — all preserve
  Q27 :norris continue (resume after abort) — deferred to v2
  Q28 side-effect MCP tools not in *__shell/*__write_file patterns
  Q29 goal-implies-authorization for destructive ops — no, always confirm
  Q30 :norris no-arg vs \C-n share goal-prompt path — yes, trivial

Module-layout (PHASE0 §4) untouched — all changes are growth of
existing files. 6 commits expected at implement.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 20:45:03 +00:00

2 Commits