marfrit/aish - aish - marfrit's space

Author	SHA1	Message	Date
marfrit	7d62eb5659	review followups: pcall shield, :resume guard, shell quoting, nits CONCERNs from the Phase 1 review pass: ffi/curl.lua: - SSE write_cb body is now pcall-wrapped. A Lua error in on_event (or in the parse loop itself) is captured into cb_error and surfaced after curl_easy_perform rather than propagating across the FFI callback boundary (which LuaJIT documents as process-fatal). The EOS flush path gets the same shield. Errors return (nil, "callback: <msg>") from post_sse. history.lua: - sh_singlequote() escapes shell metacharacters; the mkdir -p and ls -1 shell-outs no longer double-quote (where $(...) and $VAR still expand) — single-quote with embedded-' escaping is the safe form. - M.load now returns (turns, meta) instead of (meta, turns). turns is ALWAYS a table on success, never nil-when-no-header; failure path is the unambiguous (nil, err). Callers can `if not turns then` without the previous ambiguity. repl.lua :resume updated to the new shape. repl.lua :resume: - Refuse to resume into a non-empty ctx — silent overwrite was the Q15 default, but the review surfaced the no-undo / no-warning failure mode. User must :reset (or :save then re-launch) to express intent. The current session's on-disk log is unaffected either way. NITs: - ffi/libc.lua READ_BUF: comment noting it's module-shared and Phase 1 has no reentrant readers; revisit when that changes. - PHASE1.md §7: \C-x\C-c reservation pinned to Phase 3 ("deferred from Phase 1 — no consumer here") rather than the previous dangling "(or here)". Regression suite verifies: - history.load new signature on success + failure paths - shell-quoted history.dir with $ doesn't trip - aish scripted run: ctx with 2 turns refuses :resume anchor with a clear status; user must :reset first Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 20:05:23 +00:00
marfrit	1f1065157e	review BLOCKER: PTY input forwarding + raw mode toggle Phase 1 review caught a structural gap: executor.exec only drained the PTY master fd, never forwarded user keystrokes — vim/less/htop/nano would render and hang on input. PHASE1.md §5 specified bidirectional multiplex but only the read leg landed. tcgetattr/tcsetattr were also missing, so even with input forwarding the parent's line discipline would buffer until newline (breaking single-key UIs). ffi/libc: - struct termios opaque buffer + tcgetattr/tcsetattr + cfmakeraw - M.set_raw(fd) saves termios + applies cfmakeraw; returns saved or (nil, err) when fd isn't a tty (scripted / piped-stdin runs) - M.restore_termios(fd, saved) - struct pollfd + M.poll (POLLIN constant) executor: - multiplex(sess): poll(stdin, master); reads master on any revents (POLLHUP fires when child closes its slave end, not POLLIN — the revents != 0 check catches both); forwards stdin keystrokes to master; loop exits when master read returns 0 (EOF / child gone) - stdin polling is only enabled when stdin_is_tty (set_raw succeeded); piped-stdin runs (tests / scripted) would otherwise drain queued aish commands into the child of the current cmd, swallowing them - raw mode is restored before returning so the user lands back at the aish prompt in canonical mode renderer + repl: - exec_output(out, code) split into exec_begin() (top rule, before spawn) + exec_end(code) (closing rule with exit, after wait). PTY multiplex streams the body live to stdout in between; the renderer never re-prints the body. PHASE1.md §3: - tcgetattr/tcsetattr changed from "optional" to "required for single-key UIs to work — done-criteria #2"; poll added to the libc row description. Verified: - non-interactive smoke (echo / false / exit 7 / ls /nonexistent / printf multi-line) — all exit codes correct, output streamed live, a\nb\nc\n preserved byte-for-byte - scripted-stdin run reaches all expected lines (no stdin draining into a non-interactive child) - aish prompt + framed exec block + exit-code line all render in correct order Live interactive verification (vim / less / htop in a real terminal) still needs a user-test pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 20:00:53 +00:00
marfrit	a75118b2ae	readline: bind() via rl_bind_keyseq; repl reserves \C-n no-op Phase 1 readline binding wiring per PHASE1.md §7. ffi/readline: M.bind(seq, lua_fn) -> bool Wraps lua_fn as a C callback (signature `int (int, int)` per readline's rl_command_func_t) and registers it via rl_bind_keyseq(seq, cb). Returns true on success (rl returns 0). Trampolines are pinned in module-local state so they outlive the bind call — readline retains the function pointer for the process lifetime. Rebinding the same seq frees the previous trampoline. Bound handlers are pcall-wrapped so a Lua error doesn't crash readline's input loop. repl: Binds \C-n to a no-op that emits "[aish] Norris mode not yet implemented (Phase 3)" Verifies the mechanism end-to-end; Phase 3 (Norris autonomous mode) replaces the body with the actual toggle. Smoke covers bind / rebind-same-seq (exercises the :free path) / bind-different-seq with no errors. Live keyboard verification waits on user-test. Phase 1's 8(+1) inner loop is now functionally through `implement`; next inner phase is `verify` (review pass) followed by memory-update. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 19:26:58 +00:00
marfrit	9d586870e8	repl: session persistence wiring — auto-log, :save, :resume, :sessions Phase 1 session log integration per PHASE1.md §6. On every M.run(), open a session file at <config.history.dir>/sessions/<utc-iso8601>.jsonl with a meta header (started, model, aish_version). If history.dir is unset or unwritable, status-log the disable and continue without persistence. ask_ai logs the merged user turn (after pending exec output is folded in) and the assistant turn (after streaming completes). run_shell does NOT log [exec output] — that becomes part of the next user turn when ctx.pending_exec_output is flushed. New meta commands: :sessions list session files; "*" marks the active one :save <name> rename current session log to <name>.jsonl (auto- appends .jsonl); reopens for continued append :resume <name> load <name>.jsonl into ctx (replaces current turns via ctx:reset + append loop). The current process's own session log is unaffected — Phase 1 chooses per-process logs over chained continuations. :quit and EOF (Ctrl-D) both close the session file via shutdown_session before exiting. HELP text updated (no longer "Phase 0:" header since meta set has grown). Q15 noted in PHASE1.md §10 (resume into non-empty context) is resolved by the ctx:reset() in :resume — silent overwrite for Phase 1, revisit if anyone cares. End-to-end live verified: chat -> auto-log; :save renames; :sessions listings; :resume + :history shows the round-trip. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 19:23:05 +00:00
marfrit	a722f576ac	repl + renderer: streaming assistant output (Phase 1) repl.ask_ai now drives broker.chat_stream and pumps each delta into renderer.assistant_delta(delta) as it arrives. renderer.assistant_flush is called when the stream ends to add a trailing newline if missing. The full reassembled response is then handed to executor.extract_cmd_lines for the CMD: confirm-and-execute path (unchanged from Phase 0). renderer.assistant() is kept for non-streaming callers (none in tree right now, but cheap to keep around). assistant_delta/flush share no state with assistant(); they use a module-local stream_buf that tracks the in-progress streamed block. Q12 deferred: incremental CMD: highlighting (cursor-positioning re- render on flush) is not implemented in Phase 1 — deltas emit raw. The §6 CMD: marker is still extractable on the reassembled string post- stream, which is what executor cares about. Renderer's bold+cyan treatment for CMD: lines stays available via M.assistant(). Broker error / SSE-framed api-error path still pops the user turn and restores ctx.pending_exec_output. Order: assistant_flush always runs (even on error) so the cursor lands on a fresh line before the broker- error status renders. Live verification: `Count one to ten` against hossenfelder fast streams deltas through to stdout incrementally; CMD: extraction works on the reassembled string; confirm gate intact. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 19:17:27 +00:00
marfrit	16490e6905	fix: buffer exec output for next user turn; alternation for strict templates User-test surfaced the bug: with `deep` (mistral-nemo-12b) active, running `list files` -> y on `CMD: ls` -> `Are there directory entries beginning with "lor"?` returned a Jinja exception: api: ... Error: Jinja Exception: After the optional system message, conversation roles must alternate user/assistant/user/assistant/... Cause: §6 specified "exec output injected into context uses role 'user' with a prefix tag '[exec output]'." This works for permissive templates (qwen2.5-coder-1.5b, the `fast` preset) but produces a back-to-back user/user pair on strict templates that enforce the OpenAI alternation contract — `[exec output]` user turn followed by the user's actual follow-up question. Fix: context.lua: - new field `pending_exec_output` (initially nil) - new method `:append_exec_output(out)` buffers (concat on subsequent captures so multi-shell-then-ai still merges everything) - new method `:append_user(content)` flushes buffered exec output as a `[exec output]\n...\n\n` prefix and appends a user turn - `:reset()` also clears the buffer repl.lua: - run_shell calls ctx:append_exec_output(out) instead of ctx:append({role="user", content="[exec output]\n"..out}) - ask_ai calls ctx:append_user(text) instead of raw :append; saves prev_pending so a broker error can restore the buffer for retry PHASE0.md §6: - amended the role-injection paragraph to describe the buffer-and- prepend policy; the §3 invariants list is untouched (this was a §6 design detail, not a locked invariant) Verification: - context unit tests cover: alternation after the failing sequence, multi-shell merge, reset clears buffer, broker-error retry path - live reproduction against `deep` (mistral-nemo) of the exact user-reported sequence succeeds; model responds with a sensible `CMD: ls \| grep '^lor'` instead of a Jinja exception Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 18:41:21 +00:00
marfrit	abc993aa49	review followup: empty-input guards, ~/ symmetry, CMD: filter Addresses three concerns + one nit from the Phase 0 review pass. executor.lua: - M.exec guards empty / whitespace-only cmd up front, returns "(empty command)" / -1 instead of running the wrapper on nothing. - On sentinel-parse failure with empty output (typical of shell parse errors — the syntax error itself escapes to the popen parent's stderr because 2>&1 is inside the unparsable subshell), surface "(no output — possible shell parse error)" rather than a silent empty frame. - extract_cmd_lines now skips whitespace-only / empty bodies; a bare `CMD: ` line in assistant output no longer turns into an "execute ''? [y/N]" prompt. - "what" comments cleaned in maybe_chdir. router.lua: - path_like now matches `~` and `~/foo` so `~/scripts/build.sh` classifies as shell (was: ai). Restores symmetry with executor's maybe_chdir, which already expands `~` on `cd`. repl.lua: - :exec and :ask trim args and renderer.status a usage line on empty rather than running an empty cmd / sending an empty turn to broker. Regression: full prior smoke suite still passes — known_commands shell paths, all maybe_chdir branches, CMD: extraction with non-empty bodies, exec exit-code recovery, all router branches. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 17:41:35 +00:00
marfrit	e0e69f839b	repl: readline loop, dispatch, all Phase 0 meta commands Phase 0 implementation per PHASE0.md §5, §9. Wires the lower-half modules into a single REPL: ffi/readline -> input + history router -> classify(line) -> meta/shell/ai executor -> run_shell with cd interception, frame output, capture broker -> ask_ai, then extract+confirm CMD: lines from response context -> turn list + eviction; status line on evict renderer -> assistant text + exec frame + status Prompt format `[aish:<model>]> ` per §9. Meta commands all wired (§5.2): :quit/:q, :clear, :reset, :model <name>, :models, :history, :exec <cmd>, :ask <text>, :help. Unknown meta names report via renderer.status rather than crashing. End-of-input (Ctrl-D on empty line) breaks the loop cleanly. Empty / whitespace-only lines are skipped silently before dispatch — router would otherwise classify them as ai with empty payload and pollute context. `CMD: ` extraction + confirm-and-execute is wired: when broker returns an assistant turn, the response is scanned for §6 CMD: lines; each is prompted via readline ("execute '...'? [y/N]") when config.shell .confirm_cmd is true (default), else auto-executed. On broker error, the user turn just appended is popped so the context isn't polluted with a turn that has no assistant response. Smoke covers :help, :models, shell exec via known_commands allowlist, and Ctrl-D break. Live broker exchange deferred per issue #12. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 15:17:40 +00:00
claude-noether	4310207738	Phase 0: scaffold tree + manifest - README, .gitignore, CLAUDE.md (project conventions) - docs/PHASE0.md — full Phase 0 manifest (locked substrate) - 10 root .lua modules + 4 ffi/ bindings, all stubs raising NotImplemented with module-scoped responsibilities matching the manifest - config.lua wired to current dirac/hossenfelder endpoints (qwen-coder-7b snappy/32k + cloud via OpenRouter through hossenfelder) File names match docs/PHASE0.md §4 exactly. Module bodies fill in across later phases; the tree shape is locked. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 23:16:07 +00:00

9 Commits