T

marfrit db26d0ccb7 context: enforce_budget honors token_budget + R2 guard (Phase 8 commit #3 )

Pillar 5 (analyze finding A1) — the real value-add of Phase 8.
Until now, ctx.token_budget = 4096 was set but never enforced;
enforce_budget only looked at max_turns. With commit #2's accurate
tokenization wired in (via commit #4), eviction now finally fires
when the actual context fills the budget.

Loop condition change:

  before:
    while #self.turns > self.max_turns do

  after:
    while (#self.turns > self.max_turns
           or self:estimate_tokens() > self.token_budget)
          and #self.turns > 0 do

R2 guard: the `and #self.turns > 0` clause is essential. When
system_prompt alone exceeds token_budget (e.g. a 5000-token [project]
block with token_budget=4096), the OR-condition stays true even
when turns are empty — table.remove on a 0-length list would
no-op forever while evicted++ spins. Sonnet review caught this;
without the guard, real users could hit an infinite loop just
by setting a small token_budget + opening a large project tree.

Per-pair eviction logic (summarize callback + pair-pop) inside the
loop is unchanged. The estimate_tokens call is potentially expensive
under tokenize_fn — commit #2's per-turn cache amortizes to O(N)
per iteration after first fill; for max_turns=40 + budget=4096
sessions the worst case is microseconds per call.

Unit-verified across 5 cases (with and without tokenize_fn):
  1. max_turns eviction unchanged (no behavior regression).
  2. char/4 path: tight budget evicts to 0 when sys > budget,
     exits via R2 guard.
  3. char/4 path: practical budget evicts to a stable count.
  4. tokenize_fn stub: evicts to exactly the (budget - sys)/per-turn
     count.
  5. R2 critical: zero turns + oversize sys -> immediate exit,
     evicted=0, no spin.

Behavior change for existing users: a session that fit under
token_budget=4096 by char/4 (~16K chars) may now evict earlier
because accurate counts are HIGHER for most natural-text inputs
(per baseline B2). Users on cloud presets with very large context
windows (Claude 200K) should raise token_budget to match — see §9
risk row in PHASE8.md.

Regression: test_safety 87/87, test_router_model 31/31, repl loads.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-16 23:30:37 +00:00

.claude

.claude: project-shared permission allowlist for read-only MCP/Bash

2026-05-12 08:08:26 +00:00

docs

docs/PHASE8: review fold-in — 2 BLOCKERs + 4 CONCERNs + 4 NITs

2026-05-16 23:28:27 +00:00

ffi

repl: configurable prompt template via config.shell.prompt (closes #10 )

2026-05-16 21:14:43 +00:00

vendor

phase0 amendment: vendor dkjson 2.8 under vendor/

2026-05-10 11:30:16 +00:00

.gitignore

.claude: project-shared permission allowlist for read-only MCP/Bash

2026-05-12 08:08:26 +00:00

broker.lua

broker: token_count + endpoint capability cache (Phase 8 commit #1 )

2026-05-16 23:29:17 +00:00

CLAUDE.md

docs: codify contribution flow — issues for features, PRs for review

2026-05-10 11:04:51 +00:00

config.lua

config + docs/PHASE7: example block + status -> Implement (Phase 7 commit #6 )

2026-05-16 23:02:55 +00:00

context.lua

context: enforce_budget honors token_budget + R2 guard (Phase 8 commit #3 )

2026-05-16 23:30:37 +00:00

executor.lua

repl: sub-broker delegation via DELEGATE: marker (closes #6 )

2026-05-16 21:29:09 +00:00

history.lua

history: memory.jsonl store + flock (Phase 4 commit #1 )

2026-05-13 04:52:03 +00:00

main.lua

main: non-interactive -p/--prompt one-shot mode (closes #4 )

2026-05-16 21:06:27 +00:00

mcp.lua

mcp: JSON-RPC client + ffi/curl status_code; PHASE0 §4 amended

2026-05-12 13:06:39 +00:00

README.md

docs: fix Phase-N references stale after MCP renumber

2026-05-10 05:43:58 +00:00

renderer.lua

repl + renderer: tree-sitter highlighter (Phase 6 commit #5 )

2026-05-16 22:27:04 +00:00

repl.lua

repl: :cost meta surface (Phase 7 commit #5 )

2026-05-16 23:02:24 +00:00

router.lua

router: classify_model heuristic + 31-case corpus (Phase 5 commit #1 )

2026-05-13 11:17:22 +00:00

safety.lua

safety + repl: opts.category for Norris + probe (Phase 7 commit #4 )

2026-05-16 23:01:21 +00:00

secrets.lua

secrets: vault loader + scrub/rehydrate + autodetect (#13 commit 1)

2026-05-16 21:36:39 +00:00

test_router_model.lua

router: classify_model heuristic + 31-case corpus (Phase 5 commit #1 )

2026-05-13 11:17:22 +00:00

test_safety.lua

safety: is_destructive static pattern matcher (Phase 3 commit #1 )

2026-05-12 22:47:10 +00:00

README.md

aish

aish — AI-augmented conversational shell.

A single REPL that interleaves shell command execution and language-model conversation, backed by a llama.cpp HTTP broker. Implementation is LuaJIT 2.x with FFI bindings to libcurl, GNU readline, and libc — no C extensions, no build step, one source tree.

Why

Three flows that currently live in three windows fold into one:

"Run this command and show me the output" — fast feedback loop, no copy-paste between terminal and chat.
"Explain or write code based on the output we just looked at" — exec output is automatically injected into the model's context.
"Plan and execute a multi-step task with confirmation gates" — landing in Phase 3 as Chuck Norris autonomous mode.

aish is not a wrapper around bash. It's a first-class interactive environment where the shell is one of several execution channels.

Status

Component	State
Repository skeleton	✅ in this commit
Phase 0 manifest	✅ `docs/PHASE0.md` — locked
Phase 0 implementation	🔜 next session
Phase 1+	📋 enumerated in PHASE0.md §11

Every module file currently raises not implemented (Phase 0 pending) when called. luajit main.lua fails loudly at the first un-implemented function, never silently.

Quick orientation

Read this	If you want to know
`docs/PHASE0.md` §1–2	What aish is and what Phase 0 ships
`docs/PHASE0.md` §3	Technology decisions (LuaJIT, FFI, readline, libcurl, llama.cpp)
`docs/PHASE0.md` §4	Directory layout — these file names are stable across all phases
`docs/PHASE0.md` §5	How input is dispatched (meta / shell / AI)
`docs/PHASE0.md` §6	Broker contract: `/v1/chat/completions`, `CMD:` extraction
`docs/PHASE0.md` §10	Config schema and resolution order
`docs/PHASE0.md` §11	Phase sequence (what lands when)
`docs/PHASE0.md` §13	Open questions, tracked per phase
`CLAUDE.md`	Project conventions for AI-assisted contributors

Directory layout

aish/
├── main.lua              # entry point
├── repl.lua              # readline loop, dispatch, prompt
├── broker.lua            # llama.cpp HTTP client
├── router.lua            # input classifier (meta/shell/AI)
├── executor.lua          # command exec + CMD: extraction
├── context.lua           # in-memory turn history
├── history.lua           # disk persistence (Phase 1+)
├── safety.lua            # destructive-op gate (Phase 3+)
├── renderer.lua          # output formatting
├── config.lua            # default model registry + preferences
├── ffi/
│   ├── curl.lua          # libcurl easy interface
│   ├── readline.lua      # GNU readline
│   ├── pty.lua           # forkpty (Phase 1+)
│   └── libc.lua          # chdir, errno, strerror
└── docs/
    └── PHASE0.md         # locked substrate

Build / runtime dependencies

System packages (Debian / ALARM / Arch names):

luajit (>= 2.0)
libcurl4 / libcurl-openssl-3 runtime
libreadline8 runtime
libc6 runtime (always present)

No compilation, no luarocks, no make. Just luajit main.lua.

Running

Once Phase 0 ships:

luajit main.lua                          # uses ~/.config/aish/config.lua
luajit main.lua --config ./config.lua    # explicit config path
AISH_CONFIG=/path/to/config.lua luajit main.lua

Config resolution order is documented in docs/PHASE0.md §10.

Configuration

config.lua is a Lua file returning a single table. The committed config.lua in this repo is both the canonical example and the development-fallback config (lowest precedence). Copy it to ~/.config/aish/config.lua and edit endpoints to your local llama.cpp servers, or point AISH_CONFIG at your own.

The default endpoints assume mfritsche's home network:

fast → dirac.fritz.box:8081 (Qwen2.5-Coder-7B q4 8k ctx)
deep → dirac.fritz.box:8080 (Qwen2.5-Coder-7B q4 32k ctx)
cloud → hossenfelder.fritz.box:8082 (forwards to OpenRouter)

Replace these with your own llama.cpp endpoints if you're not on that LAN.

License

Not yet selected. Default-private until decided.

Project conventions

See CLAUDE.md for contribution conventions, commit style, and the phase-loop discipline this project follows.

README.md Unescape Escape

aish

Why

Status

Quick orientation

Directory layout

Build / runtime dependencies

Running

Configuration

License

Project conventions

README.md