rosenblatt/TODO.md at 24adc74812d1712fe33e53074186ae22635f00a9

marfrit 24adc74812 Rosenblatt: project scaffold for RK3588 NPU on mainline

Codename: Frank Rosenblatt — Mark I Perceptron 1958, the first
hardware neural network.  This project lights up the RK3588 NPU on
mainline Linux so the OSS world finally owns the silicon-side of
inference on that chip.

Phase-1 scope: small LLM running CPU + NPU mix on boltzmann (Rock 5
ITX+).  Backend: llama.cpp with a new rknpu ggml backend offloading
INT8 GEMM (attention + FFN matmuls) to the NPU's tile-MAC array while
leaving dequant / RoPE / softmax / sampling / embedding on A76 NEON.

Target model: qwen2.5-1.5B-instruct Q4_K_M GGUF.

Scaffold layout: README.md (frame + 9+1-phase plan), TODO.md (rolling
punch-list), docs/{npu-mainline-status,architecture}.md, kernel/ for
DT bindings + driver tweaks, userspace/{npu-probe,llm-runtime}/,
fleet/boltzmann.yaml.

Next: Phase-1 substrate audit — fill the TBDs in docs/npu-mainline-status.md
with the actual state of Tomeu Vizoso's rknpu / DRM-accel work on
the boltzmann-running kernel.

3.0 KiB

Raw Blame History

TODO — Rosenblatt

Phase 1 — substrate audit

Phase 2 — formulate

Phase 3 — analyze

Phase 4 — baseline

Cross-phase / standing items

3.0 KiB Raw Blame History Unescape Escape

TODO — Rosenblatt

Phase 1 — substrate audit

Phase 2 — formulate

Phase 3 — analyze

Phase 4 — baseline

Cross-phase / standing items

3.0 KiB

Raw Blame History