h264: V3D shaders for chroma deblock V + H (4:2:0) #29
Reference in New Issue
Block a user
Delete Branch "noether/v3d-shader-h264-deblock-chroma"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Adds the QPU shader pair for chroma_v / chroma_h deblock (non-intra bS<4). Siblings of cycle 8's luma_v and PR #28's luma_h. Per H.264 §8.7.2.4 chroma kernel is simpler — only p0/q0 updated, tC = tc0_seg+1, 8 cells/edge (vs luma's 16). Shader is 64 lines vs luma_v's 108.
4:2:0-only (4:2:2 has a 16-row chroma_h edge geometry not handled). Recipe table flips DEBLOCK_CV/CH from CPU to QPU. Shared QPU plumbing factored between V and H.
Test results on hertz:
chroma v: 256/256 bit-exact,chroma h: 256/256 bit-exact. Recipe substrate=2 (QPU) for both.Non-intra deblock QPU matrix complete after this PR: luma_v ✓, luma_h ✓, chroma_v ✓, chroma_h ✓. Intra (bS=4) variants stay CPU NEON (less common, smaller per-frame contribution, structurally different algorithm).