fix: add missing prototype for turbo_cpu_fwht_inverse to resolve -Wmissing-prototypes CI error by sujitvasanth · Pull Request #12 · AtomicBot-ai/atomic-llama-cpp-turboquant

sujitvasanth · 2026-05-13T03:28:41Z

Overview

turbo_cpu_fwht_inverse was added in 0759506 without a forward declaration, triggering -Wmissing-prototypes which is treated as -Werror in the expanded CI suite, causing all builds to fail.
Fix: add forward declaration before the function definition in ggml-turbo-quant.c.

Additional information

Referenced in #8

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: Yes, cowrote with Claude, reran build locally and now compiling without warnings on ubuntu 20.04

…ssing-prototypes CI error

…ILE FA routing (TheTom#176) * HIP: fix turbo KV decode crash under graph capture; batch-aware VEC/TILE FA routing Route small-batch (decode) quantized-KV flash attention through the graph-safe VEC kernel and let large prefill batches fall through to the fast TILE/MMA kernel. Make the f16 dequant temp allocation capture-aware: allocate from the ggml pool while a stream is capturing (no cudaMalloc/cudaFree/cudaStreamSynchronize), keep raw alloc for large eager prefill so the multi-GB buffer is released immediately (gfx1201 has no VMM, the legacy pool would retain it). Fixes 'FLASH_ATTN_EXT failed: operation not permitted when stream is capturing' with GGML_HIP_GRAPHS=ON and turbo KV types on RDNA4. Tested on gfx1201 (Radeon AI PRO R9700, Windows, HIP SDK 7.1): pp2048 735 t/s (vs 188 t/s without graphs), tg128 22.9 t/s, no decode crash. Possibly related: AtomicBot-ai#12. * fattn (HIP): note pool-retention tradeoff for non-VEC captured decode Address review on TheTom#176: document that head_dim==192 / K-stride-mismatch configs fall through to the TILE/MMA path under capture and pool-alloc the full f16 dequant buffer, which the legacy pool retains permanently -- a VRAM tradeoff, not a crash. VEC-eligible head dims (Gemma) never hit this. --------- Co-authored-by: KaiAtAdesso <KaiAtAdesso@users.noreply.github.com>

fix: add missing prototype for turbo_cpu_fwht_inverse to resolve -Wmi…

1ea1cce

…ssing-prototypes CI error

github-actions Bot added the ggml label May 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add missing prototype for turbo_cpu_fwht_inverse to resolve -Wmissing-prototypes CI error#12

fix: add missing prototype for turbo_cpu_fwht_inverse to resolve -Wmissing-prototypes CI error#12
sujitvasanth wants to merge 1 commit into
AtomicBot-ai:feature/turboquant-kv-cachefrom
sujitvasanth:fix/turbo-fwht-prototype

sujitvasanth commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sujitvasanth commented May 13, 2026

Overview

Additional information

Requirements

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant