Commit cc74de2

quantcpp 0.10.1: includes S2 infinite scrollback + S4 context persistence
1 parent 953addb commit cc74de2

3 files changed

Lines changed: 14 additions & 4 deletions

bindings/python/pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "quantcpp"
-version = "0.10.0"
+version = "0.10.1"
 description = "Single-header LLM inference engine with KV cache compression (7× compression at fp32 parity)"
 readme = "README.md"
 license = { text = "Apache-2.0" }

bindings/python/quantcpp/__init__.py

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@
     from importlib.metadata import version as _pkg_version
     __version__ = _pkg_version("quantcpp")
 except Exception:
-    __version__ = "0.10.0" # fallback for editable / source-tree imports
+    __version__ = "0.10.1" # fallback for editable / source-tree imports
 
 import os
 import sys

docs/strategy_progressive_kv.md

Lines changed: 12 additions & 2 deletions
@@ -52,5 +52,15 @@ degradation from +3.8% to +0.6% at 28 KB cost.
 - Added progressive=True to Model()
 - Published v0.10.0 to PyPI
 
-### Round 3: Infinite Scrollback (IN PROGRESS)
-- Goal: replace "context exceeded → stop" with "context full → compress oldest → continue"
+### Round 3: Infinite Scrollback (DONE)
+- Implemented context shift in tq_generate.c + quant.h
+- Verified: SmolLM2-135M at ctx=64, 500 tokens with 9 auto-shifts
+- Context never overflows — generation continues seamlessly
+
+### Round 4: Compressed Persistence (DONE)
+- quant_save_context / quant_load_context API
+- QKVC file format: 64-byte header + raw compressed KV data
+- Python: m.save_context("doc.kv") / m.load_context("doc.kv")
+- "Read once, query forever" — verified round-trip
+
+### Round 5: Next — S5 WASM Demo or PyPI publish v0.10.0
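
Note on Round 3: the "context full, make room, continue" idea can be illustrated with a toy sliding window. This is only a sketch under stated assumptions, not the logic in `tq_generate.c`: the function name `generate_with_shift` and the `drop` parameter are hypothetical, and it discards the oldest tokens outright where the original goal line talks about compressing them. The number of shifts depends entirely on the drop policy, so it is not meant to reproduce the 9 auto-shifts reported in the doc.

```python
from typing import Iterable, List, Tuple


def generate_with_shift(
    tokens: Iterable[int], capacity: int, drop: int
) -> Tuple[List[int], int]:
    """Feed tokens into a fixed-size window, evicting the oldest `drop` when full.

    Hypothetical illustration: `capacity` plays the role of ctx, and `drop` is
    an assumed policy parameter (the real shift size is not given in the diff).
    """
    if not 0 < drop <= capacity:
        raise ValueError("drop must be in 1..capacity")
    window: List[int] = []
    shifts = 0
    for tok in tokens:
        if len(window) == capacity:
            # Window is full: evict the oldest entries instead of stopping.
            del window[:drop]
            shifts += 1
        window.append(tok)
    return window, shifts


# Usage: a 64-slot window fed 500 tokens, evicting half the window per shift.
window, shifts = generate_with_shift(range(500), capacity=64, drop=32)
```

The window length never exceeds `capacity`, which is the property the "Context never overflows" bullet describes.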

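
Note on Round 4: the diff only states that a QKVC file is a 64-byte header followed by raw compressed KV data. The sketch below shows one way such a container could be packed and parsed. Every header field here (the `QKVC` magic bytes, version, token count, payload length) and the little-endian layout are assumptions for illustration, not the format implemented by `quant_save_context`.

```python
import struct
from typing import Tuple

HEADER_SIZE = 64  # from the doc: fixed 64-byte header
MAGIC = b"QKVC"   # assumed magic, inferred from the format name only

# Hypothetical field layout: magic, version, n_tokens, payload_len (20 bytes),
# zero-padded up to HEADER_SIZE.
_FIELDS = struct.Struct("<4sIIQ")


def pack_context(n_tokens: int, payload: bytes, version: int = 1) -> bytes:
    """Build header + payload as a single bytes object."""
    head = _FIELDS.pack(MAGIC, version, n_tokens, len(payload))
    return head.ljust(HEADER_SIZE, b"\x00") + payload


def unpack_context(blob: bytes) -> Tuple[int, bytes]:
    """Validate the header and return (n_tokens, payload)."""
    if len(blob) < HEADER_SIZE:
        raise ValueError("truncated header")
    magic, _version, n_tokens, payload_len = _FIELDS.unpack_from(blob, 0)
    if magic != MAGIC:
        raise ValueError("bad magic")
    payload = blob[HEADER_SIZE:HEADER_SIZE + payload_len]
    if len(payload) != payload_len:
        raise ValueError("truncated payload")
    return n_tokens, payload


# Usage: round-trip three tokens' worth of placeholder bytes.
blob = pack_context(3, b"abc")
restored = unpack_context(blob)
```

A fixed-size header lets a reader validate the file and locate the payload with a single read before touching the KV data.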