File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -7,7 +7,7 @@ build-backend = "setuptools.build_meta"
77
88[project ]
99name = " quantcpp"
10- version = " 0.10.0 "
10+ version = " 0.10.1 "
1111description = " Single-header LLM inference engine with KV cache compression (7× compression at fp32 parity)"
1212readme = " README.md"
1313license = { text = " Apache-2.0" }
Original file line number Diff line number Diff line change 1919 from importlib .metadata import version as _pkg_version
2020 __version__ = _pkg_version ("quantcpp" )
2121except Exception :
22- __version__ = "0.10.0 " # fallback for editable / source-tree imports
22+ __version__ = "0.10.1 " # fallback for editable / source-tree imports
2323
2424import os
2525import sys
Original file line number Diff line number Diff line change @@ -52,5 +52,15 @@ degradation from +3.8% to +0.6% at 28 KB cost.
5252- Added progressive=True to Model()
5353- Published v0.10.0 to PyPI
5454
55- ### Round 3: Infinite Scrollback (IN PROGRESS)
56- - Goal: replace "context exceeded → stop" with "context full → compress oldest → continue"
55+ ### Round 3: Infinite Scrollback (DONE)
56+ - Implemented context shift in tq_generate.c + quant.h
57+ - Verified: SmolLM2-135M at ctx=64, 500 tokens with 9 auto-shifts
58+ - Context never overflows — generation continues seamlessly
59+
60+ ### Round 4: Compressed Persistence (DONE)
61+ - quant_save_context / quant_load_context API
62+ - QKVC file format: 64-byte header + raw compressed KV data
63+ - Python: m.save_context("doc.kv") / m.load_context("doc.kv")
64+ - "Read once, query forever" — verified round-trip
65+
66+ ### Round 5: Next — S5 WASM Demo or PyPI publish v0.10.0
You can’t perform that action at this time.
0 commit comments