quantumaikr/quant.cpp
quantumaikr/quant.cpp: LLM inference with 7x longer context. Pure C, zero dependencies. Lossless KV cache compression + single-header library. License: apache-2.0. Hugging Bay hosted release. Scan: pending.
- License
- apache-2.0
- Scan status
- pending
- Hosting status
- external
- Upstream
- quantumaikr/quant.cpp
Open interactive artifact page