artalis-io/bitnet.c
artalis-io/bitnet.c: Minimal, zero-dependency LLM inference in pure C11. CPU-first with NEON/AVX2 SIMD. Flash MoE (pread + LRU expert cache). TurboQuant 3-bit KV compression (8.9x less memory per session). 20+ GGUF quant formats. Compiles to License: mit. Hugg
- License
- mit
- Scan status
- pending
- Hosting status
- external
- Upstream
- artalis-io/bitnet.c
Open interactive artifact page