[hacker news] KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit


2026-04-21T03:16:22.843Z·1 min read

Summary

KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit

This article first appeared on Hacker News.

Read the original: [KV Cache Compression 900000x Beyond TurboQuant and Per-Vector Shannon Limit](https://arxiv.org/abs/2604.15356)
