docs: add TurboQuant MLX PDD plan#2148

Open

ianleon wants to merge 1 commit into

exo-explore:mainfrom

ianleon:plan/turboquant-mlx-pdd

ianleon commented Jun 4, 2026

Summary

add a design plan for TurboQuant-style KV cache compression in the MLX runner
outline phased work for benchmarking, cache adapter integration, Apple Silicon fast path, and PDD cache handoff
call out Qwen3-Next hybrid cache handling and default-off rollout constraints

Tests

not run; documentation-only change


          docs: add TurboQuant MLX PDD plan

a6ff87b

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet