Reddit r/LocalLLaMA

minor speed bump for MTP with Qwen3.6-27B-MTP Q6_K_XL

Signal: 35
Hype: 15
In three lines
Personal benchmark on a MacBook M5 Max: under llama.cpp, Qwen 3.6-27B-UD-Q6_K_XL reaches 22.3 tokens/s with MTP (multi-token prediction) versus 19 tokens/s without it. That is a modest improvement of about 17%, smaller than the gains reported elsewhere.
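
The 17% figure follows directly from the two throughput numbers quoted in the summary. A minimal sketch of that arithmetic, assuming both values are steady-state generation speeds in tokens per second; the function name `relative_speedup` is illustrative and not from the original post:

```python
def relative_speedup(baseline_tps: float, new_tps: float) -> float:
    """Return the fractional throughput gain of new_tps over baseline_tps."""
    if baseline_tps <= 0:
        raise ValueError("baseline throughput must be positive")
    return new_tps / baseline_tps - 1.0


# Figures quoted in the summary (tokens per second).
without_mtp = 19.0
with_mtp = 22.3

gain = relative_speedup(without_mtp, with_mtp)
print(f"MTP speedup: {gain:.1%}")  # prints "MTP speedup: 17.4%"
```

The gain is measured relative to the non-MTP baseline, which is why 3.3 extra tokens/s on a base of 19 works out to roughly 17%.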
Qwen, Benchmarks, Code generation

Summary generated by Claude — human-verified