Reddit r/LocalLLaMA

llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s

Signal: 72 · Hype: 15
In three lines: llama.cpp optimizes its Qwen 3.6/3.5-MTP support following multiple PRs and fixes. The community is sharing tokens/s benchmarks together with full configurations (quantization, context size, KV cache, MTP settings). Example: 207.90 t/s prompt eval and 24.07 t/s generation, with a 52.6% draft acceptance rate.
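For anyone posting numbers, here is a minimal sketch of how the three figures in that example relate to raw counters. It assumes throughput is tokens divided by wall-clock seconds and that draft acceptance is accepted draft tokens divided by drafted tokens; whether llama.cpp computes its reported figures exactly this way is an assumption. `RunStats`, its field names, and the counts in the usage note are hypothetical, chosen only so the output matches the quoted figures. They are not taken from the post.

```python
from dataclasses import dataclass


@dataclass
class RunStats:
    """Raw counters for one benchmark run (hypothetical field names)."""
    prompt_tokens: int
    prompt_seconds: float
    generated_tokens: int
    generation_seconds: float
    drafted_tokens: int    # tokens proposed by the draft/MTP step
    accepted_tokens: int   # drafted tokens the main model kept


def tokens_per_second(tokens: int, seconds: float) -> float:
    """Throughput in tokens per second."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return tokens / seconds


def acceptance_rate(accepted: int, drafted: int) -> float:
    """Fraction of drafted tokens that were accepted (0.0 if none drafted)."""
    if drafted == 0:
        return 0.0
    return accepted / drafted


def summarize(stats: RunStats) -> str:
    """Format one run in the style used in the summary above."""
    pp = tokens_per_second(stats.prompt_tokens, stats.prompt_seconds)
    tg = tokens_per_second(stats.generated_tokens, stats.generation_seconds)
    acc = acceptance_rate(stats.accepted_tokens, stats.drafted_tokens)
    return (f"{pp:.2f} t/s prompt eval, {tg:.2f} t/s generation, "
            f"{acc:.1%} draft acceptance")
```

With illustrative counts, `summarize(RunStats(2079, 10.0, 2407, 100.0, 1000, 526))` produces a line of the same shape as the example. Posting the raw counts next to the rates makes runs easier to compare across configurations.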
Tags: Llama, Qwen, Benchmarks, Open source, Tools

Summary generated by Claude — human-verified