llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s
Signal
72
Hype
15
In three linesllama.cpp optimizes Qwen 3.6/3.5-MTP support following multiple PRs and fixes. Community shares tokens/s benchmarks with full configurations (quantization, context, KVCache, MTP). Example: 207.90 t/s prompt eval, 24.07 t/s generation with 52.6% draft acceptance rate.Read source
Your take?
Summary generated by Claude — human-verified