Qwen 3.6 benchmarks on 2x RTX PRO 6000
Signal
72
Hype
15
In three linesQwen 3.6 benchmarks on 2x RTX PRO 6000 with vLLM. Qwen 3.6 27B BF16 reaches 1800 tps (64 concurrency, MTP-2). Qwen 3.6 35B BF16 reaches 3500 tps generation (128 concurrency, MTP-Off) with 30k tps prompt processing.Read source
Your take?
Summary generated by Claude — human-verified