1000 tps generation on Qwen3.6 27B with V100s
Signal
45
Hype
35
In three linesUser reports 1000 tokens/s generation on Qwen 3.6 27B with V100s at batch 128, and 80 t/s single-user (batch 1) without MTP. Processing throughput reaches 3000 t/s.Read source
Your take?
Summary generated by Claude — human-verified