Back to feed
Reddit r/LocalLLaMA·

1000 tps generation on Qwen3.6 27B with V100s

Signal
45
Hype
35
In three linesUser reports 1000 tokens/s generation on Qwen 3.6 27B with V100s at batch 128, and 80 t/s single-user (batch 1) without MTP. Processing throughput reaches 3000 t/s.
Read source
Your take?
QwenBenchmarksInfrastructure

Summary generated by Claude — human-verified