Reddit r/LocalLLaMA

RTX 5080 + RTX 3090 Setup: 80+ Tok/s on Qwen 3.6 27B Q8

Signal: 35
Hype: 25
In three lines
A user reports 80+ tokens/s running Qwen 3.6 27B at Q8 quantization on a dual-GPU setup (RTX 5080 + RTX 3090). The figure was measured on local hardware; the post gives no details on the inference framework or test conditions.
Qwen · Open source · Infrastructure

Summary generated by Claude — human-verified