Reddit r/LocalLLaMA·31 May 2026

Speed difference between Windows 11 and Linux with llama.cpp: a myth when using medium and large MoE models

Signal

Hype

In three linesllama.cpp benchmark comparing Windows 11 and Linux (Ubuntu 26.04) on Nvidia GPU (RTX 5080 + 2× RTX 5060 Ti). No significant performance difference: Qwen 3.5 122B achieves PP 300/TG 28 (Windows) vs PP 290/TG 28.5 (Linux); Qwen 3.5 397B: PP 140/TG 16 vs PP 150/TG 15.2. Tests repeated 4 times with recent llama.cpp including VRAM optimization.

Read source

Your take?

Llama Qwen Benchmarks Open source Infrastructure

Summary generated by Claude — human-verified

Speed difference between Windows 11 and Linux with llama.cpp: a myth when using medium and large MoE models

Other angles on this story