Back to feed
Reddit r/LocalLLaMA·

Any reason to run dense over MOE for RAGs?

Signal
35
Hype
15
In three linesUser compares dense vs MoE for RAG: Qwen 3.6 35B APEX (MoE) outperforms Qwen 3.6 27B (dense) on information retrieval and speed (150 vs 60 tok/s on 3090). Asks if MoE has specific advantages for RAG against common sub assumptions.
Read source
Your take?
QwenRAGOpen source

Summary generated by Claude — human-verified