Reddit r/LocalLLaMA

Scrambling to max StrixHalo (+NVLink dual eGPU 3090 mod)

Signal: 35 · Hype: 15
In three lines: A user optimizes a Strix Halo system (124 GB VRAM) by adding dual RTX 3090 eGPUs linked via NVLink to speed up 27B/31B dense models. Tests show significant throughput gains for multi-agent scenarios, but with trade-offs in power efficiency and llama.cpp compatibility.
Open source · Infrastructure · AI Agents

Summary generated by Claude — human-verified