If you had $150K for building a production-class local inference server to serve 300 people, what would you buy?
Signal
35
Hype
15
In three linesUser seeks $150K production inference failover server for 300 users. Current setup: 4 H100s running 122B AWQ models at 256k context with vLLM. Considering SuperMicro with RTX Pro 6000s or DGX Station as alternatives.Read source
Your take?
Summary generated by Claude — human-verified