Reddit r/LocalLLaMA

If you had $150K for building a production-class local inference server to serve 300 people, what would you buy?

Signal: 35 · Hype: 15
In three lines
A user is looking to spend $150K on a production inference failover server for 300 users. The current setup is four H100s running 122B AWQ models at 256k context with vLLM. The alternatives under consideration are a SuperMicro build with RTX Pro 6000s or a DGX Station.
Infrastructure · Open source

Summary generated by Claude — human-verified