Back to feed
Reddit r/LocalLLaMA·

21 GPU's benchmarked running a small TTS model (vram peak: 5GB)

Signal
45
Hype
25
In three linesBenchmark of 21 GPUs (mostly consumer) on OmniVoice TTS model (5GB VRAM peak). Tested via vast.ai, measures xRT (speed relative to real-time). RTX 3090 as baseline. 3 runs per GPU on small paragraph with voice cloning.
Read source
Your take?
VoiceBenchmarksTools

Summary generated by Claude — human-verified