Back to feed
Reddit r/LocalLLaMA·

A lightweight, real-time multilingual ASR router that runs on local hardware

Signal
78
Hype
25
In three linesLightweight multilingual ASR routing system for local hardware using Zipformer, Silero VAD, and SpeechBrain. Routes audio between specialized monolingual models (~100M parameters) instead of one large model. Achieves 13% WER on inter-utterance code-switching, outperforming cloud APIs. Known limitation: 41% WER on intra-utterance switching. Open-source repo available.
Read source
Your take?
VoiceOpen sourceToolsBenchmarks

Summary generated by Claude — human-verified