Reddit r/LocalLLaMA

Tried to benchmark Google’s new on-device dictation models (Eloquent) and basically couldn’t

A developer tried to benchmark Eloquent, Google's new on-device dictation app, which runs proprietary models. About 50% of dictations came back truncated, with utterances of 20+ words reduced to 5-10. In the runs where transcription completed (15 of 50 tests), accuracy was competitive (~24% WER vs. ~21% for Qwen3-ASR), but the chat-style model often refused to transcribe rather than producing text.
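
For readers unfamiliar with the metric, word error rate (WER) is the word-level edit distance between the reference transcript and the model output, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition, not the poster's benchmark code; the function name `word_error_rate`, the lowercase-and-split normalization, and the synthetic example sentence are all assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count.

    Assumption: text is normalized by lowercasing and whitespace splitting;
    real benchmarks usually also strip punctuation and normalize numbers.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein, row by row).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Synthetic example: a 20-word utterance truncated to its first 5 words.
full_utterance = " ".join(f"w{i}" for i in range(20))
truncated_output = " ".join(f"w{i}" for i in range(5))
truncated_wer = word_error_rate(full_utterance, truncated_output)
```

Under this definition a 20-word utterance cut down to its first 5 words scores a WER of 0.75 from the 15 deleted words alone, which is why truncated dictations have to be separated from completed ones before accuracy figures can be compared.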

DeepMind · Benchmarks · Voice
SIG 42 · HYP 35