arXiv cs.CL

SCRIBE: Diagnostic Evaluation and Rich Transcription Models for Indic ASR

Signal: 78
Hype: 15
In three lines: SCRIBE is a diagnostic framework for Indic ASR that breaks errors down by category (lexical, punctuation, numerals, domain entities) rather than collapsing them into a single WER score. Its alignment is sandhi-tolerant and supports domain vocabulary injection. Open-weight rich transcription models are released for Hindi, Malayalam, and Kannada.
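To make the idea of category-wise error decomposition concrete, here is a minimal Python sketch. It is not SCRIBE's implementation: the function names `categorize` and `diagnose`, the three coarse categories, and the word-boundary check standing in for sandhi tolerance are all assumptions made for illustration. Real sandhi also changes characters at word junctions, which this sketch does not handle, and domain entities are left out because they would need a vocabulary list.

```python
import difflib
import string
from collections import Counter

# Hypothetical punctuation set: ASCII punctuation plus the Devanagari danda.
PUNCT = set(string.punctuation) | {"।"}


def categorize(token):
    """Assign a token to a coarse, illustrative error category."""
    if token and all(ch in PUNCT for ch in token):
        return "punctuation"
    if any(ch.isdigit() for ch in token):
        return "numeral"
    return "lexical"


def diagnose(reference, hypothesis):
    """Count errors per category from a token-level alignment."""
    ref, hyp = reference.split(), hypothesis.split()
    counts = Counter()
    matcher = difflib.SequenceMatcher(a=ref, b=hyp, autojunk=False)
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op == "equal":
            continue
        ref_span, hyp_span = ref[i1:i2], hyp[j1:j2]
        # Crude stand-in for sandhi tolerance (an assumption):
        # accept spans that differ only in where words are split.
        if "".join(ref_span) == "".join(hyp_span):
            continue
        # Attribute errors to reference tokens, or to hypothesis
        # tokens when the span is a pure insertion.
        for tok in (ref_span or hyp_span):
            counts[categorize(tok)] += 1
    return counts


if __name__ == "__main__":
    print(diagnose("the price is 250 rupees .", "the price is 215 rupees"))
    print(diagnose("maha atma spoke", "mahaatma spoke"))
```

In the first call, the misrecognized number and the dropped full stop are counted separately instead of as two undifferentiated word errors. In the second, the merged word is not penalized because only the word boundary differs.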
Benchmarks · Evals · Voice · Open source

Summary generated by Claude — human-verified