Back to feed
arXiv cs.CL·

Syllabic-Structure Decoder for Automatic Speech Recognition in Vietnamese

Signal
72
Hype
15
In three linesNew ASR approach for Vietnamese using syllabic-structure phoneme-based decoding. Model captures phonological composition of syllables instead of orthographic units, reducing vocabulary size. Outperforms PhoWhisper and Wav2Vec2 on LSVSC and UIT-ViMD benchmarks.
Read source
Your take?
VoiceBenchmarksPapers

Summary generated by Claude — human-verified