arXiv cs.CL·28 May 2026

Syllabic-Structure Decoder for Automatic Speech Recognition in Vietnamese

Signal

Hype

In three linesNew ASR approach for Vietnamese using syllabic-structure phoneme-based decoding. Model captures phonological composition of syllables instead of orthographic units, reducing vocabulary size. Outperforms PhoWhisper and Wav2Vec2 on LSVSC and UIT-ViMD benchmarks.

Read source

Your take?

Voice Benchmarks Papers

Summary generated by Claude — human-verified

Syllabic-Structure Decoder for Automatic Speech Recognition in Vietnamese

Other angles on this story