Back to feed
arXiv cs.AI·

PAREDA: A Multi-Accent Speech Dataset of Natural Language Processing Research Discussions

Signal
72
Hype
18
In three linesPAREDA is a multi-accent speech dataset (Australian, Indian, Chinese English) featuring spontaneous discussions on NLP papers. SOTA ASR models degrade in zero-shot settings, but fine-tuning on PAREDA significantly reduces WER, validating the corpus's value for building robust ASR systems.
Read source
Your take?
BenchmarksVoicePapers

Summary generated by Claude — human-verified