noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]
Signal: 72
Hype: 25
In three lines
noisekit is an open-source CLI that generates annotated, degraded speech datasets for realistic STT benchmarking, covering telecom coding (G.711), ambient noise, and reverb. It addresses a gap: public datasets such as FLEURS and Common Voice are too clean to reflect production performance. Output is compatible with Hugging Face AudioFolder and includes PESQ, SNR, and NISQA metrics.
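To make the degradations named above concrete, here is a minimal sketch of two of them: mixing noise into speech at a target SNR, and a round trip through 8-bit mu-law companding. This is not noisekit's code or API. The function names (`mix_at_snr`, `mu_law_roundtrip`, `measured_snr_db`) are hypothetical, and the companding uses the continuous mu-law formula as an approximation of G.711 rather than the bit-exact segmented codec.

```python
import math
import random


def rms(x):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(v * v for v in x) / len(x))


def mix_at_snr(speech, noise, snr_db):
    """Scale noise so the speech-to-noise ratio equals snr_db, then add it."""
    gain = rms(speech) / (rms(noise) * 10 ** (snr_db / 20))
    return [s + gain * n for s, n in zip(speech, noise)]


def mu_law_roundtrip(x, mu=255):
    """Compress with continuous mu-law, quantize to 8 bits, then expand.

    Assumption: approximates G.711 mu-law; it is not the bit-exact codec.
    """
    out = []
    for v in x:
        v = max(-1.0, min(1.0, v))  # clip to the valid input range
        y = math.copysign(math.log1p(mu * abs(v)) / math.log1p(mu), v)
        q = round((y + 1) / 2 * 255) / 255 * 2 - 1  # 256 uniform levels
        out.append(math.copysign(((1 + mu) ** abs(q) - 1) / mu, q))
    return out


def measured_snr_db(clean, degraded):
    """SNR in dB, treating (degraded - clean) as the noise."""
    residual = [d - c for c, d in zip(clean, degraded)]
    return 20 * math.log10(rms(clean) / rms(residual))


# Stand-in signals: a 440 Hz tone at 8 kHz for "speech", Gaussian noise.
rng = random.Random(0)
speech = [0.5 * math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

noisy = mix_at_snr(speech, noise, snr_db=10)
coded = mu_law_roundtrip(noisy)
print(f"SNR after mixing: {measured_snr_db(speech, noisy):.2f} dB")
```

Scaling the noise by RMS sets the SNR directly from its definition, so the measured value after mixing matches the requested one; the companding step then adds quantization error on top of the noise.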
Summary generated by Claude — human-verified