Reddit r/MachineLearning·27 May 2026

noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]

Signal

Hype

In three linesnoisekit is an open-source CLI to generate annotated degraded speech datasets for realistic STT benchmarking (telecom G.711, ambient noise, reverb). Solves the gap: public datasets (FLEURS, CommonVoice) are too clean to evaluate production performance. HuggingFace AudioFolder compatible, includes PESQ/SNR/NISQA metrics.

Read source

Your take?

Voice Evals Benchmarks Open source Tools

Summary generated by Claude — human-verified

noisekit - CLI for generating realistic degraded speech datasets for ASR benchmarking [P]

Other angles on this story