Reddit r/LocalLLaMA·22 May 2026

I fine-tuned Cohere Transcribe to support diarization and timestamps

Signal

Hype

In three linesDeveloper fine-tuned Cohere Transcribe to add diarization (speaker identification) and timestamps. Model outputs parsable format with average temporal precision of ±0.097s. Supports up to 4 speakers per 30s, extensible to 32 with diarize_long.py script. Available free on Hugging Face.

Read source

Your take?

Open source Fine-tuning Voice

Summary generated by Claude — human-verified

I fine-tuned Cohere Transcribe to support diarization and timestamps

Other angles on this story