CBT-Audio: Evaluating Audio Language Models for Patient-Side Distress Intensity Estimation in CBT Session Recordings
Signal
78
Hype
15
In three linesCBT-Audio is a dataset of 1,802 patient turns from 96 public CBT recordings with expert-validated distress labels. Evaluation of 10 open-source audio language models shows audio improves distress estimation over text alone in 8/10 model families, with strongest gains when verbal content and vocal delivery diverge.Read source
Your take?
Summary generated by Claude — human-verified