Back to feed
arXiv cs.CL·

DraDDP: A Multimodal Multi-Party Dialogue Discourse Parsing Dataset

Signal
75
Hype
15
In three linesDraDDP is the first public multimodal dataset for discourse parsing in multi-party dialogues. Built from American TV dramas, it contains 495 dialogue segments (6,374 utterances, 9.1 hours of video). Benchmarks demonstrate the value of multimodal information for identifying dependency structures and relation types between utterances.
Read source
Your take?
VisionMulti-agentBenchmarksOpen source

Summary generated by Claude — human-verified