IdioLink: Retrieving Meaning Beyond Words Across Idiomatic and Literal Expressions
Signal
72
Hype
18
In three linesIdioLink is a retrieval benchmark with 10,700 documents and 2,140 queries across 107 idioms. It tests whether models can link idiomatic expressions to their literal equivalents. Current embeddings (BGE, E5, Contriever, Qwen) fail, relying on shallow topical cues instead of semantic abstraction.Read source
Your take?
Summary generated by Claude — human-verified