Back to feed
arXiv cs.CL·

IdioLink: Retrieving Meaning Beyond Words Across Idiomatic and Literal Expressions

Signal
72
Hype
18
In three linesIdioLink is a retrieval benchmark with 10,700 documents and 2,140 queries across 107 idioms. It tests whether models can link idiomatic expressions to their literal equivalents. Current embeddings (BGE, E5, Contriever, Qwen) fail, relying on shallow topical cues instead of semantic abstraction.
Read source
Your take?
BenchmarksEmbeddingsRAGPapers

Summary generated by Claude — human-verified