arXiv cs.CL

Disentangling Ambiguity from Instability in Large Language Models: A Clinical Text-to-SQL Case Study

Signal: 75
Hype: 15
In three lines: CLUES, a framework for clinical Text-to-SQL, decomposes semantic uncertainty into separate ambiguity and instability scores using the Schur complement of a bipartite semantic graph matrix. Tested on AmbigQA/SituatedQA and a clinical benchmark, it outperforms Kernel Language Entropy and enables efficient triage: 51% of errors are concentrated in 25% of queries.
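
For readers unfamiliar with the term, the sketch below shows what a Schur complement of a block matrix is. The summary does not say how CLUES builds its bipartite semantic graph matrix or how the Schur complement is mapped to ambiguity and instability scores, so the toy matrix `M`, the block size `k`, and the helper `schur_complement` are all illustrative assumptions and not the paper's method.

```python
import numpy as np


def schur_complement(M: np.ndarray, k: int) -> np.ndarray:
    """Schur complement of the top-left k x k block A in M = [[A, B], [C, D]].

    Returns D - C A^{-1} B. Hypothetical helper for illustration only.
    """
    A, B = M[:k, :k], M[:k, k:]
    C, D = M[k:, :k], M[k:, k:]
    # Solve A X = B rather than forming A^{-1} explicitly.
    return D - C @ np.linalg.solve(A, B)


# Toy symmetric 4 x 4 matrix, split into 2 x 2 blocks (assumed values).
M = np.array([
    [2.0, 0.0, 1.0, 0.0],
    [0.0, 2.0, 0.0, 1.0],
    [1.0, 0.0, 2.0, 0.5],
    [0.0, 1.0, 0.5, 2.0],
])

S = schur_complement(M, k=2)
print(S)
```

The Schur complement summarizes how the second group of nodes relates to itself once the first group has been eliminated, which is presumably why it is useful for separating two sources of uncertainty in a two-part graph.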
Papers, Benchmarks, Evals, AI safety

Summary generated by Claude — human-verified