Back to feed
arXiv cs.CL·

Can LLMs Refuse Questions They Do Not Know? Measuring Knowledge-Aware Refusal in Factual Tasks

Signal
78
Hype
15
In three linesResearchers propose the Refusal Index (RI), a metric measuring LLMs' ability to refuse questions beyond their knowledge. RI correlates refusal probability with error probability using Spearman's rank correlation. Testing across 16 models and 5 datasets shows LLMs refuse unreliably despite high factual accuracy.
Read source
Your take?
EvalsAI safetyAlignmentBenchmarks

Summary generated by Claude — human-verified