Back to feed
arXiv cs.CL·

HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

Signal
78
Hype
15
In three linesHalluWorld is a controlled benchmark for evaluating LLM hallucinations through explicit reference worlds (gridworlds, chess, terminal tasks). Frontier models solve perceptual hallucinations on direct observations well, but struggle with multi-step state tracking and causal forward simulation, even with extended thinking.
Read source
Your take?
BenchmarksReasoningAI safety

Summary generated by Claude — human-verified