Can I Take Another Dose? Evaluating LLM Decision-Making Under Temporal Uncertainty in OTC Dosing QA
Signal
72
Hype
15
In three linesDOSEBENCH, a benchmark of 81 OTC dosing scenarios (acetaminophen, ibuprofen), evaluates 4 LLMs on temporal reasoning and medical constraint adherence. Models struggle with 24-hour rolling-window calculations and ambiguous cases, despite confident-looking responses.Read source
Your take?
Summary generated by Claude — human-verified