Back to feed
arXiv cs.CL·

PQR: A Framework to Generate Diverse and Realistic User Queries that Elicit QA Agent Failures

Signal
72
Hype
18
In three linesPQR is a framework that automatically generates realistic user queries to identify failures in LLM-based QA agents. Through iterative interaction between query refinement and prompt refinement modules, PQR uncovers 23-78% more unhelpful responses than prior methods, with more diverse and realistic queries.
Read source
Your take?
AI AgentsEvalsAI safety

Summary generated by Claude — human-verified