Back to feed
arXiv cs.AI·

Progressive Autonomy as Preference Learning: A Formalization of Trust Calibration for Agentic Tool Use

Signal
72
Hype
18
In three linesFormalizes trust calibration for autonomous agents as preference learning. A policy gateway maintains a Gaussian-process posterior over human risk tolerance from binary approve/deny feedback, escalating uncertain decisions to humans. Structured as Preferential Bayesian Optimization with uncertainty-targeted querying.
Read source
Your take?
AI AgentsReasoningAI safetyAlignment

Summary generated by Claude — human-verified