Back to feed
arXiv cs.AI·

Robust Shielding for Safe Reinforcement Learning

Signal
78
Hype
15
In three linesNovel shielding framework for RL agents ensuring formal safety guarantees in MDPs with unknown transition dynamics. Uses robust MDPs (RMDPs) with sets of transition probabilities and LTL formulas. Combines shielding with PAC-learning methods to construct minimally restrictive shields while guaranteeing safety.
Read source
Your take?
Reinforcement learningAI safetyReasoning

Summary generated by Claude — human-verified