arXiv cs.AI·19 May 2026

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

Signal

Hype

In three linesStudy of adversarial attacks via action removal in self-play reinforcement learning. An attacker selectively removes legal actions from the victim's available set. Across poker games (6 to 5,531 states) and two non-poker domains, learned masking causes more damage than random masking. The attack persists across Q-learning, PPO, NFSP, DQN and shows no recovery under extended masked training.

Read source

Your take?

Reinforcement learning AI safety Benchmarks

Summary generated by Claude — human-verified

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

Other angles on this story