arXiv cs.LG·19 May 2026

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

Signal

Hype

In three linesStudy of adversarial action removal attacks in self-play reinforcement learning. An attacker selectively masks legal actions from the victim's action set. Experiments on poker (6 to 5,531 states) and two non-poker domains: learned masking causes substantially more damage than random masking, persists across Q-learning/PPO/NFSP/DQN, transfers between agents, and is amplified by self-play.

Read source

Your take?

Reinforcement learning AI safety Benchmarks

Summary generated by Claude — human-verified

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

Other angles on this story