arXiv cs.LG·27 May 2026

Balancing Plasticity and Stability with Fast and Slow Successor Features

Signal

Hype

In three linesStudy on RL agent adaptation in gradually non-stationary environments. Authors modify 3D Miniworld and MuJoCo environments to introduce continuous drift, showing that synaptic consolidation applied to multi-timescale Successor Features outperforms Q-value-based approaches. Stability outweighs plasticity in continual learning with gradual changes.

Read source

Your take?

Reinforcement learning Papers Benchmarks

Summary generated by Claude — human-verified

Balancing Plasticity and Stability with Fast and Slow Successor Features

Other angles on this story