Back to feed
arXiv cs.LG·

Balancing Plasticity and Stability with Fast and Slow Successor Features

Signal
72
Hype
15
In three linesStudy on RL agent adaptation in gradually non-stationary environments. Authors modify 3D Miniworld and MuJoCo environments to introduce continuous drift, showing that synaptic consolidation applied to multi-timescale Successor Features outperforms Q-value-based approaches. Stability outweighs plasticity in continual learning with gradual changes.
Read source
Your take?
Reinforcement learningPapersBenchmarks

Summary generated by Claude — human-verified