arXiv cs.CL·19 May 2026

AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning

Signal

Hype

In three linesAMARIS enhances rubric-based RL by integrating persistent evaluation memory. The system accumulates evaluation diagnostics over time, retrieves them via static and semantic search, and continuously adapts reward rubrics. Experiments show performance gains with ~5% time overhead.

Read source

Your take?

Reinforcement learning Fine-tuning Evals Reasoning

Summary generated by Claude — human-verified

AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning

Other angles on this story