arXiv cs.AI·19 May 2026

AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning


In three lines: AMARIS introduces persistent evaluation memory to improve rubrics in LLM RL fine-tuning. The system accumulates evaluation diagnostics over time, uses static and dynamic retrieval to contextualize rubric modifications, and adds ~5% time overhead. Experiments show consistent gains across closed and open-ended domains.


Reinforcement learning Fine-tuning Evals Papers

Summary generated by Claude — human-verified

