Metric-Gradient Projection for Stable Multi-Agent Policy Learning
Signal
72
Hype
15
In three linesHPML (Hodge-Projected Multi-agent Learning) stabilizes multi-agent learning by projecting the joint update field onto a metric-gradient component. The method uses Hodge-type projection in an L² space of vector fields, implemented via graph-based and amortized neural realizations. Results: improved stability and normalized returns on CTDE benchmarks.Read source
Your take?
Summary generated by Claude — human-verified