Back to feed
arXiv cs.LG·

Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty

Signal
75
Hype
15
In three linesJoint training of autonomous vehicle and 12 pedestrians using MARL (MAPPO) in simulation. SDC reaches 78% goal completion with 14% collision rate vs 35%/33% for rule-based baseline. Jaywalkers (13% of crossings) account for 62% of collisions. Co-training reduces collisions by 30% vs single-agent RL.
Read source
Your take?
Multi-agentReinforcement learningAI safetyRobotics

Summary generated by Claude — human-verified