arXiv cs.LG · 4 June 2026

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning


In three lines: Tabular reinforcement learning applied to the metro network expansion problem (MNEP). The problem is reformulated as a Non-Markovian Rewards Decision Process (NMRDP), and the tabular approach matches deep RL performance with 18× fewer training episodes and 12× lower carbon emissions. The method incorporates social equity criteria and is validated on Xi'an and Amsterdam.



Summary generated by Claude — human-verified
