Back to feed
arXiv cs.AI·

Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework

Signal
75
Hype
15
In three linesProRL is a programmatic reinforcement learning framework for combinatorial optimization (job shop scheduling). It generates interpretable policies as human-readable programs via a domain-specific language (DSL-S), exploring the program space through local search and Bayesian optimization. Outperforms classical heuristics and DRL baselines with minimal training episodes.
Read source
Your take?
Reinforcement learningReasoningBenchmarksOpen source

Summary generated by Claude — human-verified