arXiv cs.AI

DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling

Signal
72
Hype
18
In three lines
DecoupleSearch decouples planning and search in agentic RAG systems using dual value models. A reasoning tree is constructed with Monte Carlo Tree Search to assess the quality of each step. Hierarchical Beam Search iteratively refines planning and search candidates during inference.
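
To make the inference step concrete, here is a minimal sketch of a two-level beam search in which one scorer ranks plans and a second ranks the search queries issued under each plan. It is an illustration under stated assumptions, not the paper's implementation: the function `hierarchical_beam_search`, its parameters, and all `toy_*` helpers are hypothetical, and the hand-written scoring functions stand in for the learned value models.

```python
from typing import Callable, List, Tuple

State = Tuple[str, ...]  # one string per completed (plan, query) step


def hierarchical_beam_search(
    propose_plans: Callable[[State], List[str]],
    propose_searches: Callable[[State, str], List[str]],
    plan_value: Callable[[State, str], float],
    search_value: Callable[[State, str, str], float],
    depth: int,
    plan_beam: int = 2,
    beam_width: int = 2,
) -> Tuple[State, float]:
    """Return the highest-scoring trajectory after `depth` plan/search steps."""
    beams: List[Tuple[State, float]] = [((), 0.0)]
    for _ in range(depth):
        expanded: List[Tuple[State, float]] = []
        for state, score in beams:
            # Level 1: keep only the top plans according to the plan scorer.
            plans = sorted(
                propose_plans(state),
                key=lambda p: plan_value(state, p),
                reverse=True,
            )[:plan_beam]
            for plan in plans:
                p_score = plan_value(state, plan)
                # Level 2: score every query proposed for the surviving plan.
                for query in propose_searches(state, plan):
                    s_score = search_value(state, plan, query)
                    step = f"{plan} -> {query}"
                    expanded.append((state + (step,), score + p_score + s_score))
        if not expanded:
            break  # no candidates were proposed; stop early
        # Keep the best joint (plan, query) trajectories for the next round.
        expanded.sort(key=lambda item: item[1], reverse=True)
        beams = expanded[:beam_width]
    return max(beams, key=lambda item: item[1])


# Hypothetical stand-ins for the planner, the retriever, and the two scorers.
def toy_plans(state: State) -> List[str]:
    return ["find author", "find year"]


def toy_searches(state: State, plan: str) -> List[str]:
    return [plan + " wiki", plan + " news"]


def toy_plan_value(state: State, plan: str) -> float:
    return 1.0 if "author" in plan else 0.5


def toy_search_value(state: State, plan: str, query: str) -> float:
    return 1.0 if query.endswith("wiki") else 0.2


best_state, best_score = hierarchical_beam_search(
    toy_plans, toy_searches, toy_plan_value, toy_search_value, depth=2
)
print(best_state, best_score)
```

Keeping the two scorers as separate callables mirrors the decoupling idea in the summary: either one could be swapped out without touching the other. How the actual system combines the two scores is not stated here, so the simple sum is an assumption.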
AI Agents, RAG, Reasoning

Summary generated by Claude — human-verified