COMPASS: Cognitive MCTS-Guided Process Alignment for Safe Search Agents
Signal
72
Hype
25
In three linesCOMPASS is a safety alignment framework for multi-step LLM search agents. It combines Cognitive Tree Exploration (CTE) to synthesize stealthy attack trajectories and Introspective Step-wise Alignment (ISA) to supervise risky intermediate actions. Results: favorable safety-utility trade-off requiring substantially less training data.Read source
Your take?
Summary generated by Claude — human-verified