arXiv cs.AI

Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs

Signal: 75 · Hype: 35
In three lines: Researchers demonstrate that hidden reasoning traces in LLMs can be extracted via Reasoning Exposure Prompting (REP), a lightweight prompting method using shadow-model-generated demonstrations in auxiliary code-like formats. REP exposes internal traces even when deployed systems intentionally hide them, while preserving useful reasoning signals for distillation.
Reasoning · Prompt engineering · Fine-tuning · AI safety

Summary generated by Claude — human-verified