arXiv cs.LG·1 June 2026

NumLeak: Public Numeric Benchmarks as Latent Labels in Foundation Models

In three lines: NumLeak measures how much frontier LLMs have memorized public numeric benchmarks. Models recall Fama-French data (r = 0.97-0.99), US unemployment, and NOAA temperature series with high fidelity. On recent, unseen data the parse rate drops to 21-57%, but r stays at about 0.99 for the months the models do answer. A one-line system-prompt defense blocks 99.8% of attacks.
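The summary reports two separate numbers, parse rate and r, and it helps to see why one can fall while the other stays high. The sketch below is a minimal illustration, not the paper's evaluation code: it assumes parse rate means the share of replies that contain a usable number and that r is the Pearson correlation computed only over those answered months. The function names, the regex-based parser, and the sample replies and values are all hypothetical.

```python
import math
import re

def parse_number(reply):
    """Return the first number found in a reply, or None (hypothetical parser)."""
    # Simplification: any digits in a refusal would also count as an answer.
    match = re.search(r"-?\d+(?:\.\d+)?", reply)
    return float(match.group()) if match else None

def pearson_r(xs, ys):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)

def score(replies, truth):
    """Return (parse_rate, r), with r computed over answered months only."""
    pairs = [(parse_number(reply), value) for reply, value in zip(replies, truth)]
    answered = [(guess, value) for guess, value in pairs if guess is not None]
    parse_rate = len(answered) / len(replies)
    r = pearson_r([g for g, _ in answered], [v for _, v in answered])
    return parse_rate, r

# Made-up monthly values and model replies, for illustration only.
truth = [3.9, 4.0, 4.1, 4.2, 4.3]
replies = ["3.9", "I cannot know that.", "4.1", "No data available.", "4.3"]

parse_rate, r = score(replies, truth)
print(f"parse rate = {parse_rate:.0%}, r on answered months = {r:.2f}")
```

Because refusals are dropped before the correlation is computed, a model that declines most months can still show a near-perfect r on the few it answers, so the two figures are best read together.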

Benchmarks · Evals · AI safety · Alignment

Summary generated by Claude — human-verified
