LEAF: A Living Benchmark for Event-Augmented Forecasting
Signal
72
Hype
28
In three linesLEAF is a living benchmark to evaluate LLM forecasting capabilities using multidimensional events. The system uses recursive retrieval agents and dual-agent cross-validation to provide relevant auxiliary text. Testing shows LLMs leverage signals from complex events to improve predictions, particularly on equities.Read source
Your take?
Summary generated by Claude — human-verified