Back to feed
arXiv cs.LG·

LEAF: A Living Benchmark for Event-Augmented Forecasting

Signal
72
Hype
28
In three linesLEAF is a living benchmark to evaluate LLM forecasting capabilities using multidimensional events. The system uses recursive retrieval agents and dual-agent cross-validation to provide relevant auxiliary text. Testing shows LLMs leverage signals from complex events to improve predictions, particularly on equities.
Read source
Your take?
BenchmarksAI AgentsMulti-agentReasoningRAG

Summary generated by Claude — human-verified