Back to feed
arXiv cs.AI·

BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces

Signal
75
Hype
15
In three linesBehaviorBench is a benchmark for evaluating personalized decision modeling from real-world behavioral traces. Built on 2,000 wallets with 141,445 belief-prediction instances and 1,485,972 trade-prediction instances, it tests whether generative models can adapt predictions to individual users without relying on simulated behavior.
Read source
Your take?
BenchmarksEvalsPapers

Summary generated by Claude — human-verified