arXiv cs.AI

SVFSearch: A Multimodal Knowledge-Intensive Benchmark for Short-Video Frame Search in the Gaming Vertical Domain

Signal: 78
Hype: 15
In three lines
SVFSearch is a multimodal benchmark for short-video frame search in the Chinese gaming domain. It contains 5,000 test examples and 4,198 training examples based on real game scenes. The evaluation compares direct QA, RAG, Plan-Act-Replan agents, and learned search models: the best open-source model reaches 66.4%, the best practical agent 79.1%, and the oracle 95.4%.
Benchmarks, AI Agents, RAG, Vision, Reasoning

Summary generated by Claude — human-verified