arXiv cs.AI·19 May 2026

Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

Signal

Hype

In three linesSystematic evaluation of multimodal LLMs on video anomaly detection (VAD) using ShanghaiTech and CHAD benchmarks. Models exhibit conservative bias in zero-shot settings: high precision but recall collapse. Class-specific instructions improve F1-score from 0.09 to 0.64 on ShanghaiTech, yet recall remains a critical bottleneck for real-world surveillance.

Read source

Your take?

Vision Reasoning Prompt engineering Benchmarks Evals

Summary generated by Claude — human-verified

Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

Other angles on this story