Back to feed
arXiv cs.AI·

On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression

Signal
72
Hype
15
In three linesStudy of adversarial robustness in compressed vision-language models. Authors propose CAGE attack that exploits the mismatch between perturbation optimization (full tokens) and inference (via compression). CAGE combines expected feature disruption and rank distortion alignment to expose hidden vulnerabilities in compressed LVLMs.
Read source
Your take?
VisionAI safetyBenchmarks

Summary generated by Claude — human-verified