Back to feed
arXiv cs.AI·

CounterCount: A Diagnostic Framework for Counting Bias in Vision Language Models

Signal
78
Hype
15
In three linesCounterCount is a diagnostic framework to evaluate counting bias in vision-language models. Tests show VLMs perform well on factual images but degrade significantly on counterfactual images where visual attributes contradict learned priors. An inference-time attention modulation strategy improves accuracy by up to 8% across multiple VLMs.
Read source
Your take?
VisionEvalsBenchmarksAlignment

Summary generated by Claude — human-verified