Back to feed
Hugging Face Blog·

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Signal
75
Hype
25
In three linesGoogle releases PaliGemma 2 Mix, a family of instruction-tuned vision-language models based on Gemma 2. Three variants (3B, 10B, 28B) combine visual and textual capabilities for multimodal tasks. Available open-source on Hugging Face.
Read source
Your take?
GeminiVisionOpen sourceMulti-agent

Summary generated by Claude — human-verified