Back to feed
Hugging Face Blog·

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Signal
75
Hype
25
In three linesGoogle releases PaliGemma, an open-source vision-language model built on Gemma 2B. The model combines an image encoder and text decoder for multilingual visual understanding tasks. Weights available on Hugging Face.
Read source
Your take?
GeminiVisionOpen sourceTools

Summary generated by Claude — human-verified