We now support VLMs in smolagents!
In three lines
Hugging Face adds Vision Language Model (VLM) support to smolagents. Agents can now process images and text together for multimodal tasks.
Summary generated by Claude — human-verified