RSS

Hugging Face Blog

Hugging Face introduces MolmoMotion, a language-guided 3D motion forecasting model. The system combines vision and language to predict future trajectories from videos, enabling applications in robotics and animation.

Vision Robotics

SIG

HYP

Hugging Face Blog·Jun 17

From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot

Hugging Face and Strands integrate Hub models with LeRobot to deploy AI agents on robot hardware. The platform enables developers to use pre-trained models to control physical robots directly.

AI Agents Robotics Open source

SIG

HYP

Hugging Face Blog·Jun 17

GLM-5.2: Built for Long-Horizon Tasks

Hugging Face announces GLM-5.2, a model designed for long-horizon tasks. The model improves capacity to handle extended contexts and complex multi-step workflows.

DeepMind Reasoning Benchmarks

SIG

HYP

Hugging Face Blog·Jun 17

Agentic Resource Discovery: Let agents search

Hugging Face introduces agentic resource discovery, enabling AI agents to autonomously search and access models, datasets, and tools available on the platform. This capability enhances agent autonomy in executing complex tasks.

AI Agents Tools Open source

SIG

HYP

Hugging Face Blog·Jun 12

olmo-eval: An evaluation workbench for the model development loop

Hugging Face releases olmo-eval, an evaluation workbench for the model development loop. The tool automates performance testing and enables rapid iteration during language model training and fine-tuning.

Tools Evals Open source

SIG

HYP

Hugging Face Blog·Jun 11

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Part 2 of a PyTorch profiling guide. Detailed performance analysis of nn.Linear layers and construction of an optimized fused MLP. Demonstrates operation fusion techniques to reduce latency and improve computational efficiency.

Infrastructure Tools Code generation

SIG

HYP

Hugging Face Blog·Jun 9

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Hugging Face benchmarks frontier ASR models (Whisper, Canary, Conformer) on code-switched bilingual speech. Results reveal significant performance gaps across language pairs and models, exposing limitations of current voice agents for multilingual customer service.

Benchmarks Voice Evals

SIG

HYP

Hugging Face Blog·Jun 9

Introducing North Mini Code: Cohere’s First Model For Developers

Cohere releases North Mini Code, its first model designed for developers. The model is optimized for code generation and completion, with multilingual support and integration into the Hugging Face ecosystem.

Code generation Open source Tools

SIG

HYP

Hugging Face Blog·Jun 9

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces

An AI agent built a 3D Paris gallery by chaining two Hugging Face Spaces. The system automatically orchestrated image generation and 3D environment creation without manual intervention.

AI Agents Tools Open source

SIG

HYP

Hugging Face Blog·Jun 9

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

A researcher fine-tuned a language model to generate content optimized for ADHD brains by maximizing dopaminergic engagement. The approach combines fine-tuning on curated data and evaluation via behavioral metrics.

Fine-tuning Tools

SIG

HYP

Hugging Face Blog·Jun 9

Migrating Your GitHub CI to Hugging Face Jobs

Hugging Face releases a migration guide for moving GitHub CI/CD pipelines to Hugging Face Jobs. The platform provides a native alternative to automate testing and deployments within the Hugging Face ecosystem.

Tools Infrastructure Open source

SIG

HYP

Hugging Face Blog·Jun 8

The crash that vanished: control and emergence in a five-model economy

Hugging Face study on emergent behaviors in a multi-model system. Simulation of a five-model AI economy showing how crashes can occur and vanish based on agent interactions. Analysis of control and emergence phenomena in complex systems.

Multi-agent AI Agents Benchmarks

SIG

HYP

Hugging Face Blog·Jun 8

Building Pakistan Notice Helper: A Small AI Tool for a Very Local Safety Problem

Hugging Face built Pakistan Notice Helper, a small AI tool addressing a specific local safety problem in Pakistan. The tool leverages lightweight models tailored to regional needs and resource constraints.

Tools Open source AI safety

SIG

HYP

Hugging Face Blog·Jun 8

The Open Source Community is backing OpenEnv for Agentic RL

The open source community is backing OpenEnv, a platform for agentic reinforcement learning. The project gains growing adoption and active collaboration within the open source ecosystem.

AI Agents Reinforcement learning Open source

SIG

HYP

Hugging Face Blog·Jun 7

Room360: Video-to-3D Spatial Reconstruction Platform

Hugging Face introduces Room360, a video-to-3D spatial reconstruction platform. The tool converts video sequences into usable 3D models for immersive and architectural applications.

Video generation Tools Open source

SIG

HYP

Hugging Face Blog·Jun 7

Sponsors especially OPENAI CODEX voucher usage for codex - openAI challange

OpenAI offers Codex vouchers to Hugging Face sponsors to test the code generation model. Partnership initiative between OpenAI and the community platform.

OpenAI Code generation Business

SIG

HYP

Hugging Face Blog·Jun 7

Her · हेर — a detective for your Claude Code sessions

Hugging Face releases Her, a debugging tool for Claude Code sessions. Her analyzes interactions and identifies issues in AI-assisted coding workflows.

Claude Code Tools Code generation

SIG

HYP

Hugging Face Blog·Jun 6

Five labs, five minds: building a multi-model finance drama on small models

Hugging Face showcases a collaborative experiment across five labs using small models to build a dramatized financial scenario. The project demonstrates how reduced-size models can be orchestrated to generate complex narratives in a specialized domain.

Open source Multi-agent AI Agents

SIG

HYP

Hugging Face Blog·Jun 6

Job Searcher

Hugging Face launches Job Searcher, an AI-powered job search tool that helps candidates find relevant positions. The tool uses language models to analyze job postings and match candidate profiles.

Tools Business

SIG

HYP

Hugging Face Blog·Jun 6

Persona Atlas: Mapping How Famous Minds Think

Hugging Face introduces Persona Atlas, a tool mapping the thinking styles of famous personalities. The project analyzes cognitive patterns and decision-making approaches of public figures using language models.

Tools Open source

SIG

HYP

Hugging Face Blog·Jun 5

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Hugging Face deploys a multi-agent economy on a 3B (3 billion parameter) model. The 'Thousand Token Wood' system enables autonomous agents to interact, negotiate, and exchange resources in a simulated environment with constrained token budgets.

Multi-agent AI Agents Open source

SIG

HYP

Hugging Face Blog·Jun 4

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

NVIDIA releases Nemotron 3.5 Content Safety, an open-source multimodal safety model detecting harmful content across text, image, and video. Customizable for global enterprises, it provides granular policy control for moderation across regions and use cases.

AI safety Vision Video generation

SIG

HYP

Hugging Face Blog·Jun 4

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Hugging Face releases a guide to fine-tune Nemotron 3.5 ASR, NVIDIA's speech recognition model. The method enables adapting the model to specific languages, domains, or accents through fine-tuning.

Fine-tuning Voice Tools

SIG

HYP

Hugging Face Blog·Jun 4

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Hugging Face releases EVA-Bench Data 2.0, a benchmark spanning 3 domains, 121 tools, and 213 scenarios to evaluate multi-tool AI agents. Major expansion from previous version to test models' ability to orchestrate complex tool interactions.

AI Agents Multi-agent Benchmarks

SIG

HYP

Hugging Face Blog·Jun 4

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

Nvidia releases a task-seeded synthetic Q&A generation method for Nemotron pretraining. The technique uses task seeds to create diverse, high-quality training data, improving model performance across varied benchmarks.

Fine-tuning Papers Benchmarks

SIG

HYP

Hugging Face Blog·Jun 4

Designing the hf CLI as an agent-optimized way to work with the Hub

Hugging Face redesigns its CLI to optimize it as an agent. The command-line interface becomes agent-friendly with structured commands and parsable responses, enabling autonomous systems to interact directly with the Hub.

AI Agents Tools Infrastructure

SIG

HYP

Hugging Face Blog·Jun 3

Direct Preference Optimization Beyond Chatbots

Hugging Face explores applying DPO (Direct Preference Optimization) beyond chatbots, including for vision and reasoning model optimization. The article details how this alignment technique can improve performance on complex tasks without requiring an explicit reward model.

Fine-tuning Alignment Reinforcement learning

SIG

HYP

Hugging Face Blog·Jun 3

Adding MCP Tools to Reachy Mini

Hugging Face integrates MCP (Model Context Protocol) tools into Reachy Mini, a humanoid robot. This integration enables the robot to access external tools via MCP protocol, expanding its interaction and autonomy capabilities.

MCP Robotics Tools

SIG

HYP

Hugging Face Blog·Jun 2

Holo3.1: Fast & Local Computer Use Agents

Hugging Face releases Holo3.1, a fast local computer use agent for task automation. The model runs on-device without cloud dependency, enabling speed and privacy for system-level actions.

AI Agents Open source Tools

SIG

HYP

Hugging Face Blog·Jun 1

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains releases Mellum2, a 12B Mixture-of-Experts model. The model combines computational efficiency with performance, designed for code and reasoning tasks.

Code generation Open source Benchmarks

SIG

HYP

Hugging Face Blog·Jun 1

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Hugging Face argues that enterprise AI adoption beyond LLMs requires scalable agent logic. The article explores how multi-agent systems and orchestration become critical for deploying AI beyond simple use cases.

AI Agents Multi-agent Business

SIG

HYP

Hugging Face Blog·Jun 1

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

NVIDIA releases Cosmos 3, an open omni-model for physical AI that reasons and acts. The model processes video, text, and images to understand real-world physics and generate robotic actions.

Robotics Vision Open source

SIG

HYP

Hugging Face Blog·May 29

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Beginner's guide to PyTorch profiling using torch.profiler. Covers how to measure performance and identify bottlenecks in AI models, with practical examples for newcomers.

Tools Infrastructure

SIG

HYP

Hugging Face Blog·May 27

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ITBench-AA, a new benchmark from Artificial Analysis and IBM, evaluates frontier models on agentic enterprise IT tasks. Top models (Claude, GPT-4, Gemini) score below 50%, exposing significant gaps in automating complex IT workflows.

Benchmarks AI Agents Claude

SIG

HYP

Hugging Face Blog·May 27

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Hugging Face introduces Delta Weight Sync in TRL to optimize deployment of trillion-parameter models. The technique syncs only weight changes rather than full models, drastically reducing storage and bandwidth requirements for updates.

Infrastructure Open source

SIG

HYP

Hugging Face Blog·May 27

Reachy Mini goes fully local

Reachy Mini, Pollen Robotics' humanoid robot, now runs fully locally without cloud dependency. Integrates open-source models (Llama, Whisper) for vision, speech, and motor control. Deployed on embedded hardware.

Robotics Open source Llama

SIG

HYP

Hugging Face Blog·May 25

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Hugging Face clarifies AI agent terminology: distinguishing harness (execution infrastructure), scaffold (coordination structure), and agent (autonomous system). Essential definitions to avoid confusion in the ecosystem.

AI Agents

SIG

HYP

Hugging Face Blog·May 23

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Nvidia and Hugging Face introduce Nemotron-Labs, diffusion-based language models to accelerate text generation. The approach parallelizes token generation, reducing latency compared to traditional autoregressive methods.

Code generation Benchmarks Open source

SIG

HYP

Hugging Face Blog·May 22

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

Hugging Face argues that AI model specialization outperforms raw scale in procurement decisions. Organizations typically favor large generalist models, overlooking that smaller specialized models deliver better performance and lower costs for specific tasks.

Open source Business Benchmarks

SIG

HYP

Hugging Face Blog·May 19

OlmoEarth v1.1: A more efficient family of models

Hugging Face releases OlmoEarth v1.1, a more efficient family of models for geospatial tasks. The new models deliver improved performance and inference speed compared to the previous version.

Open source Benchmarks Tools

SIG

HYP

Hugging Face Blog·May 19

Introducing the Ettin Reranker Family

Hugging Face introduces the Ettin Reranker family, models designed to improve search relevance and RAG result ranking. These rerankers optimize document ranking after initial retrieval.

RAG Vector search Tools

SIG

HYP

Hugging Face Blog·May 18

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

Hugging Face releases a guide for fine-tuning NVIDIA Cosmos Predict 2.5, a robot video generation model, using LoRA/DoRA. The method reduces GPU resource requirements while maintaining generation quality for specialized robotics use cases.

Fine-tuning Video generation Robotics

SIG

HYP

Hugging Face Blog·May 18

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

PaddleOCR 3.5 integrates a Transformers backend for OCR and document parsing tasks. The new version improves accuracy and flexibility by leveraging Transformers models, enabling better text recognition and structured data extraction.

Open source Vision Tools

SIG

HYP

Hugging Face Blog·May 18

The Open Agent Leaderboard

Hugging Face launches a public leaderboard to evaluate open-source AI agents. The platform ranks models by their ability to complete complex tasks, with reproducible benchmarks and transparent results.

AI Agents Benchmarks Open source

SIG

HYP

Hugging Face Blog·May 14

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

IBM and Hugging Face release Granite Embedding Multilingual R2, an open-source embedding model under Apache 2.0 license. The model supports 32K token context and delivers best-in-class retrieval quality for sub-100M parameter models across multiple languages.

Embeddings Open source RAG

SIG

HYP

Hugging Face Blog·May 14

Unlocking asynchronicity in continuous batching

Hugging Face introduces an asynchronicity technique for optimizing continuous batching in inference servers. The method improves throughput by handling requests non-blockingly, reducing latency and increasing GPU resource utilization.

Infrastructure Tools Open source

SIG

HYP

Hugging Face Blog·May 11

Building Blocks for Foundation Model Training and Inference on AWS

Hugging Face and AWS collaborate to provide optimized building blocks for foundation model training and inference on AWS infrastructure, including SageMaker integrations and open-source tools.

Infrastructure Open source Tools

SIG

HYP

Hugging Face Blog·May 8

EMO: Pretraining mixture of experts for emergent modularity

Hugging Face introduces EMO, a pretrained mixture of experts (MoE) model designed to develop emergent modularity. The approach aims to create specialized experts that naturally form during training, improving model efficiency and performance.

Open source Infrastructure Benchmarks

SIG

HYP

Hugging Face Blog·May 6

vLLM V0 to V1: Correctness Before Corrections in RL

vLLM transitions from v0 to v1 prioritizing correctness before optimizations. The update introduces reliability and accuracy improvements in LLM inference, focusing on result validation before applying reinforcement learning techniques.

Infrastructure Reinforcement learning Evals

SIG

HYP

Hugging Face Blog·May 6

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

Hugging Face adds anti-Benchmaxxer filtering to the open ASR leaderboard to prevent artificial benchmark optimization. The system detects models over-optimized for test metrics without real generalization.

Benchmarks Open source Evals

SIG

HYP

Hugging Face Blog

MolmoMotion: Language-guided 3D motion forecasting

From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot

GLM-5.2: Built for Long-Horizon Tasks

Agentic Resource Discovery: Let agents search

olmo-eval: An evaluation workbench for the model development loop

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Introducing North Mini Code: Cohere’s First Model For Developers

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

Migrating Your GitHub CI to Hugging Face Jobs

The crash that vanished: control and emergence in a five-model economy

Building Pakistan Notice Helper: A Small AI Tool for a Very Local Safety Problem

The Open Source Community is backing OpenEnv for Agentic RL

Room360: Video-to-3D Spatial Reconstruction Platform

Sponsors especially OPENAI CODEX voucher usage for codex - openAI challange

Her · हेर — a detective for your Claude Code sessions

Five labs, five minds: building a multi-model finance drama on small models

Job Searcher

Persona Atlas: Mapping How Famous Minds Think

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

Designing the hf CLI as an agent-optimized way to work with the Hub

Direct Preference Optimization Beyond Chatbots

Adding MCP Tools to Reachy Mini

Holo3.1: Fast & Local Computer Use Agents

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Reachy Mini goes fully local

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

OlmoEarth v1.1: A more efficient family of models

Introducing the Ettin Reranker Family

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

The Open Agent Leaderboard

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Unlocking asynchronicity in continuous batching

Building Blocks for Foundation Model Training and Inference on AWS

EMO: Pretraining mixture of experts for emergent modularity

vLLM V0 to V1: Correctness Before Corrections in RL

Adding Benchmaxxer Repellant to the Open ASR Leaderboard