Bria is now on Replicate
Bria joins Replicate with commercial-grade image generation and editing models built on licensed data. Designed for enterprises and developers, these tools provide a safe alternative for visual AI applications.
Bria joins Replicate with commercial-grade image generation and editing models built on licensed data. Designed for enterprises and developers, these tools provide a safe alternative for visual AI applications.
Invideo AI integrates OpenAI's GPT-4.1, gpt-image-1, and text-to-speech models to generate professional videos 10x faster. The platform transforms creative ideas into videos within minutes.
Hugging Face releases Ettin Suite, a collection of state-of-the-art paired encoders and decoders. Models optimize translation and text generation tasks with symmetric architectures.
Hugging Face introduces an approach decoupling action prediction from execution in robotics. The system uses asynchronous inference to improve latency and robot responsiveness, enabling parallel action execution with next prediction.
Hugging Face releases ScreenEnv, an environment for deploying full-stack desktop agents. The tool enables AI models to interact with complete graphical interfaces, unlocking advanced automation use cases.
Hugging Face publishes a guide for creating custom kernels optimized for AMD MI300 GPUs. The tutorial covers CUDA/HIP kernel implementation and integration into inference pipelines to improve performance.
Hugging Face introduces Reachy Mini, an open-source robot for AI developers. The robot features vision, manipulation, and mobility capabilities with native support for modern AI models and reinforcement learning workflows.
Hugging Face introduces MCP (Model Context Protocol) servers integrated with Gradio to enhance LLM capabilities. This approach enables models to access custom tools and interfaces through the standardized MCP protocol.
Hugging Face releases a guide for training and finetuning sparse embedding models with Sentence Transformers. The method reduces dimensionality while maintaining performance for semantic search and information retrieval.
Retell AI launches a no-code voice agent platform powered by GPT-4o and GPT-4.1 for call center automation. Natural voice agents reduce costs, improve customer satisfaction (CSAT), and eliminate scripts and hold times.
SGLang integrates Hugging Face Transformers backend for LLM inference. This integration enables SGLang users to directly access Hugging Face models with native optimizations, improving compatibility and performance.
Groq integrates with Hugging Face as an inference provider. Users access HF models through Groq's API to leverage Groq's inference speed.
OpenAI launches a government-focused initiative to provide U.S. federal agencies with access to its most advanced AI tools. The program aims to support government adoption of cutting-edge technology for public service delivery.
Featherless AI, an inference provider, integrates with Hugging Face Inference Providers. Users can deploy and serve models through the Hugging Face ecosystem with multi-model support and latency optimizations.
OpenAI releases its Outbound Coordinated Disclosure Policy governing how it reports vulnerabilities in third-party software. The program emphasizes collaboration, integrity, and proactive security at scale.
Hugging Face releases ScreenSuite, a comprehensive evaluation suite for GUI agents. The tool measures models' ability to interact with graphical interfaces through standardized, reproducible benchmarks.
OpenAI is challenging a court order from the New York Times lawsuit that would require indefinite retention of ChatGPT and API user data. The company claims to be defending user privacy against legal demands while honoring its data protection commitments.
Hugging Face publishes a tutorial on implementing KV cache from scratch in nanoVLM. The guide covers memory optimization mechanisms for vision-language models, enabling more efficient inference.
OpenAI releases o3, o4-mini, and GPT-4.1 to speed up code reviews. CodeRabbit integrates these models to improve PR accuracy, reduce bugs, and boost developer ROI.
OpenAI's latest models (GPT-4.1, GPT-4o, o-series) are now available on Replicate for inference.
Google DeepMind launches SynthID Detector, a portal to identify AI-generated content. Announced at Google I/O, this tool helps users understand the origin of online content.
Google DeepMind announces Gemini 2.5 with improvements for coding and a new experimental reasoning mode called Deep Think for 2.5 Pro. 2.5 Flash also receives updates.
Hugging Face introduces Falcon-Edge, a series of universal, fine-tunable language models with 1.58bit quantization. The models deliver extreme compression while maintaining reasoning and instruction-following capabilities.
Hugging Face standardizes model definitions through its Transformers library. The initiative aims to harmonize architectures and improve interoperability across frameworks. Goal: reduce fragmentation and accelerate adoption of open-source models.
Replicate partners with Hugging Face to integrate its inference infrastructure into the platform. Users can now run 30,000+ LoRAs directly through Hugging Face.
Hugging Face optimizes Whisper transcriptions through Inference Endpoints with significant speed improvements. The platform offers dedicated infrastructure to accelerate audio processing in production.
Hugging Face launches LeRobot Community Datasets, an initiative to build robotics' equivalent of ImageNet. The project aims to centralize demonstration datasets for robot control to accelerate research and model development.
Lowe's launches Mylow and Mylow Companion, AI tools built with OpenAI to assist customers and store associates with home improvement projects. The AI assistants simplify planning and execution of complex renovation tasks both in-store and online.
Ideogram 3.0 is now available on Replicate with enhanced design, style transfer, and realism capabilities.
Google DeepMind releases an updated Gemini 2.5 Pro Preview with improved coding capabilities for building rich, interactive web applications.
Replicate offers an API to run MiniMax's Speech-02 models, providing high-quality text-to-speech with voice cloning, emotional expression, and multilingual support.
Hugging Face publishes a guide for building an MCP (Model Context Protocol) server with Gradio. The tutorial demonstrates how to integrate Gradio to create interactive interfaces compatible with the MCP protocol, enabling better integration with AI tools.
Hugging Face demonstrates fine-tuning olmOCR, an open-source OCR model based on OLMo. The approach improves optical character recognition fidelity across diverse document types.
Cohere integrates with Hugging Face Inference Providers, enabling access to Cohere models through the Hugging Face ecosystem. This integration streamlines deployment and usage of Cohere models for developers.
Hugging Face launches Arabic-specific leaderboards to evaluate instruction-following capabilities in Arabic. The platform updates AraGen and introduces new benchmarks to measure model performance on Arabic language tasks.
Meta releases Llama 4 Maverick and Scout on Hugging Face. Maverick is a high-performance model for complex tasks, Scout a lightweight model optimized for efficiency. Both are available on the Hugging Face platform.
Gradio reaches 1 million users. The ML interface-sharing platform has experienced exponential growth since launch, becoming the standard tool for demonstrating AI models in production.
Hugging Face is transforming its NLP Course into an LLM Course. Content shifts to modern language models, covering fine-tuning, RAG, agents, and deployment. Progressive updates to existing chapters and new modules.
Google DeepMind releases a framework to evaluate cybersecurity threats from advanced AI systems. The tool helps security experts identify necessary defenses and prioritize them.
Hugging Face accelerates LLM inference using Text Generation Inference (TGI) on Intel Gaudi processors. The solution optimizes latency and throughput for production deployments.