Rearchitecting Hugging Face Uploads and Downloads
Hugging Face rearchitects its uploads and downloads infrastructure. The platform optimizes storage and file transfer systems to improve performance and reliability for model and dataset operations.
17 articles
Hugging Face rearchitects its uploads and downloads infrastructure. The platform optimizes storage and file transfer systems to improve performance and reliability for model and dataset operations.
Hugging Face introduces SmolVLM, a compact yet performant vision-language model. The model combines computational efficiency with advanced multimodal capabilities for image understanding and text tasks.
Article exploring positional encoding design for transformers. Analyzes how different approaches (sinusoidal, RoPE, ALiBi) impact performance and sequence length generalization.
OpenAI publishes an approach combining humans and AI for red teaming (adversarial security testing). The method improves vulnerability detection by leveraging the respective strengths of human testers and AI models to identify security flaws.
OpenAI enables vision fine-tuning for GPT-4o. Trained models better recognize map elements (roads, buildings, landmarks) with fewer errors. Use case: improved mapping and navigation services.
Hugging Face launches the first multilingual LLM debate competition. Large language models compete on diverse topics across multiple languages, testing argumentation capabilities and critical reasoning.
Hugging Face improves storage efficiency by chunking large files. The new approach reduces redundancy and speeds up partial downloads for models and datasets.
Hugging Face introduces Self-Speculative Decoding, an optimization technique that accelerates text generation without requiring an additional model. The method leverages intermediate layers of the model to predict upcoming tokens, reducing latency while preserving output quality.
Hugging Face launches an open leaderboard to evaluate Japanese language models. The platform enables comparison of different LLM performances on Japanese-specific benchmarks.
Hugging Face introduces Judge Arena, a benchmark to evaluate LLMs' ability to serve as evaluators. The system tests how different models judge the quality of other LLM outputs, measuring their reliability as automated judges.
OpenAI opens its first office in continental Europe in France. The company strengthens its geographic presence and commitment to European regulators and partners.
Estée Lauder Companies leverages ChatGPT to extract data insights and optimize beauty and creativity strategies. Generative AI integration enhances data analysis and business decision-making in the cosmetics sector.
Hugging Face encourages researchers and developers to share open ML datasets on the Hub. The platform provides free storage, versioning, integrated documentation, and collaboration tools to facilitate data distribution and reuse.
The UK's liver transplant matching algorithm may systematically exclude younger patients. Seemingly minor technical decisions can have life-or-death effects.
Hugging Face and JetBrains integrate their platforms. PyCharm now provides direct access to Hugging Face models and datasets, with embedded autocompletion and documentation for ML workflows.
OpenAI submits comments to the NTIA on data center growth, resilience, and security. The document responds to an official information request from the U.S. telecommunications administration.
Argilla 2.4 enables building fine-tuning and evaluation datasets directly on Hugging Face Hub without coding. The platform provides a web interface for annotating, validating, and preparing data before model training.