November 2024

17 articles

Rearchitecting Hugging Face Uploads and Downloads

Hugging Face rearchitects its uploads and downloads infrastructure. The platform optimizes storage and file transfer systems to improve performance and reliability for model and dataset operations.

Infrastructure Tools

SIG

HYP

Hugging Face Blog·Nov 26

SmolVLM - small yet mighty Vision Language Model

Hugging Face introduces SmolVLM, a compact yet performant vision-language model. The model combines computational efficiency with advanced multimodal capabilities for image understanding and text tasks.

Vision Open source Benchmarks

SIG

HYP

Hugging Face Blog·Nov 25

You could have designed state of the art positional encoding

Article exploring positional encoding design for transformers. Analyzes how different approaches (sinusoidal, RoPE, ALiBi) impact performance and sequence length generalization.

Reasoning Papers

SIG

HYP

OpenAI Blog·Nov 21

Advancing red teaming with people and AI

OpenAI publishes an approach combining humans and AI for red teaming (adversarial security testing). The method improves vulnerability detection by leveraging the respective strengths of human testers and AI models to identify security flaws.

OpenAI AI safety Alignment

SIG

HYP

OpenAI Blog·Nov 20

Building smarter maps with GPT-4o vision fine-tuning

OpenAI enables vision fine-tuning for GPT-4o. Trained models better recognize map elements (roads, buildings, landmarks) with fewer errors. Use case: improved mapping and navigation services.

GPT OpenAI Vision

SIG

HYP

Hugging Face Blog·Nov 20

Letting Large Models Debate: The First Multilingual LLM Debate Competition

Hugging Face launches the first multilingual LLM debate competition. Large language models compete on diverse topics across multiple languages, testing argumentation capabilities and critical reasoning.

Benchmarks Reasoning Evals

SIG

HYP

Hugging Face Blog·Nov 20

From Files to Chunks: Improving HF Storage Efficiency

Hugging Face improves storage efficiency by chunking large files. The new approach reduces redundancy and speeds up partial downloads for models and datasets.

Infrastructure Tools

SIG

HYP

Hugging Face Blog·Nov 20

Faster Text Generation with Self-Speculative Decoding

Hugging Face introduces Self-Speculative Decoding, an optimization technique that accelerates text generation without requiring an additional model. The method leverages intermediate layers of the model to predict upcoming tokens, reducing latency while preserving output quality.

Code generation Infrastructure Tools

SIG

HYP

Hugging Face Blog·Nov 20

Introducing the Open Leaderboard for Japanese LLMs!

Hugging Face launches an open leaderboard to evaluate Japanese language models. The platform enables comparison of different LLM performances on Japanese-specific benchmarks.

Benchmarks Open source Tools

SIG

HYP

Hugging Face Blog·Nov 19

Judge Arena: Benchmarking LLMs as Evaluators

Hugging Face introduces Judge Arena, a benchmark to evaluate LLMs' ability to serve as evaluators. The system tests how different models judge the quality of other LLM outputs, measuring their reliability as automated judges.

Benchmarks Evals Open source

SIG

HYP

OpenAI Blog·Nov 15

OpenAI en France

OpenAI opens its first office in continental Europe in France. The company strengthens its geographic presence and commitment to European regulators and partners.

OpenAI Business

SIG

HYP

OpenAI Blog·Nov 13

Data-driven beauty and creativity with ChatGPT

Estée Lauder Companies leverages ChatGPT to extract data insights and optimize beauty and creativity strategies. Generative AI integration enhances data analysis and business decision-making in the cosmetics sector.

GPT OpenAI Business

SIG

HYP

Hugging Face Blog·Nov 12

Share your open ML datasets on Hugging Face Hub!

Hugging Face encourages researchers and developers to share open ML datasets on the Hub. The platform provides free storage, versioning, integrated documentation, and collaboration tools to facilitate data distribution and reuse.

Open source Tools Infrastructure

SIG

HYP

AI Snake Oil·Nov 11

Does the UK’s liver transplant matching algorithm systematically exclude younger patients?

The UK's liver transplant matching algorithm may systematically exclude younger patients. Seemingly minor technical decisions can have life-or-death effects.

Alignment AI safety Regulation

SIG

HYP

Hugging Face Blog·Nov 5

Hugging Face + PyCharm

Hugging Face and JetBrains integrate their platforms. PyCharm now provides direct access to Hugging Face models and datasets, with embedded autocompletion and documentation for ML workflows.

Tools Infrastructure

SIG

HYP

OpenAI Blog·Nov 4

OpenAI’s comments to the NTIA on data center growth, resilience, and security

OpenAI submits comments to the NTIA on data center growth, resilience, and security. The document responds to an official information request from the U.S. telecommunications administration.

OpenAI Regulation Infrastructure

SIG

HYP

Hugging Face Blog·Nov 4

Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required

Argilla 2.4 enables building fine-tuning and evaluation datasets directly on Hugging Face Hub without coding. The platform provides a web interface for annotating, validating, and preparing data before model training.

Fine-tuning Tools Open source

SIG

HYP