May 2023

23 articles

Improving mathematical reasoning with process supervision

OpenAI trains a model using process supervision (rewarding each correct reasoning step) instead of outcome supervision (rewarding final answers). This approach achieves state-of-the-art mathematical problem solving and improves alignment by directly training models to produce human-endorsed chain-of-thought reasoning.

OpenAI Reasoning Reinforcement learning

Hugging Face Blog·May 31

Introducing BERTopic Integration with the Hugging Face Hub

Hugging Face integrates BERTopic, a BERT-based topic modeling tool, directly into the Hub. Users can train, share, and deploy BERTopic models through Hugging Face's unified interface.

Embeddings Tools Open source

Hugging Face Blog·May 31

Introducing the Hugging Face LLM Inference Container for Amazon SageMaker

Hugging Face releases an optimized LLM inference container for Amazon SageMaker, enabling streamlined deployment of language models in production with improved performance and scalability.

Tools Infrastructure Open source

OpenAI Blog·May 25

Democratic inputs to AI

OpenAI launches a program awarding ten $100,000 grants to fund experiments in democratic processes for deciding what rules AI systems should follow, within legal bounds.

OpenAI Regulation AI safety

Hugging Face Blog·May 25

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Hugging Face and Intel optimize Stable Diffusion for Intel CPUs using NNCF and Optimum. 8-bit quantization reduces model size by 75% and accelerates inference 2-3x on CPU with no significant visual quality loss.

Image generation Open source Tools

Hugging Face Blog·May 24

Hugging Face Collaborates with Microsoft to launch Hugging Face Model Catalog on Azure

Hugging Face and Microsoft launch Hugging Face Model Catalog on Azure. This integration enables users to access open-source models directly through Microsoft's cloud platform, streamlining deployment and inference in production.

Open source Infrastructure Business

Hugging Face Blog·May 24

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Hugging Face introduces 4-bit quantization with bitsandbytes and QLoRA to reduce LLM memory requirements. The technique enables fine-tuning a 65B-parameter model on a single 48GB GPU, making training accessible to more users.
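To put the memory savings in perspective, here is a rough back-of-envelope sketch of how much memory the model weights alone occupy at different precisions. The helper name `weight_memory_gb` is hypothetical, and the estimate deliberately ignores quantization constants, adapter weights, optimizer state, and activations, all of which add to the real footprint.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for the weights only, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    # Compare 16-bit, 8-bit, and 4-bit storage for a 65B-parameter model.
    for bits in (16, 8, 4):
        print(f"{bits}-bit: {weight_memory_gb(65e9, bits):.1f} GB")
```

Going from 16-bit to 4-bit storage cuts the weight footprint by a factor of four, which is the main reason quantized fine-tuning fits on far smaller hardware.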

Fine-tuning Open source Tools

Hugging Face Blog·May 23

Instruction-tuning Stable Diffusion with InstructPix2Pix

Hugging Face introduces InstructPix2Pix, an instruction-tuning method for Stable Diffusion enabling image editing via text instructions. The model learns to apply specific edits (style changes, object modifications, color adjustments) from a source image and natural language instruction.

Image generation Fine-tuning Prompt engineering

Hugging Face Blog·May 23

Hugging Face and IBM partner on watsonx.ai, the next-generation enterprise studio for AI builders

Hugging Face and IBM partner to integrate Hugging Face open-source models into watsonx.ai, IBM's enterprise AI platform. The collaboration gives developers access to language and vision models through a unified interface, with fine-tuning and production deployment support.

Open source Business Fine-tuning

Hugging Face Blog·May 23

🐶Safetensors audited as really safe and becoming the default

Safetensors has passed an independent security audit. The format is becoming the default for model storage on Hugging Face, gradually replacing legacy pickle-based formats.

Tools Open source AI safety

OpenAI Blog·May 22

Governance of superintelligence

OpenAI calls for early thinking on the governance of superintelligence, meaning future AI systems dramatically more capable than even AGI. The post opens a foundational debate on the control frameworks needed before such systems emerge.

OpenAI Regulation AI safety

OpenAI Blog·May 18

Introducing the ChatGPT app for iOS

OpenAI launches the ChatGPT app for iOS, with conversation sync, voice input, and access to the latest model improvements.

OpenAI Voice Tools

Hugging Face Blog·May 16

Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon

Hugging Face introduces Q8-Chat, a model optimized for Intel Xeon processors delivering efficient generative AI. The model reduces size while maintaining performance, enabling deployment on standard CPU infrastructure without GPUs.

Open source Infrastructure Code generation

Hugging Face Blog·May 16

Large-scale Near-deduplication Behind BigCode

BigCode built large-scale near-deduplication infrastructure to clean code data. The system identifies and removes near-duplicates across billions of files, improving training dataset quality for code models.
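The summary does not say which method the pipeline uses, so the following is only a minimal sketch of one common near-deduplication technique, MinHash, which estimates the Jaccard similarity between two documents from short signatures. The function names, the word-level shingling, and the number of hash functions are all illustrative assumptions.

```python
import hashlib
from typing import List, Set


def shingles(text: str, n: int = 3) -> Set[str]:
    """Split text into overlapping n-word shingles (assumed tokenization: whitespace)."""
    tokens = text.split()
    return {" ".join(tokens[i:i + n]) for i in range(max(1, len(tokens) - n + 1))}


def _hash(seed: int, item: str) -> int:
    """Deterministic 64-bit hash of an item, salted with a seed."""
    digest = hashlib.blake2b(f"{seed}:{item}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(items: Set[str], num_hashes: int = 64) -> List[int]:
    """For each seeded hash function, keep the minimum hash over all items."""
    return [min(_hash(seed, item) for item in items) for seed in range(num_hashes)]


def estimated_jaccard(sig_a: List[int], sig_b: List[int]) -> float:
    """The fraction of matching signature slots approximates Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


if __name__ == "__main__":
    a = "def add(a, b): return a + b  # add two numbers together"
    b = "def add(a, b): return a + b  # add two values together"
    sim = estimated_jaccard(minhash_signature(shingles(a)), minhash_signature(shingles(b)))
    print(f"estimated similarity: {sim:.2f}")
```

At scale, comparing every pair of signatures is too expensive, so signatures are typically bucketed first (for example with locality-sensitive hashing) and only documents sharing a bucket are compared.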

Code generation Benchmarks Open source

Hugging Face Blog·May 15

Run a Chatgpt-like Chatbot on a Single GPU with ROCm

Hugging Face demonstrates how to deploy a ChatGPT-like chatbot on a single GPU using ROCm. The guide covers memory optimization and efficient inference for large language models on AMD hardware.

Open source Tools Infrastructure

Hugging Face Blog·May 15

Introducing RWKV - An RNN with the advantages of a transformer

Hugging Face introduces RWKV, an RNN that can be trained in parallel like a transformer while keeping the linear inference complexity of an RNN. The architecture avoids the quadratic attention bottleneck.

Open source Reasoning Infrastructure

Hugging Face Blog·May 15

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

The French Data Protection Agency (CNIL) has selected Hugging Face for its Enhanced Support Program. The selection is presented as recognition of the company's commitment to data protection and regulatory compliance in AI.

Regulation Business

Hugging Face Blog·May 11

Assisted Generation: a new direction toward low-latency text generation

Hugging Face introduces Assisted Generation, a technique that reduces text generation latency by having a fast assistant model draft candidate tokens that the main model then validates. It delivers a significant speedup without quality loss.
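The draft-then-validate idea can be illustrated with a toy sketch of the greedy case. Here both "models" are plain Python callables that map a token sequence to the next token; the function names and the parameter `k` are hypothetical, and a real implementation would validate all drafted tokens in a single forward pass of the main model rather than one call per token.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]  # maps a token sequence to the next token


def greedy_generate(main_next: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Baseline: one main-model call per generated token."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(main_next(tokens))
    return tokens


def assisted_generate(main_next: NextToken, draft_next: NextToken,
                      prompt: List[int], max_new_tokens: int, k: int = 4) -> List[int]:
    """Draft up to k tokens with the fast model, keep the prefix the main model agrees with."""
    tokens = list(prompt)
    target_len = len(prompt) + max_new_tokens
    while len(tokens) < target_len:
        # 1. The draft model cheaply proposes a short continuation.
        draft: List[int] = []
        for _ in range(min(k, target_len - len(tokens))):
            draft.append(draft_next(tokens + draft))
        # 2. The main model checks each proposed token in order.
        for i, proposed in enumerate(draft):
            expected = main_next(tokens + draft[:i])
            if expected != proposed:
                # Keep the agreed prefix and substitute the main model's token.
                tokens.extend(draft[:i])
                tokens.append(expected)
                break
        else:
            tokens.extend(draft)  # every drafted token was accepted
    return tokens
```

Because every kept token is one the main model would have chosen itself, the output of this sketch matches plain greedy decoding; the latency gain in practice comes from validating several drafted tokens at once.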

Code generation Infrastructure Tools

OpenAI Blog·May 9

Language models can explain neurons in language models

OpenAI uses GPT-4 to automatically generate explanations for neuron behavior in large language models and score those explanations. A dataset of these explanations and scores for every neuron in GPT-2 is released.

OpenAI GPT Evals

Hugging Face Blog·May 9

Creating a Coding Assistant with StarCoder

Hugging Face publishes a guide for building a coding assistant with StarCoder. The open-source model generates code and can be integrated into applications via APIs. The article details architecture and practical use cases.

Code generation Open source Tools

Hugging Face Blog·May 8

A Dive into Text-to-Video Models

Hugging Face explores the architecture and capabilities of text-to-video models. The article details current approaches, technical challenges, and practical applications of this emerging technology.

Video generation Tools Open source

Hugging Face Blog·May 4

StarCoder: A State-of-the-Art LLM for Code

Hugging Face introduces StarCoder, a language model specialized in code generation. The model achieves performance comparable to proprietary solutions on standard coding benchmarks.

Code generation Open source Benchmarks

Hugging Face Blog·May 1

How to Install and Use the Hugging Face Unity API

Hugging Face releases an installation and usage guide for its Unity API, which lets game developers integrate AI models directly into Unity projects.

Tools Open source
