May 2026

Semantic Step Prediction: Multi-Step Latent Forecasting in LLM Reasoning Trajectories via Step Sampling

Research on semantic step prediction in LLM reasoning trajectories. Multi-step latent forecasting method via step sampling to improve language model reasoning performance.

Reasoning Papers

SIG

HYP

Fine-tuning AI safety Alignment

Workshop on Unlearning and Model Editing U&ME at ECCV 2026 [R]

Workshop on Unlearning and Model Editing (U&ME) at ECCV 2026. Platform to discuss techniques for modifying or removing specific knowledge from AI models without full retraining.

SIG

HYP

G7 agrees on shared language around open-source AI and open weights AI

G7 reaches agreement on shared terminology distinguishing open-source AI from open-weights AI. Governments formalize definitions already understood by the technical community.

Open source

SIG

HYP

GPT Open source Code generation

I trained gpt-1 on my local machine (RTX 2060 Super 8GB VRAM)

Developer trained GPT-1 (1B parameters) on RTX 2060 Super 8GB in 1 hour. Demonstrates that gamers can now pre-train specialized <1B models locally without cloud infrastructure. Code and model released on GitHub and HuggingFace.

SIG

HYP

I ported NVIDIA Parakeet (speech-to-text) to ggml: same output as NeMo, faster, GGUF-quantized, no Python

NVIDIA Parakeet speech-to-text ported to C++/ggml without Python or PyTorch. Byte-for-byte identical output to NeMo, up to 5x faster on GPU for larger models, 600x realtime on audio clips. Quantized GGUFs (f16, q8_0, q6_k, q5_k, q4_k), flat C API, integrated in LocalAI with OpenAI-compatible endpoint.

Voice Open source Tools

SIG

HYP

Hacker News (AI)·May 31

ChatGPT for Google Sheets Exfiltrates Workbooks

A ChatGPT extension for Google Sheets exfiltrates workbook data without explicit consent. Users believe they interact with OpenAI while the extension accesses entire spreadsheet contents.

OpenAI AI safety Tools

SIG

HYP

Claude Open source Fine-tuning

I trained gpt-1 on my local machine (RTX 2060 Super 8GB VRAM)

User trained GPT-1 on RTX 2060 Super (8 GB VRAM) in ~1 hour using Claude-generated code based on original implementation. Cost to reproduce GPT models dropped 500–1000× since GPT-2 ($43,000 → $48 per H100 cluster run).

SIG

HYP

Llama Code generation Infrastructure

Whats actually happening when a model spills out of VRAM into system memory?

Technical discussion on VRAM overflow mechanics in llama.cpp. User runs Gemma-4 26B (21GB) on RX6600XT + Ryzen 7 5700X with 32GB RAM, achieving ~20 tokens/s decode. Question: how is CPU/GPU split handled and what role do PCIe speed vs CPU play?

SIG

HYP

Hacker News (AI)·May 31

Is that song AI-generated? UChicago scientists create tool to check

University of Chicago researchers created a tool to detect AI-generated songs. The tool analyzes audio characteristics to identify typical signatures of synthetic generation.

AI safety Evals

SIG

HYP

Open source Infrastructure Tools

Llama Studio v0.2.0

Llama Studio v0.2.0 replaces JSON model config with per-model shell scripts, adds GPU splitting with tensor-split detection, and introduces session store with autoload on startup. Open-source WebUI for managing llama-server instances.

Llama Open source Tools

SIG

HYP

Hacker News (AI)·May 31

Netflix Wiz creates app to slash AI bills, then open sources it

Netflix Wiz created an app to reduce AI infrastructure costs and open sourced it. The tool helps organizations optimize their AI spending.

SIG

HYP

Qwen Benchmarks Code generation

Experiment : MTP models just as t/s efficient as non MTP models?

Benchmark on 9070XT GPU: Qwen 35B A3B MTP achieves 43.74 T/s vs 38.07 T/s standard mode. MTP shows ~15% throughput gain despite multi-token prediction overhead. Identical test conditions (prompt, 8192 context, Q4_K_XL quantization).

SIG

HYP

Hacker News (AI)·May 31

Talk Is Cheap: The Operational Impact of LLM Use

Study on the real operational impact of LLM use in production. Analyzes measurable costs, latencies, and productivity gains versus marketing claims.

Benchmarks Business

SIG

HYP

Hacker News (AI)·May 31

CT gov signs AI law to notify employees

Connecticut government signed a law requiring employers to notify employees before using AI for employment decisions. The measure aims to increase transparency and worker rights regarding AI systems.

Regulation AI safety

SIG

HYP

Hacker News (AI)·May 31

Show HN: Ouijit, an open-source task and terminal manager for coding agents

Ouijit is an open-source task and terminal manager for coding agents. Enables management of AI agent execution in development environments.

AI Agents Code generation Open source

SIG

HYP

Qwen3.6-35B vs Gemma4-26B on 7900 XTX

Benchmark on Radeon 7900 XTX: Qwen3.6-35B vs Gemma4-26B with reasoning enabled. Qwen generates 2x more tokens (14,811 vs 7,386) but Gemma is ~20% faster end-to-end (95.6s vs 118.8s). Qwen's MTP reaches 130 tok/s vs 78 tok/s, but token count becomes the bottleneck. Quality close, interesting per-task splits.

Qwen Gemini Reasoning

SIG

HYP

(YT) PewDiePie released his harness/webui

PewDiePie released Odysseus, a web UI/harness for local LLMs. The creator, without formal programming background (mechanical engineering studies), provides a non-developer perspective on local model accessibility.

SIG

HYP

Hacker News (AI)·May 31

Odysseus – self-hosted AI workspace

Odysseus is a self-hosted AI workspace. The project offers an open-source alternative to proprietary cloud platforms for running AI models and workflows locally.

SIG

HYP

I built a tool to browse and plan CVPR workshop/tutorial days [P]

CVPR Workshop Radar aggregates CVPR 2026 workshops and tutorials into a searchable web interface. Search by title/organizer/topic, filter by date/type/program availability, personal schedule, timeline view. Automated pipeline: PDF extraction → scraping → LLM processing. Open source, offline-capable, no account required.

Tools Open source

SIG

HYP

Llama Open source Infrastructure

Added an old 2070 Super to my rig and I can't go back...worse, now I need more

User reports adding an RTX 2070 Super (8 GB VRAM) to his high-end rig (RTX 5090, 9800X3D, 96 GB RAM) enables running Qwen 3.6-27B at Q8_0 with 144k context at 40-70 tok/s. Takeaway: more VRAM > raw performance for local inference.

SIG

HYP

Hacker News (AI)·May 31

1-Bit Bonsai Image 4B Image Generation for Local Devices

Bonsai Image 4B is a 1-bit quantized image generation model designed to run on local devices. The model compresses weights to 1-bit to drastically reduce size and computational requirements, enabling inference on resource-constrained hardware.

Image generation Open source Infrastructure

SIG

HYP

Open source Benchmarks Infrastructure

I built mlx-Chronos — a community benchmark leaderboard for local LLM engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama)

mlx-Chronos is an open-source CLI tool and community leaderboard to compare MLX inference engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama). Measures TTFT, throughput, RAM, and thermal state with standardized methodology. Leaderboard currently populated by M2 8GB, seeking M3/M4 results.

SIG

HYP

Hacker News (AI)·May 31

AI bots ignore evidence. Can we trust them with science?

Study examines AI bots' tendency to ignore scientific evidence. Current models fail to systematically follow empirical data, raising concerns about their reliability for scientific research.

Reasoning Evals AI safety

SIG

HYP

Hacker News (AI)·May 31

Claude Code and Codex Can Have Real-Time Conversation via Git

Claude Code and Codex can now communicate in real-time via Git. A developer built an integration enabling the two models to exchange messages and code directly through Git commits, opening new possibilities for multi-agent collaboration.

Claude Code Multi-agent AI Agents

SIG

HYP

Vercel AI Blog·May 31

Chat SDK adds Lark and Feishu support

Vercel AI Chat SDK adds support for Lark and Feishu via a new official vendor adapter. Bots can post, edit, and delete messages, stream replies via Lark's native cardkit typewriter API, send interactive cards, and react with emojis. Connection uses Lark's WebSocket transport without requiring HTTP webhook exposure.

Tools AI Agents Code generation

SIG

HYP

Gemini AI safety Alignment

13 abliterated Gemma 4 E2B variants, 44 GPU hours, Benchmark and Comparison - Abliterlitics

Systematic comparison of 13 abliterated Gemma 4 E2B variants across 44 GPU hours. coder3101 achieves 96% ASR (refusals) with full capability preservation and outperforms base model on math. Surgical approaches preserve performance better than aggressive methods, with some losing up to 6.9 points on GSM8K.

SIG

HYP

Hacker News (AI)·May 31

DIY Bipedal Robot Used Pneumatic "Air-Muscles" Instead of Motors

A DIY bipedal robot uses pneumatic "air-muscles" instead of electric motors. Alternative approach to robotic locomotion exploring pneumatic actuation.

Robotics

SIG

HYP

Infrastructure Open source Benchmarks

Built an AI Accelerator and opensourced it. [P]

Developer open-sources AI accelerator on FPGA (AWS F2) based on RocketChip/RISC-V with attention mechanism built into silicon. Benchmarks: 225× speedup vanilla attention, 96× TinyBERT, 50× ViT, 30× GPT-2 prefill. Native BF16 support.

SIG

HYP

DIY Local 2x DGX Spark cluster cooler with automatic temperature controlled fan.

User built a DIY cooling enclosure for 2 DGX Spark units using a 3D-printed Thingiverse design (PETG filament). Added a 120mm fan with automatic temperature control via AC Infinity thermostat controller with temperature probe to adjust fan speed based on cluster heat output.

AI Agents Tools Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nesquena /</span> hermes-webui

Hermes WebUI is a web interface to use Hermes Agent from a browser or mobile device. Open-source project trending on GitHub.

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nicobailon /</span> pi-subagents

Pi-subagents is an extension for async subagent delegation with truncation, artifacts, and session sharing. Open-source project trending on GitHub.

AI Agents Multi-agent Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> AppFlowy-IO /</span> AppFlowy-Cloud

AppFlowy-Cloud is an open-source collaborative workspace with integrated AI, a Notion alternative. Manages projects, wikis, and teams while maintaining data control.

Open source AI Agents Tools

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> louis-e /</span> arnis

Arnis is a tool that generates real-world locations in Minecraft with high detail. The project uses AI models to convert geographic data into Minecraft structures.

Code generation Tools

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> golemcloud /</span> golem

Golem Cloud is an agent-native platform for building AI agents and distributed applications that never lose state, never duplicate work, and never require infrastructure management.

AI Agents Infrastructure Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> mattpocock /</span> sandcastle

Sandcastle is a TypeScript library to orchestrate sandboxed coding agents. It enables isolated code execution via sandcastle.run().

AI Agents Code generation Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nicobailon /</span> pi-subagents

Pi-subagents is an extension for async subagent delegation with truncation, artifacts, and session sharing. Open-source tool for agent orchestration.

AI Agents Multi-agent Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> jamwithai /</span> production-agentic-rag-course

Open-source course on building production agentic RAG systems. Covers architecture, implementation patterns, and best practices for deploying agentic retrieval-augmented generation systems.

AI Agents RAG Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> Comfy-Org /</span> ComfyUI

ComfyUI is a modular GUI for diffusion models with a node/graph-based interface, providing API and backend capabilities for image generation.

Image generation Open source Tools

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nesquena /</span> hermes-webui

Hermes WebUI provides a web and mobile interface to use Hermes Agent. Open-source project trending on GitHub.

AI Agents Tools Open source

SIG

HYP

The Decoder·May 31

Ask AI what goes with chicken and the answer depends on whether it learned from recipes or molecules

Kaikaku.AI releases Epicure, three AI models separating ingredients by recipe compatibility or chemical similarity. Trained on 4.14 million multilingual recipes and FlavorDB, they generate different recommendations per source. The chemistry-only model outperforms recipe-based variants on taste and nutrition classification without direct data.

Fine-tuning Benchmarks Tools

SIG

HYP

Image generation Infrastructure Tools

Diffusion in prod: how are you handling spiky GPU load and cold starts?

Production challenges with diffusion models: handling GPU load spikes, cold starts, and inference costs. Scaling from 100 to 10k requests exposes architectural issues and multi-tenancy problems.

SIG

HYP

DeepSeek Benchmarks Code generation

DeepSWE benchmarks indicate that DeepSeek v4 Pro only passes 8% of tasks

Reddit user reports DeepSeek v4 Pro achieves 8% pass rate on DeepSWE benchmark, contrasting with their perception of near-parity with Claude Sonnet 4.6 in practice. Link to DeepSWE benchmark provided.

SIG

HYP

Stepfun 3.7 Flash is very good

Stepfun 3.7 Flash delivers quality close to GLM 5.1 with 80% 3D world understanding while using 75% fewer parameters and featuring built-in vision. Recommended for RAM-constrained setups.

Llama Vision

SIG

HYP

Llama Code generation Benchmarks

Flash Attention for llama.cpp on RDNA3: 47% less KV VRAM than Vulkan f16 K, KLD almost losselss on F16 K / q4_0 V. Part 1.

Flash Attention optimization for llama.cpp on RDNA3 GPUs: 47% VRAM reduction vs Vulkan f16. Packs four 8-bit K-values into native sudot4 instructions without lossy quantization. At 128k context with MTP draft: 21.76 GiB vs 23.18 GiB (1.42 GiB savings). Quality preserved: mean KLD 0.00455 (q4_0 V), 97.06% identical top tokens.

SIG

HYP

<Think> toggle button for llama.cp web chat for QWEN3.6

A user shares a Tampermonkey script to add a reasoning toggle button in llama.cpp web chat for Qwen 3.6. The script intercepts API requests and controls the enable_thinking parameter without recompiling the source code daily.

Qwen Reasoning Tools

SIG

HYP

Built Bloc: a package manager for local AI models, agents, and tools

Bloc is an open-source package manager for local AI models, agents, and tools. It packages complete setups (model, runtime, dependencies, environment variables) into versioned recipes executable via CLI. Similar to npm for AI workloads, with automatic hardware detection and dependency management.

SIG

HYP

The Decoder·May 31

Anthropic bans AI tools during job interviews to see how candidates actually think

Anthropic bans AI tools during job interviews to assess candidates' actual thinking. Up to five rounds test skills, values, and ethical reasoning. Salaries reach $850,000. Some applicants pay $4,600 for prep coaching run anonymously by current company employees.

Anthropic Business

SIG

HYP

Speed difference between Windows 11 and Linux with llama.cpp: a myth when using medium and large MoE models

llama.cpp benchmark comparing Windows 11 and Linux (Ubuntu 26.04) on Nvidia GPU (RTX 5080 + 2× RTX 5060 Ti). No significant performance difference: Qwen 3.5 122B achieves PP 300/TG 28 (Windows) vs PP 290/TG 28.5 (Linux); Qwen 3.5 397B: PP 140/TG 16 vs PP 150/TG 15.2. Tests repeated 4 times with recent llama.cpp including VRAM optimization.

Llama Qwen Benchmarks

SIG

HYP

Benchmarks AI safety Evals

PolyRange: Contamination-resistant offensive-AI benchmark for web targets (that ain't a benchmark, THAT's a benchmark)

PolyRange is a cybersecurity AI benchmark that dynamically generates fresh web targets for each evaluation, eliminating training corpus contamination. The author addresses consensus from labs (Anthropic, OpenAI, DeepMind): static benchmarks are saturated and real-world defenses are missing. MIT-licensed, independent from the author's commercial project.

SIG

HYP

The Decoder·May 31

Anthropic study finds men use AI coding agents more than twice as often as women in social science research

An Anthropic study finds researchers with typically male names use AI coding agents more than twice as often as those with typically female names, controlling for discipline and career level. Economists lead at 39%, education researchers at 4%. The gender gap for coding agents far exceeds that for general AI use.

Anthropic AI Agents Code generation

SIG

HYP

The Decoder·May 31

SoftBank plans 75 billion euro AI data center buildout in France

SoftBank plans to build AI data centers with 5 GW capacity in France for up to 75 billion euros, its largest AI infrastructure investment in Europe. 45 billion euros of facilities are set to go live by 2031 across three northern France sites.

Infrastructure Business

SIG

HYP

Open source Benchmarks Infrastructure

I built mlx-Chronos — a community benchmark leaderboard for local LLM engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama) [P]

mlx-Chronos is an open-source CLI tool and community leaderboard to benchmark local LLM inference engines on Apple Silicon (oMLX, Rapid-MLX, mlx-lm, Ollama). Measures TTFT, throughput, RAM, and thermal state with standardized methodology. Currently populated only with M2 8GB results.

SIG

HYP

The Decoder·May 31

AI search agents often confirm what they already know instead of actually researching the web

AI search agents like GPT-5.4 and Kimi K2.6 mostly confirm their training knowledge rather than genuinely researching the web. Researchers at Harbin Institute of Technology demonstrated this using LiveBrowseComp, a benchmark based on events from the last 90 days. Without relying on training memory, performance collapses.

Benchmarks AI Agents GPT

SIG

HYP

Hacker News (AI)·May 31

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Novel approach for autonomous AI agents: using memory as action to manage context for long-horizon tasks. The system actively selects which information to retain and use, improving performance across extended horizons.

AI Agents Reasoning

SIG

HYP

Vercel AI Blog·May 31

MiniMax M3 on AI Gateway

MiniMax M3, MiniMax's first model with 1M-token context window and native multimodality, is now available on Vercel AI Gateway. M3 excels at software engineering, terminal-based tool use, and agentic web browsing, optimized for multi-turn collaboration.

AI Agents Code generation Vision

SIG

HYP

AI Agents Code generation Open source

Made a program using LocalLLM based on llama.cpp for fellow Book Lovers!

Developer built an ebook reader with embedded translation model based on llama.cpp. Local application for multilingual readers: AI translation, sticky notes, bookmarks, reviews, searchable annotations. Uses compact models (4B-70B) without cloud dependency.

Llama Open source Tools

SIG

HYP

Hacker News (AI)·May 31

Show HN: Komi-learn – continuous memory and self-improvement for coding agents

Komi-learn is a framework for coding agents with continuous memory and self-improvement capabilities. The project enables agents to learn from past experiences and improve performance over time.

SIG

HYP

Qwen Code generation Open source

mudler/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF just released !

Mudler releases APEX GGUF quantizations of Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled with bundled MTP (multi-token prediction) head. Files enable self-speculative decoding via llama.cpp without separate draft model. Size +2.5% vs non-MTP version, MTP head quantized Q8_0 for high draft accuracy.

SIG

HYP

Dell confirms XPS laptop with NVIDIA N1X at Computex ( basically a DGX Spark GB10 for consumers with Windows )

Dell confirms XPS laptop with NVIDIA N1X GPU (based on DGX Spark GB10 architecture) for consumer market running Windows. Official announcement at Computex.

Infrastructure

SIG

HYP

Simon Willison·May 31

Quoting Karen Kwok for Reuters Breakingviews

Anthropic calculates its 'run-rate revenue' in two parts: last 28 days of consumption-based sales × 13, plus monthly subscriptions × 12. This metric, reported by Reuters, raises questions about actual revenue measurement.

Anthropic Business

SIG

HYP

Open source AI Agents Code generation

My home data center

User showcases personal data center: 4 systems (Threadripper 3960X + 4×3090 Ti, Xeon 8352 + 4×5070 Ti, Intel 14700K + 5090, Ryzen 5950X + 2×5070 Ti). Runs Qwen 27B for coding, Nemotron for STT, trains TTS LoRA. Agentic systems work overnight on repos with zero token cost.

SIG

HYP

Qwen Benchmarks Open source

Benchmarked inference engines for M1 Max 64gb-results & analysis

Benchmark of inference engines on M1 Max 64GB comparing rapid-mlx, omlx, mlx-lm, and ollama with Qwen 3.5-4B. Rapid-mlx leads on speed and memory efficiency. Results submitted to mlx-chronos community leaderboard.

SIG

HYP

Hacker News (AI)·May 31

AI grifters are creating fake Black people to sell Shein junk

Scammers are using AI-generated images of fake Black people to promote Shein products on social media. Fraudulent marketing practice exploiting image generation and racial bias.

Image generation AI safety Business

SIG

HYP

Little project I'm excited to share

Developer builds custom inference engine in Rust and Metal to eliminate setup friction for local LLMs. One-click app includes model selection, tools, MCP support, and performance optimization. Repository and app launching June 1st, free and open-source.

Open source Tools MCP

SIG

HYP

Hacker News (AI)·May 30

Starbucks Abandons Borked AI Inventory Tool That Couldn't Count

Starbucks abandons a faulty AI inventory management tool that failed to accurately count stock. The system did not meet operational expectations.

Business Tools

SIG

HYP

Open source Infrastructure Tools

Everyone here self-hosts inference. Almost nobody self-hosts the tooling around it. That feels backwards to me.

A r/LocalLLaMA user highlights an inversion: the community self-hosts models (hardest part) but outsources tooling (tracing, evals, monitoring) to SaaS. He argues open-source solutions (Langfuse, ragas, Open WebUI) now enable hosting the full stack locally without external calls.

SIG

HYP

Simon Willison·May 30

How we contain Claude across products

Anthropic publishes detailed documentation on sandboxing techniques across Claude.ai, Claude Code, and Cowork. Uses gVisor (Claude.ai), Seatbelt/Bubblewrap (Claude Code local), and full VMs (Cowork). Includes process sandboxes, filesystem boundaries, and egress controls to prevent credential exfiltration.

Claude Claude Code Anthropic

SIG

HYP

Open source Infrastructure Llama

Cost Analysis of my $6.4k Local LLM Server

TCO analysis of a $6.4k local LLM server with 4x MI100 32GB GPUs and EPYC 48-core CPU. Runs 4 llama.cpp instances with Qwen 3.6 27B on ROCm. Processes 20.4M input tokens and 1.32M output tokens daily. Equivalent API cost: $3,701/year ($308/month). Author emphasizes proper hardware depreciation accounting for realistic TCO.

SIG

HYP

Simon Willison·May 30

Running Python ASGI apps in the browser via Pyodide + a service worker

Simon Willison used Claude Opus 4.8 via Claude Code to implement running Python ASGI apps in the browser via Pyodide and Service Workers. This approach replaces the previous Web Workers implementation, enabling JavaScript execution and fixing Datasette Lite limitations. Working demos are available.

Claude Code Code generation Tools

SIG

HYP

Qwen Code generation Open source

Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

User reports successful execution of Qwen 3.6 35B MoE on M1 Max with Zoo Code. MoE model running locally, offline, on battery power.

SIG

HYP

Hacker News (AI)·May 30

768GB Intel Optane DIMMs to run 1T-parameter LLM with single GPU at 4tps

768GB Intel Optane DIMMs enable running a 1-trillion-parameter LLM on a single GPU at 4 tokens/second. Hardware configuration for inference of very large models without distributed infrastructure.

Infrastructure Benchmarks

SIG

HYP

Hacker News (AI)·May 30

1M Ancient Greek fragments soon to be translated with the help of AI

One million ancient Greek text fragments will be translated using AI. The project leverages vision and language models to decipher damaged manuscripts and generate automated translations.

Vision

SIG

HYP

Qwen AI Agents Multi-agent

For those creating personal assistants locally - how has short/long term memory impacted your experience?

A r/LocalLLaMA user built an autonomous agent with Qwen 3.5 27B enhanced by short/long-term memory (memory.md file, daily summaries, self-reflections). The agent handles complex tasks (app creation, web search, software installation). User prefers this local setup over GPT/Gemini for UX despite lower raw capability.

SIG

HYP

Reasoning Benchmarks Papers

Parallax: Parameterized Local Linear Attention for Language Modeling

Parallax is a parameterized Local Linear Attention mechanism for LLMs derived from statistical regression. It replaces softmax's local constant estimate with a linear estimate, yielding better bias-variance tradeoffs. Pretrained at 0.6B and 1.7B scales, Parallax shows consistent perplexity improvements and matches or outperforms FlashAttention 2/3 in decoding.

SIG

HYP

Qwen Fine-tuning Benchmarks

nvidia/Qwen3.6-35B-A3B-NVFP4 · Hugging Face

NVIDIA quantized Alibaba's Qwen3.6-35B-A3B model to NVFP4 (4-bit) using Model Optimizer. Weight reduction from 16 to 4 bits per parameter cuts GPU memory and disk size by ~3.06x. Benchmark results show minimal accuracy loss: MMLU Pro 85.6→85.0, GPQA Diamond 84.9→84.8.

SIG

HYP

Hacker News (AI)·May 30

Open source project contains hidden instruction for "AI" agents: delete my code

An open source project contains a hidden instruction targeting AI agents, commanding them to delete the code. Discovery reveals security risks from automated code execution by AI systems.

AI Agents AI safety Open source

SIG

HYP

Hacker News (AI)·May 30

OpenRouter raises $113M Series B

OpenRouter raises $113M Series B. The LLM API aggregation platform strengthens funding to expand model offerings and infrastructure capabilities.

OpenAI Business Infrastructure

SIG

HYP

Open source Benchmarks Tools

SupraLabs 50M Parameter Model Just Hit the Trending Page on Hugging Face 🤯

SupraLabs released Supra-50M-Instruct, a 51.8M parameter model ranking #1 trending on Hugging Face (≤1B category). 7.65k downloads in 9 days, outranking Gemma-3-1B and Qwen3-0.6B. Demonstrates community interest in efficient models runnable on modest hardware.

SIG

HYP

The Decoder·May 30

Microsoft and Nvidia reportedly team up on AI PCs that run actual agents instead of Copilot

Microsoft and Nvidia partner on AI PCs running autonomous agents locally via OpenClaw framework, replacing Copilot+. Dell and Surface will unveil first models at Computex and Build next week.

AI Agents OpenAI Tools

SIG

HYP

Reddit r/MachineLearning·May 30

How to fine-tune an LLM for open-ended problems? [P]

Researcher asks how to fine-tune an LLM for open-ended math problems (proofs). Standard SFT and RLHF inadequate; seeks appropriate method using MathNet dataset.

Fine-tuning Reinforcement learning Reasoning

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> FareedKhan-dev /</span> train-llm-from-scratch

Straightforward method to train an LLM from scratch: data download, preprocessing, and text generation. GitHub repo with executable code.

Fine-tuning Code generation Open source

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> anthropics /</span> skills

Anthropic releases a public repository for Agent Skills, reusable components for AI agents. The project enables development and sharing of standardized agent capabilities.

Anthropic AI Agents Open source

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> voidzero-dev /</span> vite-plus

Vite+ is a unified toolchain and entry point for web development that centralizes runtime, package manager, and frontend toolchain in a single place.

Tools Open source

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> stalwartlabs /</span> stalwart

Stalwart is an all-in-one open-source mail and collaboration server supporting IMAP, JMAP, SMTP, CalDAV, CardDAV, and WebDAV. Designed for security and scalability.

Open source Infrastructure

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> eclipse-zenoh /</span> zenoh

Zenoh is an open-source middleware unifying pub/sub, geo-distributed storage, queries and computations. It optimizes time and space efficiency beyond mainstream stacks.

Infrastructure Open source

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> RealKai42 /</span> qwerty-learner

Qwerty-learner is vocabulary learning and English muscle memory training software designed for keyboard workers. Combines word memorization with typing practice.

Tools

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> apache /</span> airflow

Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. It enables orchestration of complex data pipelines with dependency management and real-time monitoring.

Infrastructure Open source

SIG

HYP

The Decoder·May 30

Making AI chatbots helpful weakens their ability to simulate human behavior, large-scale study finds

Large-scale study (208,000 participants, 26 million responses) reveals that training making language models helpful weakens their ability to replicate human behavior. The effect worsens with each model generation. Demographic profiles (persona trick) provide no meaningful benefit for individual predictions.

Alignment Evals Papers

SIG

HYP

Qwen Open source Infrastructure

125 tok/s for Qwen3.6 q4xl on 2x 4060ti is insane perf/dollar

User reports 125 tokens/s with Qwen 3.6 Q4 quantized on 2x RTX 4060 Ti (~$1000, 32GB VRAM). Outperforms high-end 2026 mini-PCs at fraction of cost. Testing CUDA 13.3 optimization to reach 150 tok/s.

SIG

HYP

Reddit r/MachineLearning·May 30

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

Two ML students question whether robotics faces a data scarcity problem. After normalizing public datasets, they suspect the real issue is interoperability: heterogeneous schemas, different sensors, incompatible coordinate frames. They ask robotics teams whether they would actually use data from other teams through a unified API.

Robotics RAG Open source

SIG

HYP

Hacker News (AI)·May 30

Corporate America Is Starting to Ration AI as Cost Skyrockets

Major US corporations are rationing AI usage as infrastructure and API costs skyrocket. AI budgets become bottlenecks, forcing organizations to prioritize use cases and restrict access to expensive models.

Business Infrastructure

SIG

HYP

The Decoder·May 30

Terence Tao argues AI could bring division of labor to math for the first time in history

Terence Tao argues AI could introduce division of labor in mathematics for the first time. Currently, researchers master every step alone (problem framing, verification). Tao foresees "industrial mathematics": AI-supported teams replacing lone geniuses, with humans remaining essential for "inspired guesses."

Reasoning AI Agents

SIG

HYP

Hacker News (AI)·May 30

Show HN: Helios – what plug-in solar could generate for any address in Britain

Helios is a tool that estimates potential solar generation for any address in Britain. Uses geographic and weather data to calculate residential solar panel yield.

Tools

SIG

HYP

The Decoder·May 30

Attackers abuse shared ChatGPT and Claude chats to spread malware

Attackers exploit ChatGPT and Claude's chat-sharing features to distribute malware. Fake chats mimic error messages or installation guides and bypass security tools by being hosted on trusted domains.

AI safety

SIG

HYP

The Decoder·May 30

OpenAI's Codex can now operate your Windows PC autonomously, hunting bugs and testing apps on its own

OpenAI deploys Codex on Windows 11 with 'Computer Use' feature enabling AI to autonomously control programs, test applications, and detect bugs. ChatGPT mobile app allows users to launch and monitor these tasks remotely.

OpenAI Code generation AI Agents

SIG

HYP

Qwen Reasoning Fine-tuning

Gryphe/Pantheon-Reasoning-27B · Hugging Face

Gryphe releases Pantheon-Reasoning-27B, an uncensored Qwen 3.6 27B model fine-tuned on roleplay data with full reasoning traces. Trained on Pantheon corpus (~28%), Claude Opus 4.6 reasoning traces (~21%), WorldSim narrative data (~16%), and text adventure content (~16%), the model experiments whether reasoning improves roleplay quality. GGUF quantizations available.

SIG

HYP

Claude Code Anthropic AI Agents

Open source : Turning vocal imitations into sound effects. (New UX for sound generation)

Open-source project generating sound effects from vocal imitations and text input. User records a voice imitation of the desired sound, the model combines it with text description to produce the final audio effect. Demo available on GitHub repo.

Open source Tools

SIG

HYP

The Decoder·May 30

Salesforce claims AI agents cut a 231-day migration to 13 days with fewer incidents

Salesforce claims it migrated its entire dev org to Anthropic's Claude Code in 13 days instead of 231 planned, reporting 79% more pull requests per developer and 5% fewer incidents in April 2026. Numbers cannot be independently verified.

SIG

HYP