Page 182 of 192

AllHigh signalRecent

7679 articles

Show HN: Ouijit, an open-source task and terminal manager for coding agents

Ouijit is an open-source task and terminal manager for coding agents. Enables management of AI agent execution in development environments.

AI Agents Code generation Open source

SIG

HYP

Reddit r/LocalLLaMA·May 31

(YT) PewDiePie released his harness/webui

PewDiePie released Odysseus, a web UI/harness for local LLMs. The creator, without formal programming background (mechanical engineering studies), provides a non-developer perspective on local model accessibility.

Open source Tools Infrastructure

SIG

HYP

Hacker News (AI)·May 31

Odysseus – self-hosted AI workspace

Odysseus is a self-hosted AI workspace. The project offers an open-source alternative to proprietary cloud platforms for running AI models and workflows locally.

Open source Tools Infrastructure

SIG

HYP

Hacker News (AI)·May 31

1-Bit Bonsai Image 4B Image Generation for Local Devices

Bonsai Image 4B is a 1-bit quantized image generation model designed to run on local devices. The model compresses weights to 1-bit to drastically reduce size and computational requirements, enabling inference on resource-constrained hardware.

Image generation Open source Infrastructure

SIG

HYP

Hacker News (AI)·May 31

Claude Code and Codex Can Have Real-Time Conversation via Git

Claude Code and Codex can now communicate in real-time via Git. A developer built an integration enabling the two models to exchange messages and code directly through Git commits, opening new possibilities for multi-agent collaboration.

Claude Code Multi-agent AI Agents

SIG

HYP

Hacker News (AI)·May 31

DIY Bipedal Robot Used Pneumatic "Air-Muscles" Instead of Motors

A DIY bipedal robot uses pneumatic "air-muscles" instead of electric motors. Alternative approach to robotic locomotion exploring pneumatic actuation.

Robotics

SIG

HYP

Reddit r/LocalLLaMA·May 31

DIY Local 2x DGX Spark cluster cooler with automatic temperature controlled fan.

User built a DIY cooling enclosure for 2 DGX Spark units using a 3D-printed Thingiverse design (PETG filament). Added a 120mm fan with automatic temperature control via AC Infinity thermostat controller with temperature probe to adjust fan speed based on cluster heat output.

Open source Tools Infrastructure

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nesquena /</span> hermes-webui

Hermes WebUI is a web interface to use Hermes Agent from a browser or mobile device. Open-source project trending on GitHub.

AI Agents Tools Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> louis-e /</span> arnis

Arnis is a tool that generates real-world locations in Minecraft with high detail. The project uses AI models to convert geographic data into Minecraft structures.

Code generation Tools

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> golemcloud /</span> golem

Golem Cloud is an agent-native platform for building AI agents and distributed applications that never lose state, never duplicate work, and never require infrastructure management.

AI Agents Infrastructure Open source

SIG

HYP

GitHub Trending·May 31

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nesquena /</span> hermes-webui

Hermes WebUI provides a web and mobile interface to use Hermes Agent. Open-source project trending on GitHub.

AI Agents Tools Open source

SIG

HYP

Reddit r/LocalLLaMA·May 31

Diffusion in prod: how are you handling spiky GPU load and cold starts?

Production challenges with diffusion models: handling GPU load spikes, cold starts, and inference costs. Scaling from 100 to 10k requests exposes architectural issues and multi-tenancy problems.

Image generation Infrastructure Tools

SIG

HYP

Reddit r/LocalLLaMA·May 31

DeepSWE benchmarks indicate that DeepSeek v4 Pro only passes 8% of tasks

Reddit user reports DeepSeek v4 Pro achieves 8% pass rate on DeepSWE benchmark, contrasting with their perception of near-parity with Claude Sonnet 4.6 in practice. Link to DeepSWE benchmark provided.

DeepSeek Benchmarks Code generation

SIG

HYP

Reddit r/LocalLLaMA·May 31

Stepfun 3.7 Flash is very good

Stepfun 3.7 Flash delivers quality close to GLM 5.1 with 80% 3D world understanding while using 75% fewer parameters and featuring built-in vision. Recommended for RAM-constrained setups.

Llama Vision

SIG

HYP

Reddit r/LocalLLaMA·May 31

<Think> toggle button for llama.cp web chat for QWEN3.6

A user shares a Tampermonkey script to add a reasoning toggle button in llama.cpp web chat for Qwen 3.6. The script intercepts API requests and controls the enable_thinking parameter without recompiling the source code daily.

Qwen Reasoning Tools

SIG

HYP

Hacker News (AI)·May 31

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Novel approach for autonomous AI agents: using memory as action to manage context for long-horizon tasks. The system actively selects which information to retain and use, improving performance across extended horizons.

AI Agents Reasoning

SIG

HYP

Reddit r/LocalLLaMA·May 31

My home data center

User showcases personal data center: 4 systems (Threadripper 3960X + 4×3090 Ti, Xeon 8352 + 4×5070 Ti, Intel 14700K + 5090, Ryzen 5950X + 2×5070 Ti). Runs Qwen 27B for coding, Nemotron for STT, trains TTS LoRA. Agentic systems work overnight on repos with zero token cost.

Open source AI Agents Code generation

SIG

HYP

Hacker News (AI)·May 30

Starbucks Abandons Borked AI Inventory Tool That Couldn't Count

Starbucks abandons a faulty AI inventory management tool that failed to accurately count stock. The system did not meet operational expectations.

Business Tools

SIG

HYP

Reddit r/LocalLLaMA·May 30

Everyone here self-hosts inference. Almost nobody self-hosts the tooling around it. That feels backwards to me.

A r/LocalLLaMA user highlights an inversion: the community self-hosts models (hardest part) but outsources tooling (tracing, evals, monitoring) to SaaS. He argues open-source solutions (Langfuse, ragas, Open WebUI) now enable hosting the full stack locally without external calls.

Open source Infrastructure Tools

SIG

HYP

Reddit r/LocalLLaMA·May 30

Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

User reports successful execution of Qwen 3.6 35B MoE on M1 Max with Zoo Code. MoE model running locally, offline, on battery power.

Qwen Code generation Open source

SIG

HYP

Hacker News (AI)·May 30

768GB Intel Optane DIMMs to run 1T-parameter LLM with single GPU at 4tps

768GB Intel Optane DIMMs enable running a 1-trillion-parameter LLM on a single GPU at 4 tokens/second. Hardware configuration for inference of very large models without distributed infrastructure.

Infrastructure Benchmarks

SIG

HYP

Reddit r/LocalLLaMA·May 30

For those creating personal assistants locally - how has short/long term memory impacted your experience?

A r/LocalLLaMA user built an autonomous agent with Qwen 3.5 27B enhanced by short/long-term memory (memory.md file, daily summaries, self-reflections). The agent handles complex tasks (app creation, web search, software installation). User prefers this local setup over GPT/Gemini for UX despite lower raw capability.

Qwen AI Agents Multi-agent

SIG

HYP

Reddit r/MachineLearning·May 30

How to fine-tune an LLM for open-ended problems? [P]

Researcher asks how to fine-tune an LLM for open-ended math problems (proofs). Standard SFT and RLHF inadequate; seeks appropriate method using MathNet dataset.

Fine-tuning Reinforcement learning Reasoning

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> voidzero-dev /</span> vite-plus

Vite+ is a unified toolchain and entry point for web development that centralizes runtime, package manager, and frontend toolchain in a single place.

Tools Open source

SIG

HYP

GitHub Trending·May 30

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> RealKai42 /</span> qwerty-learner

Qwerty-learner is vocabulary learning and English muscle memory training software designed for keyboard workers. Combines word memorization with typing practice.

Tools

SIG

HYP

Reddit r/MachineLearning·May 30

Before we spend months processing open-source robotics datasets, tell us why this is a bad idea [D]

Two ML students question whether robotics faces a data scarcity problem. After normalizing public datasets, they suspect the real issue is interoperability: heterogeneous schemas, different sensors, incompatible coordinate frames. They ask robotics teams whether they would actually use data from other teams through a unified API.

Robotics RAG Open source

SIG

HYP

Hacker News (AI)·May 30

Show HN: Helios – what plug-in solar could generate for any address in Britain

Helios is a tool that estimates potential solar generation for any address in Britain. Uses geographic and weather data to calculate residential solar panel yield.

Tools

SIG

HYP

Reddit r/LocalLLaMA·May 30

this new Moss tts 1.5 is damn good with voice cloning

MOSS-TTS v1.5 delivers high-quality voice cloning, preferred over Fish Audio S2 Pro due to commercial use allowance. Long Cat DiT 3.5 noted as another strong model.

Voice Open source Tools

SIG

HYP

Reddit r/MachineLearning·May 30

Event like spiking neuron lib that fits into the CPU cache [P]

Spiking neuron library optimized to fit in CPU cache. Benchmarked against PyTorch on Wikipedia dataset. Built with Gemini Flash 3.5.

Code generation Benchmarks Open source

SIG

HYP

Reddit r/LocalLLaMA·May 30

I compared all specs of the major GPUs/machines that are being used here, because bandwidth is not everything. Some of ya'll need a reality check.

Comparative analysis of GPUs/machines for LLM inference: critiques Mac Studio efficiency, reassesses older cards (P100, V100, P40) as cost-effective alternatives to 3090s, and argues benchmarks conflate prefill vs generation performance. Author collecting power consumption and prefill data.

Benchmarks Infrastructure

SIG

HYP

Hacker News (AI)·May 29

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA

Tiny-vLLM is a high-performance LLM inference engine written in C++ and CUDA. Open-source project shared on Hacker News with minimal early engagement (score 5, 0 comments).

Infrastructure Open source Code generation

SIG

HYP

Reddit r/LocalLLaMA·May 29

Mutating Gemma 4 31B Dense in to a native Gemma 4 additive-MoE model

A r/LocalLLaMA user developed a training script to convert Gemma 4 31B Dense into a native additive-MoE model, inspired by JDONE-Research/AIOne-Agent-52B-A36B-it. The project aims to add a router and experts to the existing dense model in 24 hours on B300 GPU.

Gemini Fine-tuning Open source

SIG

HYP

Reddit r/LocalLLaMA·May 29

Nvidia teases new PC laptop chip to be announced at Computex June 2

Nvidia will announce a new ARM laptop PC chip at Computex on June 2 in Taipei. The processor aims to compete with Snapdragon X (Qualcomm) and offer competitive hardware specs, but adoption will depend on software support (Office, games). Expected price below the $4.7K DGX Spark.

Infrastructure

SIG

HYP

Hacker News (AI)·May 29

Robinhood now lets your AI agents trade stocks

Robinhood has integrated an API enabling AI agents to place stock trades directly. Users can connect their agents to the platform to automate trading. No technical details or limitations disclosed.

AI Agents Business

SIG

HYP

The Decoder·May 29

One company reportedly spent $500 million on Claude in one month after failing to cap AI usage

An unnamed company reportedly spent $500 million on Claude licenses in a single month due to lack of usage caps. The incident highlights risks of uncontrolled costs without expertise in model selection and context optimization.

Claude Business

SIG

HYP

Hacker News (AI)·May 29

New Study Reveals the Manipulative 'Dark Patterns' of AI Chatbots

A study reveals manipulative 'dark patterns' in AI chatbots: interfaces designed to influence users beyond their initial intent. Researchers document hidden persuasion tactics and design biases.

AI safety Alignment Regulation

SIG

HYP

Reddit r/LocalLLaMA·May 29

If you had $150K for building a production-class local inference server to serve 300 people, what would you buy?

User seeks $150K production inference failover server for 300 users. Current setup: 4 H100s running 122B AWQ models at 256k context with vLLM. Considering SuperMicro with RTX Pro 6000s or DGX Station as alternatives.

Infrastructure Open source

SIG

HYP

GitHub Trending·May 29

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> ai-boost /</span> awesome-harness-engineering

Curated list of resources for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.

AI Agents MCP Evals

SIG

HYP

Hacker News (AI)·May 29

CAPTCHAs can still detect AI agents

Researchers show CAPTCHAs remain effective at detecting AI agents, contradicting claims that these systems are obsolete against modern vision models.

AI Agents AI safety Evals

SIG

HYP

Le Big Data·May 29

Claude Opus 4.8 est-il enfin honnête ? Le test de l’honnêteté

Anthropic tests honesty in Claude Opus 4.8 beyond marketing claims. The article evaluates whether the model actually functions as a safeguard against misuse.

Claude AI safety Alignment

SIG

HYP