Page 175 of 192


7679 articles

Show HN: Can Europe train a frontier AI model on the compute it owns?

A project investigates whether Europe can train a frontier AI model using only its own compute resources. Open question about European technological autonomy versus US AI giants.

Open source Infrastructure Regulation


Le Big Data·5d ago

Pemba, the first humanoid robot that wants to climb Mount Everest

Pemba, a humanoid robot, trains to climb Mount Everest after successfully ascending Chimborazo in snowy conditions. The project tests autonomous locomotion and navigation capabilities in extreme environments.

Robotics


GitHub Trending·6d ago

mikeroyal / Self-Hosting-Guide

Comprehensive self-hosting guide covering on-premises software deployment, private cloud, LLMs, WireGuard, automation, Home Assistant, and networking infrastructure.

Open source Infrastructure Tools


GitHub Trending·6d ago

amruthpillai / reactive-resume

Reactive Resume is an open-source, free resume builder prioritizing privacy and security. The tool offers customization, portability, and data ownership for users.

Open source Tools


Le Big Data·6d ago

This madman is trying to recreate GTA 6 from A to Z... using only an AI

A developer attempts to recreate GTA 6 entirely using AI, in parallel with the official release scheduled for November. The project leverages AI models to generate code, graphics assets, and game design.

Code generation Image generation Tools


Reddit r/LocalLLaMA·6d ago

Lower generation speed with H100 and H200 than with RTX 5090?

User reports slower generation on H100 (42 tok/sec) than on RTX 5090 (57 tok/sec) using llama.cpp with a 31B Q6 model. The H100 provides larger context (128k vs 26k) and higher bandwidth, yet generates more slowly.

Infrastructure Benchmarks


Le Big Data·6d ago

The FBI built its own little town... just to get hacked

The FBI built Kinetic Cyber Range, a training facility designed as a simulated city for cyberattack drills and agent preparation against cyber threats.

AI safety


Reddit r/LocalLLaMA·6d ago

This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b

Qwen 27B achieves doubled generation speed and reduced VRAM usage (21 GB → 17.5 GB) on identical hardware while maintaining full context accuracy.

Qwen Open source Infrastructure


Reddit r/LocalLLaMA·6d ago

UI/svg block rendering by ServeurpersoCom · Pull Request #24080 · ggml-org/llama.cpp

Pull request #24080 on llama.cpp adds UI/SVG block rendering. Video demonstration shows SVG rendering capabilities integrated into the project.

Llama Open source Tools


Hacker News (AI)·6d ago

Show HN: AwsmAudio – a WebAudio editor with native MCP

AwsmAudio is a WebAudio editor with native MCP (Model Context Protocol) integration. Project showcased on Hacker News with minimal engagement (3 points, 0 comments).

MCP Tools Open source


Reddit r/LocalLLaMA·6d ago

I made a private on-device LLM app for Android (notes + recall, nothing leaves the phone)

Developer releases Android app running LLM fully on-device for note-taking and AI-powered recall. All data stays on phone, no cloud. Seeking beta testers (8GB+ RAM recommended), free, in Google Play closed testing.

Open source Tools RAG


Le Big Data·6d ago

AMD: this mini PC runs giant AI models... with no cloud and no subscription

AMD introduces a mini PC capable of running large AI models locally without cloud dependency or subscriptions. The device provides an alternative to traditional cloud services for AI inference.

Infrastructure


Hacker News (AI)·6d ago

The Jqwik Anti-AI Affair

Jqwik, a Java testing library, rejected contributions generated by AI. The maintainer published a policy banning AI-generated PRs, sparking debate over code quality and attribution.

Code generation Open source


Hacker News (AI)·6d ago

AI is code – and can't be prompted into being smarter

An article arguing that AI is fundamentally code and cannot be made smarter through prompting alone. Challenges the notion that better instructions can overcome the architectural limitations of models.

Prompt engineering Reasoning


Reddit r/LocalLLaMA·6d ago

How are you handling memory provenance in persistent agents — verified vs. inferred facts?

Developer highlights the challenge of distinguishing verified facts from inferences in persistent agent memory. Old inferences get promoted to facts over sessions, breaking auditability. The author manually implements provenance tagging (verified/inferred/speculative) and asks whether existing solutions (Zep, Mem0, Cognee) address this epistemic layer problem.

AI Agents RAG
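
The provenance tagging described in this post could look roughly like the sketch below. It is a minimal illustration, not the poster's implementation: the three `Provenance` levels mirror the tags named in the summary, while `MemoryEntry`, `MemoryStore`, and the rule that only an explicit `verify` call can promote an entry are assumptions made for the example.

```python
from dataclasses import dataclass, field
from enum import Enum


class Provenance(Enum):
    VERIFIED = "verified"
    INFERRED = "inferred"
    SPECULATIVE = "speculative"


@dataclass
class MemoryEntry:
    text: str
    provenance: Provenance
    source: str  # hypothetical field: where the claim came from
    history: list = field(default_factory=list)  # audit trail of promotions


class MemoryStore:
    """Hypothetical agent memory that never promotes a claim implicitly."""

    def __init__(self):
        self._entries = {}

    def remember(self, key, text, provenance, source):
        existing = self._entries.get(key)
        # A weaker claim must not overwrite an already verified one.
        if (existing is not None
                and existing.provenance is Provenance.VERIFIED
                and provenance is not Provenance.VERIFIED):
            return existing
        entry = MemoryEntry(text, provenance, source)
        self._entries[key] = entry
        return entry

    def verify(self, key, evidence):
        # The only path to VERIFIED; the previous tag and evidence are logged.
        entry = self._entries[key]
        entry.history.append((entry.provenance, evidence))
        entry.provenance = Provenance.VERIFIED
        return entry

    def facts(self):
        return [e.text for e in self._entries.values()
                if e.provenance is Provenance.VERIFIED]
```

Under this design, recalling an inferred entry in a later session leaves its tag unchanged, so an inference cannot drift into a fact without a recorded verification step.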


Reddit r/LocalLLaMA·6d ago

Strange numbers of pp and tg rx7900xtx on ROCm and Vulcan with Qwen3.6-27b nonMTP and MTP

User reports unsatisfactory performance running Qwen 3.6-27B on RX 7900 XTX via ROCm and Vulkan with llama.cpp. Prompt processing: 235–634 tok/s depending on backend; generation: 13–31 tok/s. MTP (multi-token prediction, used here for speculative decoding) with n=3 drops generation to 17 tok/s despite a 78% acceptance rate.

Qwen Open source Benchmarks
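
A rough way to reason about why a 78% acceptance rate does not guarantee a speedup is sketched below. It uses the textbook expectation for speculative decoding and assumes each drafted token is accepted independently with the same probability, which may not match how llama.cpp computes its reported acceptance rate. The function names and the `relative_step_cost` parameter are hypothetical.

```python
def expected_tokens_per_step(acceptance_rate: float, draft_len: int) -> float:
    """Expected tokens emitted per draft-and-verify pass.

    Assumes independent per-token acceptance and that every pass yields
    at least one token from the main model: 1 + a + a^2 + ... + a^n.
    """
    return sum(acceptance_rate ** k for k in range(draft_len + 1))


def estimated_speedup(acceptance_rate: float, draft_len: int,
                      relative_step_cost: float) -> float:
    """Throughput relative to plain decoding.

    relative_step_cost is how many times longer one draft-and-verify pass
    takes than a single ordinary decoding step (an input you must measure).
    """
    return expected_tokens_per_step(acceptance_rate, draft_len) / relative_step_cost


if __name__ == "__main__":
    tokens = expected_tokens_per_step(0.78, 3)
    print(f"expected tokens per pass: {tokens:.2f}")
    print(f"speedup if a pass costs 4x a plain step: {estimated_speedup(0.78, 3, 4.0):.2f}")
```

Under these assumptions, n=3 at 78% acceptance yields just under three tokens per pass, so generation gets slower whenever drafting plus verification costs more than about three ordinary decoding steps.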


Reddit r/LocalLLaMA·6d ago

Introducing the Heretic Grimoire: The takedown-resilient, local-first backup system that keeps uncensored models available forever

Heretic announces a decentralized backup system for uncensored local models. Models compressed to 9 KB enable phone storage. The project builds takedown-resilient infrastructure with official website and redundant documentation.

Open source Llama AI safety


Reddit r/LocalLLaMA·6d ago

Built a local AI assistant because I always knew this day would come, yesterday just made it feel very real

Developer builds Bantz, a fully local AI personal assistant with 1920s butler persona running on Gemma 4B. Features Gmail summarization, Google Calendar integration, web search, system monitoring, and Wayland desktop control. CPU-only execution. Motivated by risks of relying on third-party infrastructure (references Anthropic shutdown).

Open source AI Agents Tools


Hacker News (AI)·Jun 14

Reinventing Control Theory One Feature at a Time: The Fallacy of Agentic Loops

Critique of agentic loops approach in AI. The article questions the iterative design of autonomous agents, arguing that incrementally adding features does not solve fundamental control and stability issues.

AI Agents Reasoning AI safety


GitHub Trending·Jun 14

GorvGoyl / Clone-Wars

Repository of 100+ open-source clones of popular sites (Airbnb, Amazon, Instagram, Netflix, TikTok, Spotify, WhatsApp, YouTube). Includes source code, demos, tech stack, and GitHub stars.

Open source Tools


GitHub Trending·Jun 14

nrwl / nx

Nx is a monorepo platform optimizing builds and CI scaling, with AI agent capabilities to automatically fix failing PRs. Reduces deployment time by half.

AI Agents Tools Infrastructure


Reddit r/LocalLLaMA·Jun 14

Local models in mid-2026

Open-weight models achieve viable local execution in 2026 through sparse attention, MoE, latent KV compression, multi-token prediction, and 4-bit quantization, without requiring more RAM.

Open source Infrastructure Code generation


The Decoder·Jun 14

Amazon and five other companies reportedly triggered the government crackdown on Anthropic's Fable model

Amazon and five other companies reportedly alerted the Trump administration to security vulnerabilities in Anthropic's Fable model. The White House ordered the model offline via export control within hours, despite Amazon being one of Anthropic's largest investors.

Anthropic Regulation AI safety


Reddit r/LocalLLaMA·Jun 14

Codebase getting larger - Qwen3.6-27B starting to compound issues - how to work smartly with this model?

Developer using Qwen3.6-27B via llama.cpp encounters recurring bugs in Python codebase despite 128K context window. Testing strategies: full project reads vs focused function analysis, KV quantization disabled. Seeking approaches to minimize model errors.

Qwen Code generation Prompt engineering


Hacker News (AI)·Jun 14

WhatsApp Claims It Thwarted an NSO Spyware Campaign

WhatsApp claims to have thwarted an NSO spyware campaign. The messaging app detected and blocked an exploitation attempt targeting its users.

AI safety


Reddit r/LocalLLaMA·Jun 14

I need a model that gets stuck in loops.

Developer seeks an LLM that loops frequently, in order to test loop detection and recovery mechanisms in an agent. GLM Flash at low temperature with extreme quantization identified as problematic. Goal: build scoring framework to detect loops and enable system recovery through backtracking and reprompting.

AI Agents Evals
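
One simple way to score looping, in the spirit of the framework the post asks about, is to measure how many n-grams in the output repeat earlier ones. This is a minimal sketch and not the poster's method: `loop_score`, `is_looping`, the n-gram size, and the 0.5 threshold are all assumptions chosen for illustration.

```python
from collections import Counter


def loop_score(tokens, n=4):
    """Fraction of n-grams that repeat an earlier n-gram (0.0 to 1.0)."""
    if len(tokens) < n:
        return 0.0
    grams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    counts = Counter(grams)
    # Every occurrence beyond the first counts as a repeat.
    repeats = sum(c - 1 for c in counts.values())
    return repeats / len(grams)


def is_looping(tokens, n=4, threshold=0.5):
    """Flag output as looping when the repeat fraction reaches the threshold."""
    return loop_score(tokens, n) >= threshold


if __name__ == "__main__":
    healthy = "the cat sat on the mat and looked out of the window".split()
    stuck = ("let me check the file " * 6).split()
    print(loop_score(healthy), is_looping(healthy))
    print(loop_score(stuck), is_looping(stuck))
```

An agent could call `is_looping` on a sliding window of recent output and trigger backtracking or reprompting when it returns true. A word-level split stands in for real tokenization here.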


Hacker News (AI)·Jun 13

Ancient genome duplications laid the foundations of complex brains

Study reveals ancient genome duplications laid foundations for complex brains. Researchers identified how duplication events enabled evolution of sophisticated brain structures in vertebrates.

Papers


Reddit r/LocalLLaMA·Jun 13

WIP EAGLE3 for Qwens

Work-in-progress implementation of EAGLE3 inference acceleration adapted for Qwen models. Software modification enabling faster token generation with Qwen-based systems.

Qwen Code generation Open source


Reddit r/LocalLLaMA·Jun 13

Snapcompact: Saving Tokens With Images

Snapcompact is an image compression technique that reduces token consumption in vision models. It optimizes visual representation without significant quality loss, enabling faster and cheaper inference.

Vision Code generation Open source


Reddit r/MachineLearning·Jun 13

I’m building a free bilingual machine-learning notebook course — looking for feedback on structure and coverage [R]

Developer building open-source ML course in Jupyter Notebooks, bilingual (English/Persian). Covers fundamentals, preprocessing, regression, classification, trees, clustering, time series, MLOps. Seeking feedback on chapter order, missing classical ML topics, and bilingual notebook utility for non-native learners.

Tools Open source Fine-tuning


Reddit r/LocalLLaMA·Jun 13

DeepSeek v4 Pro is too big for such a "midrange" performance, or am I missing something?

User questions DeepSeek v4 Pro's (1.6T parameters) relevance given mediocre performance versus smaller models: GLM 5.2 (750B), Kimi K2.7 (1T), MiniMax M3 (450B), and MiMo v2.5 Pro (1T) outperform it on benchmarks. Questions whether the model's value lies primarily in Huawei-based inference infrastructure rather than model quality.

DeepSeek Benchmarks Open source


Reddit r/LocalLLaMA·Jun 13

RTX 5080 + RTX 3090 Setup: 80+ Tok/s on Qwen 3.6 27B Q8

User reports 80+ tokens/s with Qwen 3.6 27B at Q8 quantization on a dual-GPU setup (RTX 5080 + RTX 3090). The post gives no details on the framework or test conditions.

Qwen Open source Infrastructure


Reddit r/LocalLLaMA·Jun 13

I don’t know who needs to hear this but 128GB BD-R XL M-DISC is SOTA for consumer-available archival optical storage (for backing up your models)

Practical guide for archiving local LLMs: BD-R XL M-DISC 128GB Blu-ray discs ($12-14 each) offer the best consumer archival durability (10 human lifespans, per the post). Compatible burners: ASUS 16D1X-U (~$100-250), LG/Buffalo alternatives from $80. M-DISC preferred over volatile USB storage.

Open source


The Decoder·Jun 13

Microsoft CEO Satya Nadella admits he's a token-maxer, too: "It's addictive"

Satya Nadella (Microsoft) warns against "token-maxing"—throwing frontier AI models at every problem. Marginal productivity gains must justify token costs. Paradoxically, he admits being a "token-maxer" himself and that it's "addictive."

OpenAI Business


Reddit r/LocalLLaMA·Jun 13

32 bit crossplatform coding agent running on pentium m with less than a second startup time

Prism, a 32-bit cross-platform coding agent, starts in under one second on Pentium M with <1% CPU usage. Supports sub-agents, local/cloud models, plugins. 500 KB, compatible with 386+. GitHub release planned.

AI Agents Code generation Open source


Hacker News (AI)·Jun 13

AI OSS tool repo goes archived over night after raising $7.3M Seed

An AI open-source tool repository was archived overnight after the project raised $7.3M in seed funding. Exact reasons for the archival remain unclear from available details.

Open source Funding Business


GitHub Trending·Jun 13

lobehub / lobehub

LobeHub organizes AI agents into 24/7 operations through hiring, scheduling, and reporting. Platform for managing autonomous agent teams.

AI Agents Multi-agent Tools


Reddit r/LocalLLaMA·Jun 13

Some thoughts on decentralized model sharing: What models should we share, and how?

Discussion on decentralized distribution of open-source LLMs. Author proposes prioritizing sharing of unquantized base models (fp16/bf16) over derived variants, arguing base models are essential primary data to preserve against growing restrictions from closed model providers.

Open source Llama Infrastructure


Reddit r/LocalLLaMA·Jun 13

GLM-5.2 next week, open weight, MIT

Zhipu AI announces GLM-5.2 next week as open weights under MIT license. No technical details provided in the Reddit post.

Open source Qwen


Reddit r/LocalLLaMA·Jun 13

A friendly reminder that APIs are rented, local weights are forever

Anthropic disabled Fable 5 globally after a sudden US export ban because it could not instantly verify cloud users' nationality. Post argues cloud APIs are rented and revocable for compliance reasons, while local weights provide guaranteed control and independence.

Anthropic Open source Regulation

