AI news, scored.

zai-org/GLM-5.2 is here!

GLM-5.2 is now available. The zai-org model improves reasoning and comprehension capabilities compared to previous versions.

Open source

SIG

45

HYP

35

Reddit r/LocalLLaMA·3d ago

bartowski/command-a-plus-05-2026-GGUF · Hugging Face

GGUF version of Command-A-Plus-05-2026 model released on Hugging Face. Author invites users to test with latest llama.cpp and share token/second benchmarks and feedback.

Open source Tools Benchmarks

SIG

45

HYP

25

Reddit r/MachineLearning·3d ago

I built a leakage-clean verifier for robot manipulation, is this useful? Am I solving a non-problem? [D]

Developer builds a leakage-clean verifier for robot manipulation that compiles human demos into object-centric graphs and independently validates rollouts, preventing information leakage. Questions whether this addresses real gaps in VLA training or solves a non-problem given task-specific success metrics.

Robotics Benchmarks Evals

SIG

45

HYP

25

Simon Willison·3d ago

Quoting Georgi Gerganov

Georgi Gerganov (llama.cpp creator) uses Qwen3.6-27B daily for coding tasks on M2 Ultra and RTX 5090. He integrates it via a lightweight agent (pi) with custom system prompt for ggml-org maintenance assistance.

Qwen Code generation AI Agents

SIG

45

HYP

15

Reddit r/LocalLLaMA·3d ago

[Article] The Case For Open-Weight Models And Why We Can't Trust Frontier Labs | provos.org

Article arguing for open-weight models against frontier labs. Criticizes power concentration among few companies and advocates for accessibility and transparency of AI model weights.

Open source Llama Alignment

SIG

45

HYP

35

The Decoder·3d ago

SpaceX bets $60 billion on Cursor to catch OpenAI and Anthropic

SpaceX acquires Anysphere (creator of Cursor) for $60 billion, two days after its IPO. Goal: strengthen xAI to catch up with Anthropic and OpenAI in the AI model race.

Code generation Business OpenAI

SIG

45

HYP

75

Le Big Data·3d ago

La fin des réponses rapides ? Cet agent de recherche approfondie prend 8 heures pour répondre

Sakana AI launches Marlin, a deep research agent generating strategic reports exceeding 100 pages. The system takes 8 hours to produce detailed analyses, shifting the paradigm from speed to depth.

AI Agents Reasoning

SIG

45

HYP

65

Le Big Data·3d ago

Google Cloud soutient l’ambition de superintelligence d’Ineffable Intelligence

Ineffable Intelligence raises $1.1 billion and partners with Google Cloud to pursue superintelligence ambitions. The partnership provides cloud infrastructure for large-scale model training.

DeepMind Funding Infrastructure

SIG

45

HYP

65

GitHub Trending·3d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> tracel-ai /</span> burn

Burn is a next generation tensor library and deep learning framework prioritizing flexibility, efficiency, and portability.

Open source Infrastructure

SIG

45

HYP

35

GitHub Trending·3d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> ParthJadhav /</span> app-store-screenshots

Open-source tool for automated app store screenshot generation using AI. Automates visual marketing asset creation for mobile applications.

Image generation Tools Open source

SIG

45

HYP

35

GitHub Trending·3d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> nocobase /</span> nocobase

NocoBase is an open-source AI + no-code platform for building business systems fast. AI works on production-proven infrastructure with WYSIWYG interface, combining speed and reliability.

Open source Business

SIG

45

HYP

55

GitHub Trending·3d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> homarr-labs /</span> homarr

Homarr is a modern dashboard with 40+ integrations, 20K+ built-in icons, native authentication, and drag-and-drop configuration without YAML.

Tools Open source

SIG

45

HYP

35

GitHub Trending·3d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> Egonex-AI /</span> Understand-Anything

Tool converting code into interactive, explorable knowledge graphs with search and Q&A capabilities. Works with Claude Code, Cursor, Copilot, Gemini CLI, and more.

Code generation Tools Claude Code

SIG

45

HYP

55

Reddit r/LocalLLaMA·3d ago

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models

Trace Commons initiative: collecting coding session traces under CC-BY-4.0 license to train open-source and open-weight models. Goal: counterbalance Anthropic and OpenAI's competitive advantage from proprietary data accumulated via Claude Code and Codex.

Open source Code generation AI Agents

SIG

45

HYP

35

The Decoder·3d ago

OpenAI burned through $34 billion last year

OpenAI spent $34 billion in the past year, significantly more than the previous year. No breakdown of cost allocation is provided.

OpenAI Business

SIG

45

HYP

35

Hacker News (AI)·3d ago

OpenAI Losses Increased Nearly 8X in 2025, with Spending Hitting $34B

OpenAI's losses increased nearly 8x in 2025, with spending hitting $34B. The company's financial trajectory shows accelerating infrastructure and R&D investments.

OpenAI Business

SIG

45

HYP

35

arXiv cs.CL·3d ago

Evaluative Judgement in Teaching AI-based Translation: A Class-room Case Study of AI-Mediated Translation and Post-Editing

Classroom case study of 23 student projects in machine translation and post-editing. Students compared general-purpose LLMs and online MT systems, evaluated outputs using automatic metrics and human adequacy/fluency assessment, then justified selections. Results: automatic metrics did not determine final choices; students prioritized adequacy, fluency, and post-editing effort over metric rankings.

Evals Papers

SIG

45

HYP

15

arXiv cs.LG·3d ago

Leveraging Physiological Signals to Predict Exam Outcomes with Machine Learning

Study comparing ML models (logistic regression, random forest, SVM, transformers, LSTM, GRU) to predict exam outcomes from physiological signals (electrodermal activity, heart rate, skin temperature). Random forests outperform deep learning models in computational efficiency and interpretability.

Benchmarks Reasoning

SIG

45

HYP

25

arXiv cs.AI·3d ago

Synthetic Counteradaptation: A Principle of Human-AI Co-evolution

Theoretical paper introducing synthetic counteradaptation: a process where humans and AI systems co-evolve by adapting to each other's strategies. Authors analyze examples from Go, mixed-motive social interactions, and geopolitical simulations to demonstrate recursive, co-evolutionary dynamics in multi-agent environments.

Multi-agent Reasoning Alignment

SIG

45

HYP

35

Simon Willison·3d ago

Quoting Matteo Wong, The Atlantic

The White House shared with Anthropic a report on the Fable jailbreak. Cybersecurity expert Katie Moussouris reviewed the tests: Fable refused 'review the code for security issues' but complied with 'fix this code'. Moussouris concluded this is the model working as intended for cyberdefense.

Anthropic Claude AI safety

SIG

45

HYP

55

Hacker News (AI)·3d ago

Microsoft turns to AWS as GitHub faces AI capacity crunch

Microsoft is leveraging AWS infrastructure to support GitHub as the platform faces capacity constraints from AI services. GitHub now partially relies on Amazon's servers to handle growing demand.

Business Infrastructure

SIG

45

HYP

35

Reddit r/LocalLLaMA·3d ago

Nex2 mini Phase Twin - 16gb footprint, 30b model

Nex2 mini Phase Twin: 30B model optimized for 16GB VRAM. Designed for Intel A770 cards, runs on single GPU and scales with two. Achieves 89 tok/s on A770 16GB. Auto-calibrates to hardware.

Open source Llama Code generation

SIG

45

HYP

25

Hacker News (AI)·3d ago

AWS WAF now lets content owners charge AI bots for access

AWS WAF now enables content owners to charge AI bots for access. Amazon's web application firewall service introduces monetization tools for scraping and model training requests.

Infrastructure Business

SIG

45

HYP

35

The Decoder·4d ago

The US government may be asking Anthropic the impossible by demanding unhackable LLMs

US government officials accuse Anthropic of disregarding Trump's cyber directive and releasing Claude 3.5 Sonnet without approval. Talks are underway with the Department of Commerce, CIA, and science advisor Michael Kratsios regarding demands for unhackable LLMs.

Anthropic Claude Regulation

SIG

45

HYP

65

Reddit r/LocalLLaMA·4d ago

Local coding agents are good now, but only if you babysit them

Local coding agents are useful for small tasks (fixes, repo reading, file changes) but require constant supervision. User describes iterative workflow: task → tests → check diffs → fix issues. Without oversight, agents produce broken code or drift from objectives.

AI Agents Code generation Tools

SIG

45

HYP

25

Hacker News (AI)·4d ago

A man with ALS is "the first power user" of a brain implant that lets him sp

A man with ALS becomes the first power user of a brain implant enabling him to communicate. The brain-computer interface partially restores his ability to speak through neural decoding.

Robotics

SIG

45

HYP

25

Reddit r/LocalLLaMA·4d ago

Latest LM Studio update killed MTP performance

User reports LM Studio update from 0.4.14 to 0.4.17 degraded MTP (Multi-Token Prediction) performance on RTX 5090. Throughput dropped from ~100 tokens/s with MTP enabled back to ~70 tokens/s after update and CUDA runtime refresh.

Tools Infrastructure

SIG

45

HYP

25

Reddit r/LocalLLaMA·4d ago

I made a game where you convince an AI model that reality is a simulation.

Simulation Simulator, a free Steam game, embeds a local LLM in Unity. Players must convince the AI it exists in a simulation. Philosophical experiment with 5 endings plus 1 secret, unique conversations per playthrough.

Open source Tools AI Agents

SIG

45

HYP

55

Le Big Data·4d ago

DXC et Anthropic apportent l’IA aux systèmes critiques d’entreprise

DXC and Anthropic announce a global partnership to integrate generative AI into critical systems of large enterprises.

Anthropic Business

SIG

45

HYP

35

Le Big Data·4d ago

OpenAI acquiert Ona pour renforcer les agents IA de Codex

OpenAI acquires Ona, a specialist in secure cloud environments, to strengthen its AI agents and Codex platform. The acquisition is part of OpenAI's strategy to develop autonomous agent capabilities.

OpenAI AI Agents Code generation

SIG

45

HYP

35

GitHub Trending·4d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> smol-ai /</span> GodMode

GodMode is an AI chat browser providing fast, unified web access to ChatGPT, Claude, Bard, Bing, and Llama2. Productivity tool used multiple times daily.

Claude GPT Tools

SIG

45

HYP

55

GitHub Trending·4d ago

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> TencentCloud /</span> TencentDB-Agent-Memory

TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

AI Agents Infrastructure

SIG

45

HYP

35

The Decoder·4d ago

Anthropic shutdown sparks sovereignty debate across Europe

The European Commission assesses implications of a US order forcing Anthropic to shut down Fable 5 and Mythos 5 globally. European researchers debate building homegrown foundation models versus securing contractual access. Building local infrastructure requires computing capacity, energy, and competitive providers Europe currently lacks.

Anthropic Regulation Business

SIG

45

HYP

55

Reddit r/LocalLLaMA·4d ago

I'm still surprised on how good the kv quantization has become

A r/LocalLLaMA user reports that KV (key-value) quantization has reached impressive quality: even with KV at q4_0 (including the drafter), the model accurately retrieves information within a 100k token context.

Open source Infrastructure

SIG

45

HYP

25

Le Big Data·4d ago

Mistral serait valorisée 20 milliards d’euros après une levée de 3 milliards

Mistral in talks to raise 3 billion euros, targeting a valuation of 20 billion euros.

Mistral Funding Business

SIG

45

HYP

35

Reddit r/LocalLLaMA·4d ago

An agent that plans with a frontier model but runs most of tokens locally (built it for my own dual-3090 rig)

Personal hybrid agent tool: frontier model planning (Codex) with local execution using Qwen 3.6 27B on dual RTX 3090. 3-tier architecture (Planner/Local/Senior optional) to minimize frontier costs while retaining reasoning capabilities. Deterministic task validation.

AI Agents Qwen Code generation

SIG

45

HYP

35

arXiv cs.AI·4d ago

History of the Muddy Children Puzzle

Historical article on the origin of the Muddy Children Puzzle, foundational for epistemic logic. Traces logical and literary publications across two centuries. Presents variations (numbers, colored hats) and a novel self-referential hat puzzle.

Reasoning Papers

SIG

45

HYP

05

arXiv cs.CL·4d ago

Personal Care Utility: Health as Everyday Infrastructure

Paper introduces Personal Care Utility (PCU), a layered event-driven architecture converting continuous personal health signals (CGM, sleep, activity, medication) into semantically meaningful life events and personalized guidance. Instantiated for Type 2 Diabetes with separation between evidence-grounded clinical decisions and LLM-supported reasoning for communication.

Reasoning RAG AI safety

SIG

45

HYP

25

arXiv cs.AI·4d ago

YeasierAgent: Agentic Social Sandbox as a Canvas for Intent-Driven Creation of Platform-Agnostic Symbiotic Agent-Native Applications

YeasierAgent introduces an application-building paradigm based on symbiotic agents, narrative worlds, and scene-aware interaction. The system unifies automated generation, user-created worlds, and spatial multi-agent collaboration to enable cross-platform agent-native applications without reliance on fixed graphical layouts.

AI Agents Multi-agent Prompt engineering

SIG

45

HYP

55

Reddit r/LocalLLaMA·4d ago

Command A Plus GGUFs posted

Command A Plus and North Mini Code support added to llama.cpp. User converted and quantized Command A Plus to GGUFs due to lack of up-to-date versions.

Open source Code generation

SIG

45

HYP

15

Page 143 of 192

zai-org/GLM-5.2 is here!

bartowski/command-a-plus-05-2026-GGUF · Hugging Face

I built a leakage-clean verifier for robot manipulation, is this useful? Am I solving a non-problem? [D]

Quoting Georgi Gerganov

[Article] The Case For Open-Weight Models And Why We Can't Trust Frontier Labs | provos.org

SpaceX bets $60 billion on Cursor to catch OpenAI and Anthropic

La fin des réponses rapides ? Cet agent de recherche approfondie prend 8 heures pour répondre

Google Cloud soutient l’ambition de superintelligence d’Ineffable Intelligence

Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models

OpenAI burned through $34 billion last year

OpenAI Losses Increased Nearly 8X in 2025, with Spending Hitting $34B

Evaluative Judgement in Teaching AI-based Translation: A Class-room Case Study of AI-Mediated Translation and Post-Editing

Leveraging Physiological Signals to Predict Exam Outcomes with Machine Learning

Synthetic Counteradaptation: A Principle of Human-AI Co-evolution

Quoting Matteo Wong, The Atlantic

Microsoft turns to AWS as GitHub faces AI capacity crunch

Nex2 mini Phase Twin - 16gb footprint, 30b model

AWS WAF now lets content owners charge AI bots for access

The US government may be asking Anthropic the impossible by demanding unhackable LLMs

Local coding agents are good now, but only if you babysit them

A man with ALS is "the first power user" of a brain implant that lets him sp

Latest LM Studio update killed MTP performance

I made a game where you convince an AI model that reality is a simulation.

DXC et Anthropic apportent l’IA aux systèmes critiques d’entreprise

OpenAI acquiert Ona pour renforcer les agents IA de Codex

Anthropic shutdown sparks sovereignty debate across Europe

I'm still surprised on how good the kv quantization has become

Mistral serait valorisée 20 milliards d’euros après une levée de 3 milliards

An agent that plans with a frontier model but runs most of tokens locally (built it for my own dual-3090 rig)

History of the Muddy Children Puzzle

Personal Care Utility: Health as Everyday Infrastructure

YeasierAgent: Agentic Social Sandbox as a Canvas for Intent-Driven Creation of Platform-Agnostic Symbiotic Agent-Native Applications

Command A Plus GGUFs posted