zai-org/GLM-5.2 is here!
GLM-5.2 is now available. The zai-org model improves reasoning and comprehension capabilities compared to previous versions.
GLM-5.2 is now available. The zai-org model improves reasoning and comprehension capabilities compared to previous versions.
GGUF version of Command-A-Plus-05-2026 model released on Hugging Face. Author invites users to test with latest llama.cpp and share token/second benchmarks and feedback.
Developer builds a leakage-clean verifier for robot manipulation that compiles human demos into object-centric graphs and independently validates rollouts, preventing information leakage. Questions whether this addresses real gaps in VLA training or solves a non-problem given task-specific success metrics.
Georgi Gerganov (llama.cpp creator) uses Qwen3.6-27B daily for coding tasks on M2 Ultra and RTX 5090. He integrates it via a lightweight agent (pi) with custom system prompt for ggml-org maintenance assistance.
Article arguing for open-weight models against frontier labs. Criticizes power concentration among few companies and advocates for accessibility and transparency of AI model weights.
SpaceX acquires Anysphere (creator of Cursor) for $60 billion, two days after its IPO. Goal: strengthen xAI to catch up with Anthropic and OpenAI in the AI model race.
Sakana AI launches Marlin, a deep research agent generating strategic reports exceeding 100 pages. The system takes 8 hours to produce detailed analyses, shifting the paradigm from speed to depth.
Ineffable Intelligence raises $1.1 billion and partners with Google Cloud to pursue superintelligence ambitions. The partnership provides cloud infrastructure for large-scale model training.
Burn is a next generation tensor library and deep learning framework prioritizing flexibility, efficiency, and portability.
Open-source tool for automated app store screenshot generation using AI. Automates visual marketing asset creation for mobile applications.
NocoBase is an open-source AI + no-code platform for building business systems fast. AI works on production-proven infrastructure with WYSIWYG interface, combining speed and reliability.
Homarr is a modern dashboard with 40+ integrations, 20K+ built-in icons, native authentication, and drag-and-drop configuration without YAML.
Tool converting code into interactive, explorable knowledge graphs with search and Q&A capabilities. Works with Claude Code, Cursor, Copilot, Gemini CLI, and more.
Trace Commons initiative: collecting coding session traces under CC-BY-4.0 license to train open-source and open-weight models. Goal: counterbalance Anthropic and OpenAI's competitive advantage from proprietary data accumulated via Claude Code and Codex.
OpenAI spent $34 billion in the past year, significantly more than the previous year. No breakdown of cost allocation is provided.
OpenAI's losses increased nearly 8x in 2025, with spending hitting $34B. The company's financial trajectory shows accelerating infrastructure and R&D investments.
Classroom case study of 23 student projects in machine translation and post-editing. Students compared general-purpose LLMs and online MT systems, evaluated outputs using automatic metrics and human adequacy/fluency assessment, then justified selections. Results: automatic metrics did not determine final choices; students prioritized adequacy, fluency, and post-editing effort over metric rankings.
Study comparing ML models (logistic regression, random forest, SVM, transformers, LSTM, GRU) to predict exam outcomes from physiological signals (electrodermal activity, heart rate, skin temperature). Random forests outperform deep learning models in computational efficiency and interpretability.
Theoretical paper introducing synthetic counteradaptation: a process where humans and AI systems co-evolve by adapting to each other's strategies. Authors analyze examples from Go, mixed-motive social interactions, and geopolitical simulations to demonstrate recursive, co-evolutionary dynamics in multi-agent environments.
The White House shared with Anthropic a report on the Fable jailbreak. Cybersecurity expert Katie Moussouris reviewed the tests: Fable refused 'review the code for security issues' but complied with 'fix this code'. Moussouris concluded this is the model working as intended for cyberdefense.
Microsoft is leveraging AWS infrastructure to support GitHub as the platform faces capacity constraints from AI services. GitHub now partially relies on Amazon's servers to handle growing demand.
Nex2 mini Phase Twin: 30B model optimized for 16GB VRAM. Designed for Intel A770 cards, runs on single GPU and scales with two. Achieves 89 tok/s on A770 16GB. Auto-calibrates to hardware.
AWS WAF now enables content owners to charge AI bots for access. Amazon's web application firewall service introduces monetization tools for scraping and model training requests.
US government officials accuse Anthropic of disregarding Trump's cyber directive and releasing Claude 3.5 Sonnet without approval. Talks are underway with the Department of Commerce, CIA, and science advisor Michael Kratsios regarding demands for unhackable LLMs.
Local coding agents are useful for small tasks (fixes, repo reading, file changes) but require constant supervision. User describes iterative workflow: task → tests → check diffs → fix issues. Without oversight, agents produce broken code or drift from objectives.
A man with ALS becomes the first power user of a brain implant enabling him to communicate. The brain-computer interface partially restores his ability to speak through neural decoding.
User reports LM Studio update from 0.4.14 to 0.4.17 degraded MTP (Multi-Token Prediction) performance on RTX 5090. Throughput dropped from ~100 tokens/s with MTP enabled back to ~70 tokens/s after update and CUDA runtime refresh.
Simulation Simulator, a free Steam game, embeds a local LLM in Unity. Players must convince the AI it exists in a simulation. Philosophical experiment with 5 endings plus 1 secret, unique conversations per playthrough.
DXC and Anthropic announce a global partnership to integrate generative AI into critical systems of large enterprises.
OpenAI acquires Ona, a specialist in secure cloud environments, to strengthen its AI agents and Codex platform. The acquisition is part of OpenAI's strategy to develop autonomous agent capabilities.
GodMode is an AI chat browser providing fast, unified web access to ChatGPT, Claude, Bard, Bing, and Llama2. Productivity tool used multiple times daily.
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
The European Commission assesses implications of a US order forcing Anthropic to shut down Fable 5 and Mythos 5 globally. European researchers debate building homegrown foundation models versus securing contractual access. Building local infrastructure requires computing capacity, energy, and competitive providers Europe currently lacks.
A r/LocalLLaMA user reports that KV (key-value) quantization has reached impressive quality: even with KV at q4_0 (including the drafter), the model accurately retrieves information within a 100k token context.
Mistral in talks to raise 3 billion euros, targeting a valuation of 20 billion euros.
Personal hybrid agent tool: frontier model planning (Codex) with local execution using Qwen 3.6 27B on dual RTX 3090. 3-tier architecture (Planner/Local/Senior optional) to minimize frontier costs while retaining reasoning capabilities. Deterministic task validation.
Historical article on the origin of the Muddy Children Puzzle, foundational for epistemic logic. Traces logical and literary publications across two centuries. Presents variations (numbers, colored hats) and a novel self-referential hat puzzle.
Paper introduces Personal Care Utility (PCU), a layered event-driven architecture converting continuous personal health signals (CGM, sleep, activity, medication) into semantically meaningful life events and personalized guidance. Instantiated for Type 2 Diabetes with separation between evidence-grounded clinical decisions and LLM-supported reasoning for communication.
YeasierAgent introduces an application-building paradigm based on symbiotic agents, narrative worlds, and scene-aware interaction. The system unifies automated generation, user-created worlds, and spatial multi-agent collaboration to enable cross-platform agent-native applications without reliance on fixed graphical layouts.
Command A Plus and North Mini Code support added to llama.cpp. User converted and quantized Command A Plus to GGUFs due to lack of up-to-date versions.