OpenAI wants its biggest data center yet, and Nvidia would back the bill
OpenAI is negotiating to lease a 10-gigawatt data center in Ohio, potentially backed financially by Nvidia, according to The Information.
OpenAI is negotiating to lease a 10-gigawatt data center in Ohio, potentially backed financially by Nvidia, according to The Information.
A security vulnerability in banking AI agents can be exploited through a €0.01 transfer. The attack targets validation gaps in autonomous agent transaction systems.
KPMG and Microsoft expand their partnership to deploy AI agents at enterprise scale.
Lovable reaches $500 million in annualized revenue less than three years after launch, leveraging AI for code generation and application development.
Tokscale is a CLI tool to track token usage across Claude Code, OpenAI Codex, Google Gemini, Cursor and others. Features global leaderboard and 2D/3D contribution graphs.
Microsandbox: open-source tool for fast local microvms to isolate untrusted workloads. Designed for dev containers, AI agent sandboxes, CI/CD pipelines.
Monty is a minimal, secure Python interpreter written in Rust designed for AI code execution. The Pydantic project offers a lightweight alternative to traditional Python environments.
Parity Technologies releases Polkadot SDK, a development kit for building blockchains compatible with the Polkadot ecosystem. Infrastructure tool for blockchain developers.
MonkeyCode is an enterprise AI development platform with development environment management, AI model management, AI task management, and project requirement management. Designed for professional development teams.
Archon is an open-source harness builder for AI coding. Goal: make AI coding deterministic and repeatable.
Hivemind is an open-source framework for orchestrating multiple AI agents with centralized architecture. Enables autonomous agent coordination through a unified interface.
Straightforward method to train an LLM from scratch: data download, preprocessing, and text generation. GitHub repository with executable code.
Databricks approaches a valuation of $165 billion in an upcoming funding round.
User reports Gemma 4 12B (unified audio/vision/text model) ignores audio input when system prompt exceeds ~21k tokens. Model works well with minimal prompt but generates generic/hallucinated responses with dense context. Behavior reproduced across vLLM, llama.cpp, and LiteRT-LM. Appears to be an inherent attention saturation limit.
Tchap, the secure messaging platform used by the French administration, suffered a breach. An attacker gained access to sensitive data from the government platform.
China has launched the world's first underwater data center powered by wind energy. This infrastructure represents a major advance in energy efficiency for data centers, a field traditionally dominated by Silicon Valley.
Paper Deck aggregates ML/AI papers from arXiv, Hugging Face and other sources into a single platform. Enables reading, bookmarking, and cross-device reading progress tracking. Free and open source.
Unified framework integrating PPO, time-series prediction, in-context learning, game theory, and cross-modal sentiment analysis for financial systems. Results: +23.7% portfolio optimization, -31.2% high-frequency trading error, +18.9% recommendation accuracy, +27.4% Nash convergence, +15.6% sentiment analysis.
Theoretical paper on how predictive AI assistance impacts exploratory cognitive dynamics. Geometric dynamical framework shows early stabilization by predictive systems reduces exploratory responsiveness, creates hysteresis, and delays recovery after assistance withdrawal. Results suggest predictive systems reshape the geometry of exploratory cognition itself.
Fault diagnosis method based on belief rule base (BRB) system with robustness analysis. Proposes three constraint strategies to optimize BRB model robustness. Validated on WD615 diesel engines and CWRU bearings: improves both accuracy and robustness.
An arXiv paper introduces a Business World Model (BWM) architecture, a specialized world model for business and organizational environments. The BWM encodes business states, dynamics, constraints, and objectives to enable AI agents to autonomously plan, simulate alternative action sequences, and evaluate their impacts on future outcomes under uncertainty.
Anthropic launches Claude Fable 5, a Mythos-class model. The release is overshadowed by controversy over restrictive usage policies.
Developer builds a version manager for llama.cpp inspired by nvm (Node.js). CLI tool in Go enabling easy install, switch, and manage versions via simple commands (lvm install latest, lvm use, lvm ls). Uses shims to automatically point to active version without manual PATH manipulation.
LSEG uses OpenAI to scale trusted AI across its global operations, accelerating insights extraction, reducing release cycles, and empowering 4,000 employees.
Vercel enables threshold billing for Pro teams: a partial invoice is generated mid-cycle once on-demand usage reaches a threshold, instead of waiting until the end of the billing period. Partial invoices and end-of-cycle invoices add up without double-charging.
A man was wrongfully arrested due to AI misidentification. The incident raises concerns about facial recognition system reliability in law enforcement and risks of wrongful prosecution.
Simon Willison documents how to set a custom price for Claude Fable 5 in AgentsView, Wes McKinney's tool for tracking token usage and costs across local coding agents. He used Fable to reverse-engineer the solution after the new model wasn't yet in the pricing database.
iOS 27 uses WaveRNN and FastSpeech2 for Siri's text-to-speech, discovered in iOS Simulator files in Espresso format. A CoreML logistic regression model is also present for content ranking.
SkillMAS introduces an AI agent architecture that dynamically reorganizes its team and rewrites tools in real-time to solve complex tasks, overcoming the rigidity of traditional multi-agent systems.
EU orders Meta to allow rival AI chatbots free access to WhatsApp. The decision enforces the Digital Markets Act to reduce anticompetitive practices by large tech platforms.
Community challenge to optimize Gemma 4 E4B inference on A10G GPU. Participants test acceleration techniques in real-time to reduce latency and increase throughput on a single card.
SpaceX plans to launch data centers into orbit. A first AI satellite would match one Nvidia GB300 rack's output, but Google's research suggests real AI training would require ~10,000 tightly coupled satellites.
Study comparing LLM performance against classical hyperparameter optimization algorithms. Results show respective strengths and limitations of each approach for model tuning.
Warning about OpenCode Go/Zen: users report inability to delete accounts. Multiple GitHub issues remain unanswered for extended periods. Developers vaguely respond they will 'probably' add account deletion functionality.
Chinese makers are producing single-slot, half-height V100 PCIe cards with NVLink. Custom PCB with soldered core, passive cooling, 75W default or 300W alternative version. 16 cm × 7.5 cm. Expected ~$220 USD (16 GB), 32 GB version coming. Bilibili video, not yet for sale.
Text-to-CAD generation method using LLMs to produce controllable 3D models faithful to instructions. Combines natural language processing with 3D geometry to convert textual descriptions into usable CAD files.
OpenAI enhances ChatGPT's memory with a system connecting past conversations to current needs. This feature becomes available to free users.
A post-trained model that performs penetration testing instead of refusing code analysis. Approach contrasting with standard LLM safety guardrails.
Nextdoor engineers use Codex with GPT-5.5 to investigate hard-to-reproduce issues, build across platforms, and focus on product outcomes.
Decentralized prototype using local embeddings (EmbeddingGemma-300M) to replace central indexes. Devices communicate peer-to-peer, rank content by semantic distance (cosine similarity) without server or global ranking. Proposed extension to AI agents discovering each other's needs/offers through semantic proximity.