Edition of2026-06-11

RAG on mobile NPU, SLO-aware multi-agent orchestration, and science-reproducing agents: AI moves down the stack

Two papers today address the same problem from opposite ends: where and how to run inference efficiently. The RAG implementation on Snapdragon X Elite (arXiv:2606.11447 — embedding, reranking, and LLM generation fully on the Hexagon NPU) delivers 18.1× faster prefilling and 4× lower system energy than CPU, with quality judged equivalent by GPT-4.1 (9.32 vs 8.95). On the other end, INFRAMIND (arXiv:2606.11440) orchestrates agents with real-time awareness of infrastructure state — GPU queues, KV cache, latencies — achieving 7× lower latency and 99.9% SLO compliance under overload. The shared signal: inference optimization is no longer just a model-level problem, it's a systems-level one, whether you're on a 4W NPU or a pressured cluster.

On the agents front, ISE (arXiv:2606.11520) is the most actionable result: fine-tuning Qwen3-8B on 23,132 multi-turn OS agent trajectories generated with live sandbox execution pushes ClawEval from 19.3 to 37.7 pass@1, beating GPT-4o zero-shot and Qwen3-32B. This is a direct demonstration that trajectory data quality — grounded in real execution rather than static synthesis — matters more than model size. SocSci-Repro-Bench (arXiv:2606.11447) rounds out the evaluation side: 221 tasks reproducing published social science findings, Claude Code ahead of Codex, with agents able to identify research questions rather than simply memorizing outputs. A useful benchmark for anyone deploying agents on real analytical workflows.

ProHiFlo (arXiv:2606.11243) is the most vertical signal: 58.9% success on enzymatic active site scaffolding versus 41.2% for RFDiffusion, with 4× fewer sampling steps. The SE(3)-equivariant coarse-to-fine architecture with functional guidance via pre-trained predictors is a clear direction for de novo protein generation. Less immediately actionable for most practitioners, but worth tracking if you work on drug discovery or biodesign pipelines.

Today's 5 picks
01
02
03
04
05