Edition of2026-05-24

Claude Code discovers a reasoning algorithm for $40 — cuts compute 70% vs. standard self-consistency

By the editorial team

Today's 5 picks

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers from UMD, Google, and Meta use AutoTTS to let Claude Code independently discover control algorithms for AI reasoning. The discovered algorithm reduces compute by 70% versus standard self-consistency while matching accuracy. The search cost $40 and took 160 minutes.

Claude Code AI Agents Reasoning

Reddit r/LocalLLaMA·SIG 72

Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA

Benchmark on 30 long PDFs (171 questions) comparing vision LLMs vs OCR for document QA. Claude Sonnet 4.5 native PDF: 52% accuracy, $0.2552/query (5th/6). LlamaCloud premium + OCR: 59.6%, $0.1885/query. Vision underperforms on charts/tables; premium OCR more robust. Vision LLM has 7% intrinsic failure rate vs 0% for OCR after retry.

Claude Vision RAG

Reddit r/LocalLLaMA·SIG 72

llampart 1.0.0 - I released a standalone local web UI for llama-server with translations, extended settings and a polished conversation sidebar

llampart 1.0.0, standalone local web UI for llama-server, released as MIT open-source. Features extended settings, 6-language localization, two-column conversation sidebar, MCP integration, interface modes (dark/light/Frosted Glass), local import/export, and Caddy deployment guide.

Llama Open source Tools

Reddit r/MachineLearning·SIG 72

Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA [D]

Benchmark on 30 long PDFs (171 questions) comparing native vision-LLMs vs OCR pipelines for document QA. Claude Sonnet 4.5 used. LlamaCloud premium achieves 59.6% accuracy ($0.1885/query), native vision 52% ($0.2552/query, most expensive). Vision underperforms on charts/tables; premium OCR more robust. Vision-LLM has 7% intrinsic failure rate vs 0% for OCR after retries.

Vision Benchmarks RAG

Reddit r/LocalLLaMA·SIG 65

I built a local GUI for the TradingAgents framework — works with Ollama

Developer builds web GUI for TradingAgents, a multi-agent LLM stock analysis framework. Replaces CLI with local interface supporting Ollama, OpenAI, Anthropic, Google, DeepSeek and others. Adds live pipeline visualization, report reader, token reduction (~50% concise mode), multi-session chat. Apache 2.0.

AI Agents Multi-agent Open source