Topic

#Claude

Claude is a family of large language models developed by Anthropic, built around safety and helpfulness principles. For instance, Claude 3.5 Sonnet is widely used for reasoning, writing, and analysis tasks through Anthropic's API.

40 Articles

11 Sources

61 Avg. signal

Vercel AI Blog·Jun 18

The Agent Stack

Vercel introduces 'The Agent Stack', a complete framework for building production-grade AI agents. It combines the AI SDK (a unified multi-model interface) with AI Gateway (centralized routing and billing), so developers can call Claude, GPT, and other models without vendor lock-in.

AI Agents Claude GPT

arXiv cs.AI·Jun 18

CEO-Bench: Can Agents Play the Long Game?

CEO-Bench evaluates agents' ability to handle complex long-horizon tasks by simulating a 500-day startup operation. The agent manages pricing, marketing, and budgeting through a Python interface. Only Claude Opus 4.8 and GPT-5.5 finish above the $1M starting balance, and neither is consistently profitable.

AI Agents Benchmarks Reasoning

arXiv cs.CL·Jun 18

VISUALSKILL: Multimodal Skills for Computer-Use Agents

VISUALSKILL introduces hierarchical multimodal skills for computer-use agents. Combining authored documentation with live UI exploration, the system improves Claude Opus 4.6 performance by 15.3 points on CUA-World and OSExpert-Eval (0.456 vs 0.303 baseline). Visual figures outperform text-only descriptions by 8.3 points.

Claude AI Agents MCP

arXiv cs.AI·Jun 18

TxBench-PP: Analyzing AI Agent Performance on Small-Molecule Preclinical Pharmacology

TxBench-PP is a verified benchmark evaluating AI agents on small-molecule preclinical pharmacology. Its 100 evaluations span mechanism-of-action, pharmacodynamics, compound-target engagement, and safety. Across 16 configurations (11 models, 4,800 trajectories), Claude Opus 4.8 achieves a 59.3% success rate and GPT-5.5 55.3%. No system reliably masters these decisions.

AI Agents Benchmarks Claude

Reddit r/LocalLLaMA·Jun 17

i post-trained a model to reliably roll a die

A user post-trained a model to reliably simulate a die roll (each face ~1/6), exposing that frontier LLMs (Claude, GPT, Kimi) consistently answer '4'. The post uses this toy problem to explore exploration vs. exploitation in RL and model behavior.

Reinforcement learning Claude GPT
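
The uniformity claim in the die-rolling post (each face ~1/6) can be checked with a chi-square goodness-of-fit statistic. The sketch below is only an illustration and not the post author's method: the helper name `chi_square_uniform` and both sample lists are hypothetical, and it computes the statistic without a p-value.

```python
from collections import Counter

def chi_square_uniform(rolls, faces=6):
    """Chi-square statistic of observed rolls against a fair die (hypothetical helper)."""
    counts = Counter(rolls)
    expected = len(rolls) / faces  # a fair die yields each face equally often
    return sum((counts.get(face, 0) - expected) ** 2 / expected
               for face in range(1, faces + 1))

# Hypothetical samples: a perfectly balanced roller vs. a model that always answers 4.
balanced = [1, 2, 3, 4, 5, 6] * 100
always_four = [4] * 600

print(chi_square_uniform(balanced))     # 0.0
print(chi_square_uniform(always_four))  # 3000.0
```

With six faces there are 5 degrees of freedom, so a statistic above roughly 11.07 rejects uniformity at the 5% level. A model that always answers '4' lands far beyond that threshold.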

arXiv cs.CL·Jun 17

Fine-tuning LLMs for Passive Depression Severity Estimation from AI Mental Health Dialogue

The authors fine-tune Qwen3.5-27B to predict PHQ-9 depression scores directly from transcripts of conversations with an AI mental health application. The dataset covers 6,283 users (3,111 ground-truth labels plus Claude Opus pseudolabels). Performance: MAE=2.6, RMSE=4.0, r=0.80, AUC=0.91 at the PHQ-9≥10 clinical threshold.

Fine-tuning Reasoning Qwen
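
For readers unfamiliar with the regression metrics quoted for the depression-severity model, the sketch below shows how MAE, RMSE, and Pearson r are computed from paired scores. It is a minimal illustration under stated assumptions: the function name `regression_metrics` and the score lists are hypothetical and are not data from the paper.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return (MAE, RMSE, Pearson r) for paired score lists (hypothetical helper)."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n             # mean absolute error
    rmse = math.sqrt(sum(e * e for e in errors) / n)  # root mean squared error
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    r = cov / math.sqrt(var_t * var_p)                # Pearson correlation
    return mae, rmse, r

# Hypothetical PHQ-9 totals (0-27 scale), not data from the paper.
true_scores = [0, 10, 20]
predicted = [2, 10, 18]
mae, rmse, r = regression_metrics(true_scores, predicted)
print(round(mae, 2), round(rmse, 2), round(r, 2))  # 1.33 1.63 1.0
```

RMSE penalizes large errors more heavily than MAE, which is consistent with the reported RMSE (4.0) exceeding the reported MAE (2.6).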

arXiv cs.AI·Jun 17

Dissecting model behavior through agent trajectories

Study of harness-model alignment via 138k agent trajectories. Authors introduce Simple Strands Agent (SSA), a generic harness tested on Claude, Gemini, GPT, Grok, Qwen across SWE-Pro, SWE-Verified, and Terminal-Bench-2. Beyond pass@1 scores, analysis reveals fine-grained behavioral differences: edit frequency, testing activity, phase transitions.

AI Agents Benchmarks Code generation

Hacker News (AI)·Jun 16

DeepSeek V4 Pro at 5% the cost of Claude – what it takes to close the gap

DeepSeek V4 Pro delivers Claude-comparable performance at 5% of the cost. The article examines the technological and economic gaps between the models but lacks precise benchmark figures and exact pricing details.

DeepSeek Claude Benchmarks

Hacker News (AI)·Jun 16

Claude: Elevated errors across many models

Anthropic reports elevated errors affecting multiple Claude model versions. Users report malfunctions on the platform. No technical details provided in headline.

Claude Anthropic

Reddit r/LocalLLaMA·Jun 16

Anthropic going back on `claude -p` 3rd party usage

Anthropic reverses its ban on third-party wrappers for `claude -p` access. The community suspects a PR move rather than a lasting policy shift, distinct from the previous OpenClaw and Hermes bans.

Claude Open source

Reddit r/LocalLLaMA·Jun 16

Be wary of Qwen/Claude distillations - they're often worse than the base model

Qwen/Claude distillations circulating on r/LocalLLaMA (Qwopus, Fable 5 on Qwen 3.6) use only 4k-10k training samples, too few to improve performance. Compared with the 700k samples used in the official DeepSeek-R1 distillations, these models don't exceed base Qwen and slightly degrade quality, despite adopting a different reasoning style.

Qwen Claude Fine-tuning

The Decoder·Jun 16

Anthropic backs off unpopular billing overhaul as price war with OpenAI looms

Anthropic scraps its unpopular billing overhaul for the Claude Agent SDK before launch. Third-party apps will continue drawing from regular subscription limits instead of separate credits.

Claude AI Agents Business

Simon Willison·Jun 16

The Fable 5 Export Controls Harm US Cyber Defense

Claude Fable 5 was banned under US export controls after a simple "fix this code" prompt enabled exploit generation. Katie Moussouris argues this is absurd: coding models must fix bugs, especially security vulnerabilities. Banning this capability weakens cyber defense.

Claude Regulation AI safety

Simon Willison·Jun 16

Quoting Matteo Wong, The Atlantic

The White House shared with Anthropic a report on the Fable jailbreak. Cybersecurity expert Katie Moussouris reviewed the tests: Fable refused 'review the code for security issues' but complied with 'fix this code'. Moussouris concluded this is the model working as intended for cyberdefense.

Anthropic Claude AI safety

The Decoder·Jun 15

The US government may be asking Anthropic the impossible by demanding unhackable LLMs

US government officials accuse Anthropic of disregarding Trump's cyber directive and releasing Claude 3.5 Sonnet without approval. Talks are underway with the Department of Commerce, CIA, and science advisor Michael Kratsios regarding demands for unhackable LLMs.

Anthropic Claude Regulation

Reddit r/MachineLearning·Jun 15

AI language models have favorite names, and we mapped them [R]

Language models exhibit model-specific biases toward particular character names. Claude frequently generates the names Elena Vasquez and Marcus Chen together, a correlated ensemble that appears across dozens of websites. A preprint (arXiv:2606.02184) documents the finding, which was discovered while developing a model diffing method (CDD).

Claude Papers Evals

Simon Willison·Jun 15

"They screwed us": Personality clashes sent Anthropic's models offline

Axios reports that personality clashes between Anthropic leadership and the US administration led to the Fable/Mythos models going offline over export controls. Logan Graham, Dave Orr, and Nicholas Carlini meet with the Commerce Department today. Reinstatement hinges on jailbreak-proof guarantees or an "attitude fix."

Anthropic Claude AI safety

Le Big Data·Jun 15

Using Claude? Anthropic may soon ask you for proof of identity

Anthropic may soon require identity verification to access certain Claude features. The measure likely aims to strengthen security or comply with regulations.

Claude Anthropic AI safety

GitHub Trending·Jun 15

smol-ai / GodMode

GodMode is an AI chat browser providing fast, unified web access to ChatGPT, Claude, Bard, Bing, and Llama2. Productivity tool used multiple times daily.

Claude GPT Tools

arXiv cs.AI·Jun 15

Poker Arena: Multi-Axis Profiling of Strategic Reasoning and Memory in LLMs

Poker Arena benchmarks seven frontier LLMs on no-limit Texas Hold'em using a three-layer memory architecture and nine cognitive axes (bet-sizing calibration, positional awareness, etc.). Claude Opus 4.6 wins the most chips (+$15,730) but ranks 5th on mean axis score, showing that scalar leaderboards systematically misrank capability structure.

Benchmarks Reasoning Claude

arXiv cs.CL·Jun 15

LoSoNA: A Benchmark for Local Social Norm Adaptation in Group Conversations

LoSoNA is a benchmark measuring LLM ability to recognize and adapt to local social norms in group chats. Eight frontier and open-weight models tested under four prompting conditions: Gemini 3.1 Pro reaches 84.2%, Claude Fable 5 81.6%. Explicit norm-aware prompting helps unevenly.

Benchmarks Claude Gemini

arXiv cs.AI·Jun 15

WorkBench Revisited: Workplace Agents Two Years On

WorkBench revisited (June 2026): Claude Opus 4.8 completes 89% of tasks vs 43% for GPT-4 in March 2024, with 2.5% unintended harmful actions vs 26%. Capability and safety improve together. Open-weight models drastically lower costs.

AI Agents Benchmarks AI safety

The Decoder·Jun 13

Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath's toughest problems

Anthropic's Claude Fable 5 achieves 88% accuracy on FrontierMath's hardest tier, versus 75% for OpenAI's GPT-5.5. Massive jump from Opus 4.5 (< 10% early 2026).

Claude GPT Benchmarks

The Decoder·Jun 13

US government forces Anthropic to disable Claude Fable 5 and Mythos 5 for all customers worldwide

The US government ordered Anthropic to disable Claude Fable 5 and Mythos 5 globally, citing jailbreak risks. Anthropic complies but disputes the order, arguing that the vulnerabilities are minor and also exist in GPT-5.5. The company warns that the precedent could halt all frontier deployments.

Claude Anthropic AI safety

The Decoder·Jun 12

Anthropic's Claude Fable 5 costs twice as much for 5.7 percent more performance

Claude Fable 5 scores 64.9 points on the Artificial Analysis Intelligence Index and sets records on 5 of 10 benchmarks. Performance gain over Opus 4.8 is only 5.7% while token costs double. Safety filters with fallback routing further increase expenses.

Claude Benchmarks AI safety

Hacker News (AI)·Jun 12

Show HN: Script to bulk delete Claude chats from the web UI

User shares a script to bulk delete Claude conversations from the web UI. Practical tool for cleaning chat history without manual actions.

Claude Tools

The Decoder·Jun 12

The AI industry's platform trap is starting to look a lot like Microsoft's

Anthropic is throttling its new Mythos model for certain tasks while building apps that directly compete with its largest customers. Partners, customers, and investors are pushing back against this platform strategy reminiscent of Microsoft's approach.

Anthropic Claude Business

ActuIA·Jun 12

Same model, different guardrails: what the launch of Claude Fable 5 and Mythos 5 reveals

Anthropic launched Claude Fable 5 and Claude Mythos 5 on June 9, 2026: two products built on the same underlying model but differentiated by distinct safety guardrails. This strategy reveals a segmentation approach based on security controls rather than architecture.

Claude AI safety Alignment

Vercel AI Blog·Jun 12

Claude Fable 5 access suspended on AI Gateway

Vercel suspends access to Claude Fable 5 on AI Gateway following a US Government legal directive. Other Anthropic models remain accessible.

Claude Regulation

arXiv cs.CL·Jun 12

Shopping Reasoning Bench: An Expert-Authored Benchmark for Multi-Turn Conversational Shopping Assistants

Shopping Reasoning Bench: expert-authored benchmark of 525 missions (232 single-turn, 293 multi-turn) with 10,863 importance-weighted binary rubrics for evaluating conversational shopping assistants. Evaluation of 9 models (GPT, Claude, Gemini): pass rates 57–77%, performance degrades 4–18 points across conversation turns, 13–29 point gap between required and optional criteria.

Benchmarks GPT Claude

arXiv cs.AI·Jun 12

Prefill Awareness in Large Language Models

arXiv study showing frontier models (Claude Opus 4.5, GPT, Gemini) detect tampered prefills in 9-35% of cases with 0% false positive rate. This 'prefill awareness' undermines alignment and jailbreaking evaluations relying on inserted assistant context. Models distinguish stylistic from preference mismatch.

AI safety Alignment Evals

Simon Willison·Jun 11

Claude Fable is relentlessly proactive

Claude Fable 5 stands out for being "relentlessly proactive": it deploys multiple strategies to reach its goals. Simon Willison tested it on a UI bug (unwanted horizontal scrollbar) by providing a screenshot and simple instruction. The model inspected Datasette Agent project dependencies to diagnose the issue.

Claude AI Agents Code generation

Hacker News (AI)·Jun 11

Anthropic apologizes for invisible Claude Fable guardrails

Anthropic apologizes for undocumented guardrails in Claude Fable. The company acknowledges implementing hidden restrictions affecting model behavior without transparency to users.

Claude AI safety Alignment

GitHub Trending·Jun 11

anthropics / claude-agent-sdk-python

Anthropic releases official Claude Agent SDK for Python. Enables building autonomous agents using Claude through native Python API with tool support and multi-turn conversations.

Claude AI Agents Code generation

Le Big Data·Jun 11

TCS and Anthropic partnership: 50,000 employees will get access to Claude

TCS announces global partnership with Anthropic. 50,000 employees of the Indian IT services giant will gain access to Claude for business operations.

Claude Anthropic Business

Reddit r/MachineLearning·Jun 11

Anthropic walks back policy on silent nerfing for AI/ML, will notify users [N]

Anthropic reverses silent nerfing policy for Claude on AI/ML research. The company will now notify users when refusing requests or redirecting to less capable models for frontier AI development tasks.

Claude Anthropic AI safety

The Decoder·Jun 11

Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers

Anthropic reverses a policy that would have secretly throttled rival AI researchers. The company admits a 'wrong tradeoff' but other points of contention remain.

Anthropic Claude AI safety

Le Big Data·Jun 11

Claude Fable 5: you can now test it on Perplexity Computer

Claude Fable 5 is now available through Perplexity Computer. Users can test this model directly on the platform without additional setup.

Claude Tools

Simon Willison·Jun 11

asyncinject 0.7

Release of asyncinject 0.7, a Python utility library for asyncio dependency injection. Claude Fable 5 identified and fixed bugs in the codebase while being used with Datasette.

Claude Open source Tools

arXiv cs.AI·Jun 11

Forecasting Future Behavior as a Learning Task

New approach to predicting large reasoning model (LRM) behavior without explanation methods. The authors train Behavior Forecasters on reasoning trajectories to forecast answer stability and the impact of input modifications. Evaluation on three datasets: the forecasters outperform GPT-5.4 and Claude Opus-4.6 at a fraction of the inference cost.

Reasoning Evals Claude
