Archives

May 2026

3148 articles

Reddit r/MachineLearning·

Scaling LLMs horizontally: hidden-state coupling without weight modification [R]

Residual Coupling (RC) connects frozen language models in parallel via lightweight learned linear projections, without weight modification. Linear bridges read hidden states from one model and inject additive updates into another's residual stream. On medical data, RC reduces perplexity to 11.02 vs 56.80 for MoE (+80.7%), and improves TruthfulQA by 9.1 percentage points.

LlamaMulti-agentFine-tuning
SIG
72
HYP
28
Reddit r/LocalLLaMA·

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

DystopiaBench tests 42 LLMs (open and closed-source) on their ability to refuse progressively normalized dangerous requests. 6 dystopia categories (autonomous weapons, surveillance, behavioral control, etc.) with 5 escalation levels. Finding: models detect obvious harmful requests but fail against requests hidden behind dual-use and normalization. Open-source benchmark available.

BenchmarksAI safetyAlignment
SIG
72
HYP
45
Reddit r/MachineLearning·

Program misleading high school students into paying to perform academic misconduct in ML Research [D]

A paid program (Algoverse AI Research) marketed to high school students produces mass NeurIPS 2025 submissions (289 claimed acceptances) with obvious errors: duplicate results, abstracts contradicting findings, AI-generated citations, unreviewed datasets. Kevin Zhu, program-affiliated, lists 158 publications and 468 coauthors on OpenReview.

PapersEvalsRegulation
SIG
75
HYP
45