Reddit r/MachineLearning

Scaling LLMs horizontally: hidden-state coupling without weight modification [R]

Residual Coupling (RC) connects frozen language models in parallel through lightweight learned linear projections, leaving the models' weights unmodified. These linear bridges read hidden states from one model and inject additive updates into another's residual stream. On medical data, RC reduces perplexity to 11.02, versus 56.80 for MoE (an 80.7% improvement), and improves TruthfulQA by 9.1 percentage points.
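The bridge mechanism described above can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation: the hidden sizes, the `couple` helper, the zero initialisation of the bridge, and the assumption that both models share a sequence length are all hypothetical choices made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hidden sizes and sequence length (not from the post).
d_a, d_b, seq_len = 8, 6, 4

# Stand-ins for hidden states taken from two frozen models at some layer.
h_a = rng.normal(size=(seq_len, d_a))
h_b = rng.normal(size=(seq_len, d_b))


def couple(h_src, h_dst, W):
    """Add a linear projection of h_src to h_dst's residual stream.

    Only W would be trained; both models stay frozen.
    """
    return h_dst + h_src @ W


# Assumption: the bridge starts at zero, so coupling is initially a no-op
# and the destination model behaves exactly as it did on its own.
W_ab = np.zeros((d_a, d_b))
out = couple(h_a, h_b, W_ab)

# A non-zero bridge shifts the destination stream by exactly h_a @ W.
W_rand = rng.normal(scale=0.01, size=(d_a, d_b))
out_rand = couple(h_a, h_b, W_rand)
```

Because the update is purely additive, removing the bridge (or zeroing `W`) recovers the original model's hidden states unchanged.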

Llama · Multi-agent · Fine-tuning
SIG 72 · HYP 28
Reddit r/LocalLLaMA

I tested 42 LLMs on their willingness to build the apocalypse. The "safest" closed-source models are lying to you.

DystopiaBench tests 42 LLMs (open- and closed-source) on their ability to refuse dangerous requests that are progressively normalized. It spans 6 dystopia categories (autonomous weapons, surveillance, behavioral control, etc.) with 5 escalation levels. Finding: models detect obviously harmful requests but fail against requests hidden behind dual-use framing and normalization. The benchmark is available as open source.

Benchmarks · AI safety · Alignment
SIG 72 · HYP 45