RSS

Reddit r/MachineLearning

https://www.reddit.com/r/MachineLearning/

Reddit r/MachineLearning·

Backpropagation destroys V1 brain alignment in one epoch, tracking RSA alignment to fMRI across training for BP, FA, predictive coding, and STDP [R]

Comparative study of learning rules (backprop, feedback alignment, predictive coding, STDP) via RSA alignment with human V1 fMRI. Backprop destroys 90% of V1 alignment after 1 epoch (r: 0.102→0.011), while PC and STDP lose only 25-31%. At epoch 40: PC/STDP >> BP/FA. Suggests fundamental trade-off between global error signals (higher layers) and early-layer alignment.

AlignmentBenchmarksPapers
SIG
78
HYP
15
Reddit r/MachineLearning·

LLM agents patch security bugs, pass all tests, but still leave the vulnerability open [R]

CVE-Bench evaluates 5 frontier models on 20 real-world CVEs (Pillow, GitPython, urllib3, etc.) across 300 runs. Max solve rate 50% (60% under advisory). Agents patch syntactically but leave vulnerabilities open. Significant cross-family gaps (OpenAI vs Laguna, p<0.05), within-family noise. Failure modes: wrong-search drift, hallucinations, context loss.

AI AgentsBenchmarksAI safety
SIG
78
HYP
15
Reddit r/MachineLearning·

[P] Built a persistent cognitive runtime around an LLM — zero behavioral prompts, emergent autonomy from architecture. Comparison test: standard LLM in identical ecosystem did nothing.[P]

Developer builds LIA, a persistent cognitive runtime around an LLM without behavioral prompts. Architecture includes 20k+ self-evaluated memories, cognitive kernel (LCRK v3), self-rule system, and private Linux domain. Control test: standard LLM in identical ecosystem remains inactive.

AI AgentsPrompt engineeringReasoning
SIG
35
HYP
72
Reddit r/MachineLearning·

How much of MLE-Bench's gains are the algorithm vs. better models + more search? [R]

MLE-Bench shows 80% gains over two years, but new research (FML-Bench) reveals little comes from real algorithmic progress. At equal step budget and identical models, the two-year-old AIDE algorithm matches modern agent/evolutionary search systems. FML-Bench unifies code editing agents, step definitions, and val/test splits to benchmark algorithmic efficiency.

BenchmarksAI AgentsEvals
SIG
72
HYP
25
Reddit r/MachineLearning·

What I learned building a debugger for PyTorch training loops and how it changed how I think about failure diagnosis [D]

Developer built NeuralDBG, a PyTorch debugger that automatically detects training failures (vanishing/exploding gradients, data anomalies). Key insight: failures are layer-localized, not global. Effective monitoring: gradient norm transitions per layer rather than raw histograms. Open-source tool available on PyPI.

ToolsCode generationOpen source
SIG
72
HYP
28
Reddit r/MachineLearning·

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Tomesphere: Chrome extension + website indexing 3M arxiv papers with LLM-curated summaries, OpenReview reviews, GitHub repos, HuggingFace models, citation graphs and SPECTER2 semantic neighbors. Free, no signup.

PapersToolsOpen source
SIG
72
HYP
35
Reddit r/MachineLearning·

Cross-species RSA: same learning rules (BP, PC, STDP, FA) tested against both human fMRI and macaque electrophysiology [P]

Cross-species comparison of learning rules (BP, PC, STDP, FA) tested on human fMRI and macaque electrophysiology (V1/V2/V4/IT). STDP and PC dominate V1/V2 (ρ ≈ 0.30/0.28), conserving human pattern. In IT, alignment depends on model capacity (ResNet-50: ρ ≈ 0.25) rather than learning rule. Code and two papers (arxiv 2604.16875, 2605.22401) available.

PapersBenchmarksReasoning
SIG
72
HYP
15
Reddit r/MachineLearning·

Augmented Equivariant Mesh Networks for Anatomical Mesh Segmentation (ICML 2026 Workshops) [R]

EAMS (Equivariant Anatomical Mesh Segmentor) applies rotational equivariance to mesh networks for 3D anatomical segmentation. The model (<2M parameters) maintains performance under geometric perturbations (40° rotation) where existing methods drop 25-26 IoU points. Evaluated on 4 clinical tasks (intracranial aneurysm, intraoral segmentation, liver).

PapersVisionReasoning
SIG
72
HYP
18