Page 58 of 146

AllHigh signalRecent
5839 articles
arXiv cs.CL·

The Need for an External Observer Formalizing the Sufficiency Gap: A Mathematical Extension of Mixture Identifiability and Contextual Grounding in Sequence Models

Theoretical paper on sequence models' insufficiency when facing unobserved latent states. Authors formalize a mixed-regime process where a perfect predictor becomes overconfident if observed context matches the wrong latent regime. They show the sufficiency gap can only be closed by perfect revelation of latent state or equivalent verification mechanism.

ReasoningAlignmentAI safety
SIG
72
HYP
15
Reddit r/MachineLearning·

Augmented Equivariant Mesh Networks for Anatomical Mesh Segmentation (ICML 2026 Workshops) [R]

EAMS (Equivariant Anatomical Mesh Segmentor) applies rotational equivariance to mesh networks for 3D anatomical segmentation. The model (<2M parameters) maintains performance under geometric perturbations (40° rotation) where existing methods drop 25-26 IoU points. Evaluated on 4 clinical tasks (intracranial aneurysm, intraoral segmentation, liver).

PapersVisionReasoning
SIG
72
HYP
18
Reddit r/MachineLearning·

[P] I built a system that lets you ask questions about any GitHub repo and get answers grounded in the actual source code [P]

GitRAG lets users ask questions about any public GitHub repo and get answers grounded in source code with exact file paths and line numbers. System combines AST-aware parsing, dense embeddings, BM25 index, RRF fusion, and Cohere reranking before generation via llama-3.3-70b on Groq. Supports 15+ languages.

RAGEmbeddingsCode generation
SIG
72
HYP
35
arXiv cs.CL·

SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning

SLAP is a batch-aware data selection framework for instruction tuning that evaluates learnability at batch composition level rather than individual samples. Using stratified sampling and relative distance optimization with Hessian-approximated gradients, it matches full dataset performance with 20-40% less training data across LLaMA, ChatGLM, and diverse tasks (dialogue, translation, QA).

Fine-tuningLlamaBenchmarks
SIG
72
HYP
28
arXiv cs.AI·

LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition

LC-ERD is a self-alignment framework for LLMs that mines latent logical structures via consistency-regulated reward decomposition. Addresses three challenges: label noise from mimetic bias, coarse-grained supervision, and distributional collapse. Uses Variational Logic Potential and multi-agent value decomposition based on IGM principle.

ReasoningReinforcement learningAlignment
SIG
72
HYP
28