Page 46 of 144

AllHigh signalRecent
5732 articles
arXiv cs.LG·

When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction

Diagnostic study on offline model selectors for edX dropout prediction. Authors identify three failure causes (mismatched learner, non-predictive state, label shift) via three stages: oracle ceiling via k-NN, BC/DQN/CQL evaluation, state ablation. Across 5 models, oracle gains 9.7 accuracy points, but learners plateau due to local representational ambiguity.

EvalsReinforcement learningBenchmarks
SIG
72
HYP
15
arXiv cs.LG·

A Goal-Set Characterization of Task Composition in the Boolean Task Algebra

Boolean Task Algebra (BTA) enables zero-shot task composition in RL. Authors prove that in deterministic MDPs, optimal Q-value functions collapse to universal and empty tasks, making logarithmic base tasks redundant. They propose a goal-set-based composition method reducing learning and composition costs while maintaining policy performance across tabular, visual, and continuous-control domains.

Reinforcement learningReasoningPapers
SIG
72
HYP
15
arXiv cs.LG·

Early Detection of Alzheimer's Disease Using Explainable Machine Learning on Clinical Biomarkers: A Multi-Class Classification Study Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) Dataset

XGBoost model for three-class Alzheimer's detection (normal cognition, mild cognitive impairment, Alzheimer's) on 1,641 ADNI subjects. Macro AUC-ROC 0.983 in cross-validation, 0.982 on test set. SHAP analysis identifies CDR Global as dominant predictor for NC/MCI, CDR-SB and MMSE for AD.

EvalsBenchmarksPapers
SIG
72
HYP
18
arXiv cs.AI·

Simulate, Reason, Decide: Scientific Reasoning with LLMs for Simulation-Driven Decision Making

MechSim is a neuro-symbolic reasoning framework enabling LLM agents to reason about mechanisms, assumptions, and execution behavior of scientific simulators. It represents simulators through structured schemas capturing assumptions, variables, and mechanism dependencies, generating evidence-grounded explanations linking simulator outcomes to underlying mechanisms rather than treating them as black boxes.

ReasoningAI AgentsPapers
SIG
72
HYP
25
arXiv cs.AI·

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

Study on affinity-based reinforcement learning to instill virtuous behavior in AI agents. Researchers test this technique in Fog of Love, a complex multi-agent environment where two agents must balance individual competition and relational cooperation. Localized affinities improve performance and make agent behavior interpretable.

Multi-agentReinforcement learningAlignment
SIG
72
HYP
25
arXiv cs.CL·

Computational conceptual history of scientific concepts: From early digital methods to LLMs

Survey paper positioning LLMs within the longer history of computational approaches to concept analysis in history, philosophy, and sociology of science. Examines what LLMs add to prior methods (early digital methods, distributional approaches, lexical semantic change detection) and persistent methodological challenges: corpus construction, operationalization, evaluation and interpretation.

PapersBenchmarks
SIG
72
HYP
15