Page 59 of 147

AllHigh signalRecent
5853 articles
arXiv cs.AI·

Operationalizing Reconstructive Authority: Runtime Construction, Dependency Resolution, and Execution Gating in Autonomous Agent Systems

Paper on runtime enforcement of Reconstructive Authority (RAM) in autonomous agent systems. Introduces execution model with three states (admit/deny/halt), dynamic dependency resolution, and Recovery Loop integrating drift detection with execution control. Guarantees no action executes without constructible authority.

AI AgentsAI safetyReasoning
SIG
72
HYP
15
arXiv cs.LG·

Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette

Survey paper at the intersection of human-body communication (HBC) and federated learning for wearable sensor networks. Proposes taxonomy of FL deployments (intra-body, body-hub, cross-user, clinical-cloud) and introduces BODYFED-HBC reference architecture with scheduling algorithm and reproducible simulation combining public datasets with empirical HBC signal-loss models.

Benchmarks
SIG
72
HYP
15
arXiv cs.AI·

Beyond Predefined Learning Objects: A Thinking-Learning Interaction Model for Up-to-Date Autonomous Robot Learning

Thinking-learning interaction model for autonomous robots in changing environments. Thinking guides learning (change identification, evidence selection, planning), learning improves thinking (knowledge updates, action strategies). Results: recognition accuracy 0.419→0.845, action length 13.0→4.0, evidence selection rate 0.272→0.965.

RoboticsReinforcement learningReasoning
SIG
72
HYP
25
arXiv cs.LG·

Towards Verifiable Transformers: Solver-Checkable Circuit Explanations

Verifiable Transformers framework converts task-localized Transformer circuits into solver-checkable formal claims. Extracts circuits and verifies functional equivalence, edge necessity, invariance, and robustness via SMT encoding. Demonstrates direct verification on symbolic tasks and surrogate-mediated verification at GPT-2 scale with SMT-representable operators (Signed L1 BandNorm, sparsemax, LeakyReLU).

ReasoningAI safetyPapers
SIG
72
HYP
18
arXiv cs.LG·

Overcoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference

Heteroscedastic uncertainty-aware PINN framework for flood extent mapping from SAR data. Attention-Gated FNO-UNet with dynamic Warm-Start protocol and aleatoric uncertainty modeling prevents gradient divergence ("Physics Shock"). On Sen1Floods11: +25% relative IoU improvement over deterministic baselines, with calibrated confidence bounds for disaster response.

PapersReasoningEvals
SIG
72
HYP
18
arXiv cs.LG·

Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions

Riemannian archetypal analysis using data-driven pullback geometry on deformed star distributions. Combines interpretability of classical archetypal analysis with non-linear model expressiveness. Riemannian archetypal mapping (RAM) projects onto manifolds of geodesically convex archetype combinations. Experiments on MNIST demonstrate meaningful geodesics and geometry-aware denoising.

PapersReasoning
SIG
72
HYP
15
arXiv cs.LG·

ChainzRule: Sample-Efficient, Robust Deep Learning Across Tabular, NLP, and Vision Tasks

ChainzRule replaces standard activations with learnable polynomial layers governed by Differential Regularization (DREG), a Jacobian penalty computed analytically during forward pass. Tested across tabular, NLP, and vision: 85.71% on Pima Diabetes, 46.20% on SST-5 with frozen encoder (5% of RNTN training data), 55.79% on SST-5 fine-tuned BERT, +2.32% on CIFAR-10-C. Improves robustness and sample efficiency.

BenchmarksPapersReasoning
SIG
72
HYP
28
arXiv cs.AI·

Why We Need World Models for AGI: Where LLMs Fail and How World Models May Outperform

arXiv paper arguing LLMs fail at causal reasoning and long-horizon planning due to lack of world models. Authors introduce Latent Dynamics Inference (LDI) and Flux, a sequential reasoning environment specified in natural language. RL agents with explicit latent state access achieve 79% win rate vs 11% for LLMs, revealing failures in persistent state tracking.

ReasoningReinforcement learningPapers
SIG
72
HYP
35
arXiv cs.LG·

A lift for input-convex neural network training

Novel training method for input-convex neural networks (ICNNs) using an unconstrained hypernetwork that emits inter-layer weights. Approach inspired by parameter-extension lifts from PDE-constrained inverse problems, circumvents limitations of projected gradient descent and softplus reparametrization. Results on log-concave density estimation and convex-potential normalizing flows show improved convergence.

PapersReasoningReinforcement learning
SIG
72
HYP
15
arXiv cs.LG·

Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning

Agent-ToM is a learning-to-monitor framework using Theory-of-Mind reasoning to detect covert malicious behavior in autonomous LLM agents. It infers agent beliefs, intent hypotheses, and behavioral deviations from task-consistent baselines. Evaluated on SHADE-Arena and CUA-SHADE-Arena benchmarks, it outperforms ensemble monitoring baselines with a two-call reasoning pipeline.

AI AgentsAI safetyReasoning
SIG
72
HYP
28
arXiv cs.LG·

Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions

New arXiv paper proposing DINOSaur, a training-free method for continual anomaly detection in industrial settings. Combines frozen DINOv3 backbone, spatially-indexed coreset memory, and neighborhood-restricted anomaly scoring. Achieves zero forgetting, outperforms all baselines across 5 protocols, runs <100ms inference on Jetson Orin Nano with on-device adaptation <30s.

BenchmarksVision
SIG
72
HYP
25