arXiv cs.LG

Breaking the Solver Bottleneck: Training Task Generators at the Learnable Frontier

PROPEL is a framework that trains task generators via RL to produce problems of optimal difficulty for agent learning. A lightweight probe predicts the solver's pass rate without repeated rollouts, reducing evaluation to a single forward pass. On code and SWE tasks, the rate of learnable-frontier generation rises from 10.1% to 20% (Qwen2.5-3B) and from 9.8% to 19.6% (Qwen3.5-27B).

Reinforcement learning AI Agents Code generation

arXiv cs.LG·Jun 18

Enhanced Graph Neural Networks using K-Hop Gaussian Diffusion

K-Hop Gaussian (KHG) diffusion is a new method for enhancing GNNs. KHG preprocesses graph data with Gaussian-weighted multi-hop diffusion, balancing local and global propagation. It outperforms standard message passing, PPR, and Heat Kernel on benchmarks, especially on noisy graphs.

Benchmarks

arXiv cs.LG·Jun 18

Gaussian Mixture Attention: Linear-Time Sequence Mixing via Probabilistic Latent Routing

Gaussian Mixture Attention (GMA) replaces standard attention with probabilistic routing through K learned Gaussian mixture components. Queries and keys map to responsibility vectors in a shared latent space. GMA avoids explicit N×N matrix materialization, reducing memory complexity to O(NK) instead of O(N²). Competitive on long-context classification, but behind SDPA and Mamba on WikiText-103.

Reasoning Benchmarks Papers
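
To make the O(NK) claim in the GMA summary concrete, here is a minimal sketch of routing through K components. It is not the paper's implementation: the unit-variance Gaussians with equal priors, the function names, and the two-pass pooling are all assumptions made for illustration. The point is only that no N×N matrix is ever built.

```python
import math

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def responsibilities(x, centers):
    # Posterior over K components, assuming unit variance and equal priors
    # (an assumption of this sketch): softmax of -0.5 * squared distance.
    return softmax([-0.5 * sum((a - b) ** 2 for a, b in zip(x, c)) for c in centers])

def gma_mix(queries, keys, values, centers):
    num_components, d_v = len(centers), len(values[0])
    # Pass 1, O(N*K): pool the values into one summary per component.
    mass = [0.0] * num_components
    pooled = [[0.0] * d_v for _ in range(num_components)]
    for key, value in zip(keys, values):
        r = responsibilities(key, centers)
        for k in range(num_components):
            mass[k] += r[k]
            for j in range(d_v):
                pooled[k][j] += r[k] * value[j]
    summaries = [[p / mass[k] for p in pooled[k]] for k in range(num_components)]
    # Pass 2, O(N*K): each query reads the summaries via its own responsibilities.
    out = []
    for q in queries:
        r = responsibilities(q, centers)
        out.append([sum(r[k] * summaries[k][j] for k in range(num_components))
                    for j in range(d_v)])
    return out
```

Each output row is a convex combination of the value vectors, so identical values pass through unchanged, which is a quick sanity check for the routing.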

arXiv cs.LG·Jun 18

ASTRA: A Scalable Next-Generation ATCO Training Simulator with Autonomous Simpilots

ASTRA is an air traffic control training simulator automating pilot roles through speech recognition, instruction interpretation, and response generation. The system reduces Word Error Rate from 107.80% to 23.45% on Singaporean-accented aviation speech, and evaluates trainee radiotelephony communications achieving 91.7% accuracy, 88.2% brevity, and 86.9% completeness scores.

Voice Fine-tuning Evals

arXiv cs.LG·Jun 18

Artemis: Anatomy-Resolved inTervention for Eliminating Multimodal NeuroImage confounderS

Artemis is a causal framework for graph neural networks addressing demographic confounders (age, sex) in multimodal brain imaging (fMRI + DTI). The method applies causal interventions at each brain region independently to learn invariant representations. Tested on ADNI, OASIS, and HCP benchmarks, it improves disease diagnosis and classification tasks.

Papers Reasoning Alignment

arXiv cs.LG·Jun 18

Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression

A structural pruning framework for Mixture-of-Experts models that operates at the channel level rather than the expert level. The attribution-based method reformulates pruning as channel-score coverage maximization. Experiments on DeepSeek and Qwen models reach 50% structured pruning combined with 4-bit quantization, giving a 5.27× memory reduction on Qwen3-30B-A3B.

DeepSeek Qwen Benchmarks

arXiv cs.LG·Jun 18

Fisher Width: A Geometric Measure of Complexity on Statistical Manifolds

New geometric complexity measure called Fisher width, a Fisher-geometric analogue of Gaussian width on statistical manifolds. Replaces Euclidean geometry with Fisher information metric to capture local statistical curvature. Develops foundational theory with generalization bounds and computable estimators, validated on MNIST.

Papers Benchmarks Evals

arXiv cs.LG·Jun 18

SAGE: Retain-Aware Post-Hoc Sanitization of Final Unlearning Vector

SAGE is a post-hoc method to improve selective unlearning in LLMs. It corrects final update vectors by suppressing components damaging retention, without rerunning the original unlearning pipeline. Tested across multiple methods and scales, SAGE reduces the forget-retain trade-off.

Alignment Papers

arXiv cs.LG·Jun 18

Ghost Attractor Networks: Basin-Structured Dynamical Decoders for Closed-Loop Sequential Generation

Ghost Attractor Networks introduce an efficient dynamical decoder for sequential generation in robotics. With 2.3M parameters, the decoder matches the offline accuracy of a 1.07B-parameter Diffusion Transformer (462× fewer parameters, 32× lower latency). On LIBERO-10, phase conditioning improves success rate by 13.5 percentage points over an MLP baseline.

Code generation Robotics Reasoning

arXiv cs.LG·Jun 18

A Survey on Data-Driven Models for Soil Moisture Regression and Classification

Survey of AI-based models for soil moisture estimation and classification. Five categories compared: statistical time-series, geostatistical methods, classical ML, deep learning, and Bayesian approaches. Data-driven methods provide flexible alternatives to computationally expensive physics-based models.

Benchmarks Papers

arXiv cs.LG·Jun 18

Why SWAVE May Not Be All You Need: A Concept-Evolution Retrospective on Complex-Valued Recurrent Language Models

SWave is a complex-valued recurrent language model (169M parameters) trained on FineWeb-Edu. The paper documents its evolution across three phases, identifying structural failures (cos-domination collapse) and validating critical components (ComplexNorm, Wave Propagation Scan). Final PPL: 22.0 at step 89,861.

Papers Reasoning Benchmarks

arXiv cs.LG·Jun 18

Self-CTRL: Self-Consistency Training with Reinforcement Learning

Self-CTRL optimizes consistency between language models' self-explanations and behavior via reinforcement learning. On probabilistic reasoning tasks, the method improves R² correlation from 0.24 to 0.64. In constitutional AI, it increases refusal prediction from 36% to 92% and reduces HarmBench failure rate from 15.0% to 0.5%.

Reinforcement learning Alignment AI safety

arXiv cs.LG·Jun 18

SCOPE-FL: A Strategy-proof Chain-based Optimal pareto efficient Federated Learning System

SCOPE-FL introduces a hierarchical Federated Learning system that uses the Top Trading Cycle algorithm for client selection. The mechanism guarantees Pareto efficiency and strategy-proofness, with reward distribution via Shapley value approximation and blockchain execution. Evaluation on MNIST, Fashion-MNIST, and CIFAR-10 shows improvements over DA and IAS.
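
For readers unfamiliar with the Top Trading Cycle algorithm named in the SCOPE-FL summary, the sketch below shows the textbook version on a toy market. How SCOPE-FL maps clients and resources onto agents and items is not stated in the summary, so the `prefs` and `owner_of` structures here are hypothetical, and every agent is assumed to rank every item, including its own.

```python
def top_trading_cycles(prefs, owner_of):
    """prefs: agent -> items, best first; owner_of: item -> current owner."""
    remaining = set(prefs)
    assignment = {}

    def favorite(agent):
        # Most preferred item whose owner is still in the market.
        return next(item for item in prefs[agent] if owner_of[item] in remaining)

    while remaining:
        # Follow "point at the owner of my favorite item" until an agent repeats.
        path = []
        agent = next(iter(remaining))
        while agent not in path:
            path.append(agent)
            agent = owner_of[favorite(agent)]
        cycle = path[path.index(agent):]
        # Everyone in the cycle receives the item they point at, then leaves.
        for member in cycle:
            assignment[member] = favorite(member)
        remaining.difference_update(cycle)
    return assignment
```

In the classical setting with strict preferences, no agent can gain by misreporting its ranking, which is the strategy-proofness property the summary refers to.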

arXiv cs.LG·Jun 18

P$^2$CE: Model-Agnostic Plausible Pareto-Optimal Counterfactual Explanations

P²CE generates plausible Pareto-optimal counterfactual explanations for ML models. The algorithm uses isolation forests and SHAP values to balance feasibility, plausibility, and computational efficiency. Evaluated on 3 datasets, it outperforms existing methods in solution quality and speed.

Evals

arXiv cs.LG·Jun 18

Beyond Prediction: Tail-Aware Scheduling for LLM Inference

New LLM inference scheduler replacing explicit length prediction with lightweight statistical signals and dynamic priority boosting. Reduces P99 TTLT by 35-50% vs SRPT with perfect length knowledge, and TTFT by 34-47% across production and open-source traces.

Benchmarks Infrastructure Reasoning

arXiv cs.LG·Jun 18

TMR-GGNN: Credit Card Fraud Detection based on Time-Aware Multi-Relational Guided Graph Neural Network

TMR-GGNN, a time-aware multi-relational graph neural network, detects credit card fraud by modeling heterogeneous interactions between customers, merchants, devices, and IPs. The model combines temporal relational attention, contrastive learning, and a composite loss function (InfoNCE + Focal Loss) to handle imbalanced data and reduce false negatives.

Reinforcement learning
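
The composite loss mentioned in the TMR-GGNN summary can be sketched from the standard definitions of its two parts. The per-sample form, the mixing `weight`, and the default `gamma`, `alpha`, and `tau` values are assumptions for illustration; the summary does not say how the paper combines or tunes the terms.

```python
import math

def focal_loss(p, y, gamma=2.0, alpha=0.25):
    # p: predicted fraud probability in (0, 1); y: 1 for fraud, 0 for legitimate.
    p_t = p if y == 1 else 1.0 - p
    a_t = alpha if y == 1 else 1.0 - alpha
    # The (1 - p_t)^gamma factor down-weights examples that are already easy.
    return -a_t * (1.0 - p_t) ** gamma * math.log(p_t)

def info_nce(sim_pos, sim_negs, tau=0.1):
    # Negative log-softmax of the positive pair against all candidates.
    logits = [sim_pos / tau] + [s / tau for s in sim_negs]
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_denominator - logits[0]

def composite_loss(p, y, sim_pos, sim_negs, weight=0.5):
    # Hypothetical weighting: classification term plus scaled contrastive term.
    return focal_loss(p, y) + weight * info_nce(sim_pos, sim_negs)
```

A confidently missed fraud case (small `p` with `y = 1`) dominates the focal term, which is how this kind of loss targets false negatives on imbalanced data.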

arXiv cs.LG·Jun 18

What Does the Weight Norm Control in Grokking? Logit-Scale Mediation under Cross-Entropy

Study on grokking (the delayed transition from memorization to generalization). The authors show that weight norm does not directly control the grokking delay but acts through logit scale. Fixing the norm and varying output temperature, they recover 85% of the delay by matching logit scale. The effect is loss-dependent (cross-entropy vs MSE). Logit scale and softmax saturation are the proximal variables.

Papers Reasoning Evals

arXiv cs.LG·Jun 18

Structured Representation Learning with Locally Linear Embeddings and Adaptive Feature Fusion

A neuroscience-inspired RL framework that disentangles dynamics-specific and reward-specific features using locally linear embeddings (LLE) and adaptively fuses the representations via an attention mechanism. It improves learning efficiency on benchmark tasks compared to conventional RL approaches.

Reinforcement learning Reasoning Benchmarks

arXiv cs.LG·Jun 18

Quantum Annealing Enhanced Reinforcement Learning for Accurate Remaining Useful Lifetime Prediction

The QAQL framework couples quantum annealing with Q-learning for remaining useful life (RUL) prediction in predictive maintenance. Each Q-value update is encoded as a QUBO problem and solved on a D-Wave Advantage system. Validated on NASA C-MAPSS and fleet maintenance datasets, it shows statistically significant improvements over classical and quantum baselines.

Reinforcement learning Benchmarks Papers

arXiv cs.LG·Jun 18

PSyGenTAB: A Privacy-Preserving Framework for Synthetic Clinical Tabular Data Generation via Constrained Optimization

PSyGenTAB is a privacy-preserving framework for synthetic clinical tabular data generation, formulated as a constrained optimization problem and solved with the augmented Lagrangian method. It embeds configurable privacy constraints into training to preserve inter-feature clinical relationships and minority-class patterns while maintaining data utility for medical AI applications.

Benchmarks

arXiv cs.LG·Jun 18

CODEBLOCK: Learning to Supervise Code at the Right Granularity

CodeBlock is a structure-aware sparse supervision framework for code LLM fine-tuning. It selects syntactically coherent code blocks rather than isolated tokens, estimating utility via generalized cross-entropy and data-flow signals. On 6 code-generation benchmarks, CodeBlock outperforms full-token SFT while using only 1.9% of supervised response tokens.

Code generation Fine-tuning Papers

arXiv cs.LG·Jun 18

A Link between Shock-wave Theory and Symmetry-reduced Stochastic Gradient Descent for Artificial Neural Networks

Mathematical link established between shock-wave theory and symmetry-quotiented stochastic gradient descent dynamics for neural networks. After quotienting parameter symmetries and entropy coarse-graining, effective dynamics satisfy a viscous Hamilton-Jacobi equation. Applied to MLPs, CNNs, Transformers, and mean-field networks.

Papers Reasoning Reinforcement learning

arXiv cs.LG·Jun 18

DRIFT: Refining Instruction Data via On-Policy Data Attribution

DRIFT refines the SFT training data distribution using on-policy Influence Functions. The method uses model rollouts as validation targets to minimize the proximity gap and to correct gradient-norm bias. Experiments on 7B instruction and reasoning models show consistent improvements in the performance ceiling over existing curation baselines.

Fine-tuning Reinforcement learning Evals

arXiv cs.LG·Jun 18

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Sparse Autoencoders (SAEs) decompose activations into interpretable features, but this study shows that clamping a 'harmful' feature does not eliminate the behavior: it can recover via other residual pathways. Even with active intervention, 95.8% behavior recovery is achievable in refusal-steering, exposing a gap between feature-level control and behavioral completeness.

AI safety Alignment Evals

arXiv cs.LG·Jun 18

Neural Network Implementation of the Renormalization Group for Fault Diagnosis with Class Imbalance

RGNet, a neural network architecture based on the renormalization group, addresses class imbalance and multidimensional noise in fault diagnosis. The model hierarchically compresses the feature space and captures both local details and global patterns. Tested on the imbalanced AI4I dataset.

Papers Evals Benchmarks

arXiv cs.LG·Jun 18

ThousandWorlds: A benchmark for climate emulation of potentially habitable exoplanets

ThousandWorlds is an ML benchmark for climate emulation of potentially habitable exoplanets. The dataset contains ~1800 simulations from 5 global climate models mapping 8 planetary parameters to 3D atmospheric fields. Three nested subsets and two evaluation protocols test 7 baselines; GP-based methods outperform standard deep learning.

Benchmarks Papers Reasoning

arXiv cs.LG·Jun 18

LLMZero: Discovering Adaptive Training Strategies for RL Post-Training via LLM Agents

LLMZero uses LLM agents with tree search to discover adaptive RL training strategies. The system identifies that capacity parameters accumulate monotonically while regularization parameters oscillate. Across 4 GRPO tasks, discovered strategies outperform the base model by 9-140% and grid search by 6-15%.

Reinforcement learning AI Agents Reasoning

arXiv cs.LG·Jun 18

Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

The paper demonstrates that, on biomedical tabular data, measurement noise limits the advantage of nonlinear models (deep networks, gradient boosting) over linear regression. Degree-k interactions are attenuated by the k-th power of feature reliability, while linear components are attenuated only once. Analysis of 140 UK Biobank tasks confirms this noise signature.

Benchmarks Evals
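
The k-th power attenuation described above can be worked through numerically. The sketch assumes the classical measurement-error setting (independent zero-mean features, independent additive noise, and the same reliability r for every feature), under which the recoverable coefficient on a product of k noisy features shrinks by r^k. The function names are illustrative and not taken from the paper.

```python
def attenuation(reliability, degree):
    # Fraction of a degree-`degree` interaction effect that survives when each
    # participating feature has the given reliability (true / observed variance).
    if not 0.0 < reliability <= 1.0:
        raise ValueError("reliability must lie in (0, 1]")
    return reliability ** degree

def retained_signal(reliability, max_degree):
    # Map interaction degree -> surviving fraction, for degrees 1..max_degree.
    return {k: attenuation(reliability, k) for k in range(1, max_degree + 1)}
```

With a reliability of 0.8, a linear term keeps about 80% of its effect, a pairwise interaction about 64%, and a three-way interaction about 51%, so the part of the signal that only a nonlinear model could exploit erodes fastest.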

arXiv cs.LG·Jun 18

A Cross-Model VLM-Judge Protocol for Single-Image 3D Mesh Quality (and Why Cheap Proxies Fall Short)

Evaluation protocol for single-image-to-3D mesh quality using VLM judges (vision-language models). Authors demonstrate that cheap proxies (CLIP similarity, geometry validity stats) fail to correlate with perceived quality. Their VLM-judge protocol with position-bias correction achieves Cohen's kappa = 0.66 between two independent judge families.

Vision Evals Benchmarks
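
Cohen's kappa, the agreement statistic quoted in the summary above, corrects raw agreement for the agreement two judges would reach by chance. A minimal sketch of the standard two-rater formula follows; the label lists are made up for illustration and have nothing to do with the paper's data.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: share of items on which both judges give the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each judge's marginal rate, summed over labels.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    # Undefined (division by zero) if both judges always emit one identical label.
    return (observed - expected) / (1.0 - expected)
```

A value of 0 means agreement no better than chance and 1 means perfect agreement, which puts the reported 0.66 between the two judge families in context.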

arXiv cs.LG·Jun 18

Task-Restricted Symmetries in Recurrent Weight Space

Study of functional redundancy in single-layer tanh RNNs using ordered real Schur coordinates. Authors identify nonnormal couplings removable with minimal loss on specific tasks (copy, flip-flop, sine generation), revealing task-dependent approximate functional invariances rather than universal weight-space symmetries.

Papers Reasoning

arXiv cs.LG·Jun 18

The Illusion of Improvement: Reject Inference Strategies in Credit Scoring

Reject inference methods, used in credit scoring to correct survivorship bias, mask a structural failure: accuracy can improve while the ability to correctly reject defaulters collapses. The authors propose a controlled exploration strategy (approving 2-5% of rejected applicants) to diagnose this deterioration without strong statistical assumptions.

Benchmarks AI safety Evals

arXiv cs.LG·Jun 18

SFT Overtraining Predicts Rank Inversion via Entropy Collapse Under RLVR

Study shows SFT overtraining can invert model rankings during RLVR fine-tuning. On Qwen2.5-Coder-3B, increasing SFT depth raises pre-RL pass@1 but reduces GRPO pass@10 from 0.806 to 0.481. Pre-RL entropy positively correlates with RLVR outcomes (ρ=+0.69). Two-stage entropy-based diagnostic identifies high-risk checkpoints.

Reinforcement learning Fine-tuning Reasoning

arXiv cs.LG·Jun 18

Beyond AHI: An Interpretable Causal-Discovery-Guided Framework for Sleep Recovery in Connected Health

Causal framework for sleep recovery scoring from multimodal polysomnography. Uses DAG learning on two cohorts (MESA n=1540, MrOS n=825) to identify five physiological domains (respiratory burden, hypoxia, fragmentation, architecture, autonomic regulation). Sleep Recovery Score (SRS) achieves 2.5× stronger alignment with perceived recovery than standard AHI.

Papers Reasoning Evals

arXiv cs.LG·Jun 17

Sum-of-Squares Degree Barriers for the Reweighted-Hinge Method in Robust Halfspace Learning: A Christoffel-Function Characterization

Theoretical paper on Sum-of-Squares degree barriers for robust halfspace learning under malicious noise. The Christoffel function exactly characterizes corruption hidden from bounded-degree certificates. Proves a margin-degree tradeoff and a degree-2t algorithm achieving the frontier η^(1-1/2t).

Papers Reasoning AI safety

arXiv cs.LG·Jun 17

Rift: A Conflict Signature for Deception in Language Models

Researchers identify an internal signature of deception in language models: deceptive responses show 2.1-2.3x higher residual rank than naively false answers. This signature detects deception with 100% accuracy on GPT-2, Qwen2.5, and Phi-3, and transfers zero-shot across model families and languages (AUC 0.933-1.0).

AI safety Alignment Evals

arXiv cs.LG·Jun 17

Uncertainty Quantification of Engineering Structures by Polynomial Chaos Expansion and Multivariate Active Learning

Adaptive sequential sampling method for polynomial chaos expansion surrogate models, generalized for multiple quantities of interest. The approach balances input space exploration with exploitation of aggregated variance across outputs, improving surrogate accuracy and stability compared to Latin Hypercube Sampling.

Benchmarks Evals

arXiv cs.LG·Jun 17

Rethinking Groups in Critic-Free RLVR

arXiv paper on critic-free reinforcement learning for LLMs. Authors challenge the role of rollout groups in existing methods and propose negative token filtering to enable stable single-rollout training, improving performance on agentic tasks compared to group-based RL techniques.

Reinforcement learning Reasoning AI Agents

arXiv cs.LG·Jun 17

ProCUA-SFT Technical Report

ProCUA-SFT is a dataset of 3.1M step-level SFT samples generated automatically from 93K synthetic trajectories across 2,484 application combinations. Fine-tuning UI-TARS 7B on ProCUA-SFT achieves 45.0% on OSWorld, a +18.7 percentage-point improvement over the base model and +35% above AgentNet. The pipeline uses Kimi-K2.5 as task generator, precondition judge, and trajectory executor.

AI Agents Benchmarks Fine-tuning

arXiv cs.LG·Jun 17

The Critical Role of Model Selection in Causal Inference: A Comparative Analysis of Classification Models within the InferBERT Framework for Pharmacovigilance

InferBERT combines transformers with Do-calculus to detect causal adverse drug events in pharmacovigilance. Comparative study on AILF and TRAM benchmarks: BioBERT outperforms XGBoost, ALBERT, and Med-LLaMA. Finding: domain-specific pre-training outweighs model size.

Benchmarks Fine-tuning AI safety

arXiv cs.LG·Jun 17

MODE: Modality-Decomposed Expert-Level Mixed-Precision Quantization for MoE Multimodal LLMs

MODE is an expert-level mixed-precision quantization framework for MoE multimodal LLMs. It decomposes expert selection frequency by modality (vision/text) and filters redundant vision tokens to correct estimation biases. Results: <2.9% performance loss at W3A16.

Vision Benchmarks Papers

arXiv cs.LG·Jun 17

Towards Fast GNN Surrogates for CO2 Migration in Complex Geological Formations

GNN surrogate for CO₂ migration forecasting in complex geological formations. Model trained on SPE11A benchmark with anisotropic message-passing mechanism capturing directional transport. Produces competitive forecasts of gas saturation and liquid-phase density over extended forecasting horizons.

Benchmarks Papers

arXiv cs.LG·Jun 17

PowerOPD: Stabilizing On-Policy Distillation with Bounded Power Transformation

PowerOPD stabilizes on-policy distillation for LLMs by replacing unbounded log-ratio rewards with a Box-Cox power transformation. On 6 mathematical reasoning benchmarks with Qwen3, it achieves gains of +6.37 Avg@8 and +5.71 Pass@8 over vanilla OPD, and reduces wall-clock time by 59.2% and peak GPU memory by 23.1%.

Fine-tuning Reinforcement learning Benchmarks
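
The Box-Cox transform named in the PowerOPD summary is easy to state on its own. The sketch below shows only the standard transform; which ratio PowerOPD feeds into it and which exponent it uses are not given in the summary, so `lam` here is a free illustrative parameter.

```python
import math

def box_cox(x, lam):
    # Standard Box-Cox power transform for x > 0.
    if x <= 0:
        raise ValueError("x must be positive")
    if lam == 0:
        return math.log(x)  # limiting case as lam -> 0
    return (x ** lam - 1.0) / lam
```

For `lam > 0` the output can never fall below `-1 / lam` however small `x` becomes, whereas the plain logarithm diverges to minus infinity. That boundedness is the property the summary contrasts with unbounded log-ratio rewards.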

arXiv cs.LG·Jun 17

Counterfactual Optimization of Baseball Pitch Sequences and Estimation of Its Impact on Season-Level Statistics

Study using a Transformer model on MLB Statcast data to optimize baseball pitch sequences. Counterfactual analyses show that optimizing both final and setup pitches can improve seasonal pitcher statistics by over 1.0 K/9. The paper also offers practical insights on velocity-band-specific effective locations and the importance of pitch command.

Papers Benchmarks

arXiv cs.LG·Jun 17

MM++: Unsupervised Scale-Invariant Multilayer OOD Detection via Top-K Gated Feature Fusion

MM++ is an unsupervised, post-hoc method for out-of-distribution detection. It fuses intermediate layers selected by entropy density with the final representation using Ledoit-Wolf regularized covariance, requiring no auxiliary OOD data, fine-tuning, or architectural changes.

Evals AI safety

arXiv cs.LG·Jun 17

Discrete Autoregressive Transformer for Generative Mechanism Synthesis

Discrete autoregressive transformer for mechanism synthesis. Conditional sequence model with VAE latent and quantized joint coordinates. Trained on >1M mechanisms with Chamfer distance and DTW metrics. Mean Chamfer distance 0.0132, DTW 0.153 on held-out tests.

Code generation Benchmarks Papers
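
The two metrics reported for the mechanism-synthesis model can be sketched in a few lines. Definitions of Chamfer distance vary (squared vs plain distances, sum vs mean); this version uses the mean nearest-neighbor Euclidean distance in both directions, and the DTW uses absolute difference on 1-D sequences. Both choices are assumptions and may not match the paper's exact conventions.

```python
import math

def chamfer_distance(points_a, points_b):
    # Symmetric Chamfer: average distance to the nearest point, both directions.
    def mean_nearest(src, dst):
        return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)
    return mean_nearest(points_a, points_b) + mean_nearest(points_b, points_a)

def dtw_distance(seq_a, seq_b):
    # Classic dynamic-programming DTW with |a - b| as the local cost.
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(seq_a[i - 1] - seq_b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]
```

Chamfer distance ignores point order and compares curve shapes as sets, while DTW respects ordering but tolerates differences in timing, so the two numbers capture complementary aspects of how well a generated trajectory matches its target.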

arXiv cs.LG·Jun 17

Amortized Probabilistic Retrieval of Atmospheric CO2 from OCO-2 Spectra Using Deep Learning with Laplace Approximations and Normalizing Flows

Deep learning framework for retrieving atmospheric CO2 from NASA's OCO-2 satellite spectra. Uses Laplace approximations and normalizing flows for uncertainty quantification. Inference orders of magnitude faster than operational algorithms, with better-calibrated non-Gaussian posterior estimates.

Benchmarks Papers

arXiv cs.LG·Jun 17

Memory-Efficient Meta-Reinforcement Learning for Adaptive Safety-Critical Control in Adversarial Spacecraft Proximity Operations

Comparative study of three recurrent architectures (LSTM, GRU, Mamba) and two algorithms (PPO, SAC) for meta-reinforcement learning applied to input-constrained control barrier functions (ICCBF) in spacecraft proximity operations. Mamba + PPO outperforms other setups in safety, task completion, and fuel savings across cooperative and adversarial scenarios.

Reinforcement learning AI safety Robotics

arXiv cs.LG·Jun 17

MorphStrata: Layer-Specific Perturbations for Generating Morphence Students in Time-Series Moving Target Defense

MorphStrata enhances Moving Target Defense for time-series forecasting models via selective layer-specific stochastic noise injection. Tested on Transformer with FGSM, BIM and PGD attacks, the approach reduces adversarial RMSE by up to 97.97% on AEP data with training overhead <1%.

Benchmarks AI safety Papers

arXiv cs.LG·Jun 17

Credibility-Weighted Pricing of Autonomous Vehicle Liability Under Operational Design Domain Shift

Hierarchical Bayesian credibility framework for pricing autonomous vehicle liability under operational design domain shifts. Tested on 648 verified Waymo crashes (4 US cities, 116M miles): credibility weights moderate (0.12-0.46), partial pooling decisively outperforms no pooling, learned kernel advantage detectable at ~12 deployed cities.

AI safety Benchmarks Regulation
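
The credibility weights quoted in the summary above are easier to read next to the classical Bühlmann form, in which a local estimate is blended with a pooled one. This is a textbook stand-in and not the paper's hierarchical Bayesian model; the parameter `k` and the function names are hypothetical.

```python
def credibility_weight(exposure, k):
    # Buhlmann credibility Z = n / (n + k): more exposure means more trust in local data.
    return exposure / (exposure + k)

def credibility_rate(local_rate, pooled_rate, exposure, k):
    # Partial pooling: convex blend of the local rate and the pooled rate.
    z = credibility_weight(exposure, k)
    return z * local_rate + (1.0 - z) * pooled_rate
```

A weight between 0.12 and 0.46, as reported, would mean that under this kind of blend each city's own crash experience counts for less than half of its priced rate, with the remainder borrowed from the pool.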

arXiv cs.LG·Jun 17

Operator Boosting Produces Pareto-Efficient PDE Surrogates

Operator Boosting constructs compact neural-operator surrogates for PDEs via stagewise residual learning. Tested on FNO, DeepONet, and CNO across 30 benchmarks (PDEBench, APEBench), the method reduces parameters by 72–95% while improving accuracy on 21 dataset-architecture pairs and achieves Pareto gains on 7/10 PDE benchmarks.

Papers Benchmarks Code generation
