Page 27 of 192

AllHigh signalRecent

7679 articles

(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models

Mosaic, a probabilistic weather forecasting model, addresses three spectral degradation failures in ML-based prediction: spectral damping, high-frequency aliasing, and residual leakage. With 214M parameters at 1.5° resolution, it matches models trained 6× finer and generates well-calibrated ensembles in 12s for 10-day forecasts on H100.

Papers Benchmarks Vision

SIG

HYP

Page 27 of 192

(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models

IVF-TQ: Streaming-Robust Approximate Nearest Neighbor Search via a Codebook-Free Residual Layer

DACA-GRPO: Denoising-Aware Credit Assignment for Reinforcement Learning in Diffusion Language Models

PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

FishBack: Pullback Fisher Geometry for Optimal Activation Steering in Transformers

DISA: Offline Importance Sampling for Distribution-Matching LLM-RL

TClone: Low-Latency Forking of Live GUI Environments for Computer-Use Agents

ProxyKV: Cross-Model Proxy Pruning for Efficient Long-Context LLM Inference

Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra

Scale Determines Whether Language Models Organize Representation Geometry for Prediction

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback

WebGameBench: Requirement-to-Application Evaluation for Coding Agents via Browser-Native Games

DACA-GRPO: Denoising-Aware Credit Assignment for Reinforcement Learning in Diffusion Language Models

Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

Evaluating AI Alignment in LLMs: Output Analysis of Value Priorities Across 75 Models with Human Benchmarking

UbuntuGuard: A Culturally-Grounded Policy Benchmark for Equitable AI Safety in African Languages

Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models

Toward Robust Multilingual Adaptation of LLMs for Low-Resource Languages

Beyond Execution: Static-Analysis Rewards and Hint-Conditioned Diffusion RL for Code Generation

SomaliWeb v1: A Quality-Filtered Somali Web Corpus with a Matched Tokenizer and a Public Language-Identification Benchmark

DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention

CAM-VFD: Cross-Attention Multimodal Video Forgery Detection

Can LLMs Refuse Questions They Do Not Know? Measuring Knowledge-Aware Refusal in Factual Tasks

Learning Reasoning Rewards from Expert Demonstrations with Inverse Reinforcement Learning

DynMuon: A Dynamic Spectral Shaping View of Muon

LoopQ: Quantization for Recursive Transformers

PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications

SVFSearch: A Multimodal Knowledge-Intensive Benchmark for Short-Video Frame Search in the Gaming Vertical Domain

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code

Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench

Adversarial Fragility and Language Vulnerability in Clinical AI: A Systematic Audit of Diagnostic Collapse Under Imperceptible Perturbations and Cross-Lingual Drift in Low-Resource Healthcare Settings

LiTS: A Modular Framework for LLM Tree Search

DocReward: A Document Reward Model for Structuring and Stylizing

Beacon: Single-Turn Diagnosis and Mitigation of Latent Sycophancy in Large Language Models

AdaptiveLoad: Towards Efficient Video Diffusion Transformer Training

Mechanistically Interpretable Neural Encoding Reveals Fine-Grained Functional Selectivity in Human Visual Cortex

Ensemble Monitoring for AI Control: Diverse Signals Outweigh More Compute

STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps