Page 67 of 148

AllHigh signalRecent

5885 articles

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

Experiment-as-Code Labs proposes a paradigm where scientific experiments are encoded as declarative configurations compilable to instrument APIs. AI agents formulate hypotheses, a systems layer performs program analysis and orchestration, then experiments execute via physical equipment control. General-purpose stack independent of science domain, lab type, or instrument.

AI Agents Papers Reasoning

SIG

HYP

Page 67 of 148

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

When Does Non-Uniform Replay Matter in Reinforcement Learning?

PersonaDual: Balancing Personalization and Objectivity via Adaptive Reasoning

Agentic AI Governance and Lifecycle Management in Healthcare

PriHA: A RAG-Enhanced LLM Framework for Primary Healthcare Assistant in Hong Kong

Black-Box Optimization From Small Offline Datasets via Meta Learning with Synthetic Tasks

Real-Time Aligned Reward Model beyond Semantics

Limitations of Sequence-Based Protein Representations for Parkinson's Disease Classification: A Leakage-Free Benchmark

Expectation and Acoustic Neural Network Representations Enhance Music Identification from Brain Activity

Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction

How Wrong Can Your Counterfactual Be? Quantifying Confounding Bias for Continuous Treatments without a Control Group

Embracing Biased Transition Matrices for Complementary-Label Learning with Many Classes

No Free Swap: Protocol-Dependent Layer Redundancy in Transformers

A Scalable Tool for Measuring Manner and Result Verbs in Developmental Language Research

Language Acquisition Device in Large Language Models

RTI-Bench: A Structured Dataset for Indian Right-to-Information Decision Analysis

Early Pruning for Public Transport Routing

ARROW: Augmented Replay for RObust World models

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

Spatiotemporal Robustness of Temporal Logic Tasks using Multi-Objective Reasoning

JSPG: Dynamic Dictionary Filtering via Joint Semantic-Pinyin-Glyph Retrieval for Chinese Contextual ASR

Effort as Ceiling, Not Dial: Reasoning Budget Does Not Modulate Cognitive Cost Alignment Between Humans and Large Reasoning Models

Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents

Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain

PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media

Can Heterogeneous Language Models Be Fused?

LEAF: A Living Benchmark for Event-Augmented Forecasting

Taming "Zombie'' Agents: A Markov State-Aware Framework for Resilient Multi-Agent Evolution

EmergentBridge: Improving Zero-Shot Cross-Modal Transfer in Unified Multimodal Embedding Models

AMATA: Adaptive Multi-Agent Trajectory Alignment for Knowledge-Intensive Question Answering

MiniGPT: Rebuilding GPT from First Principles

Forgetting is Competition: Rethinking Unlearning as Representation Interference in Diffusion Models

Beyond Transcripts: Iterative Peer-Editing with Audio Unlocks High-Quality Human Summaries of Conversational Speech

From Documents to Segments: A Contextual Reformulation for Topic Assignment

A Pilot Benchmark for NL-to-FOL Translation in Planetary Exploration

UxSID: Semantic-Aware User Interests Modeling for Ultra-Long Sequence

Empowering VLMs for Few-Shot Multimodal Time Series Classification via Tailored Agentic Reasoning

Strategic Exploitation in LLM Agent Markets: A Simulation Framework for E-Commerce Trust