May 2026

3149 articles

Towards Ubiquitous Mapping and Localization for Dynamic Indoor Environments

UbiSLAM presents a real-time mapping and localization system for dynamic indoor environments using a network of fixed RGB-D cameras. This approach reduces computational load on robots and improves navigation accuracy and human-robot interactions, while requiring automatic calibration and optimized communication protocols to handle blind spots.

Robotics Vision

SIG

HYP

May 2026

Towards Ubiquitous Mapping and Localization for Dynamic Indoor Environments

Beyond Inference-Time Search: Reinforcement Learning Synthesizes Reusable Solvers

The Hidden Cost of Contextual Sycophancy: an AI Literacy Intervention in Human-AI Collaboration

Focused Forcing: Content-Aware Per-Frame KV Selection for Efficient Autoregressive Video Diffusion

Same Signal, Different Semantics: A Cross-Framework Behavioral Analysis of Software Engineering Agents

Improved Baselines with Representation Autoencoders

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs

CommitDistill: A Lightweight Knowledge-Centric Memory Layer for Software Repositories

The Expressive Power of Low Precision Softmax Transformers with (Summarized) Chain-of-Thought

RAG-based EEG-to-Text Translation Using Deep Learning and LLMs

Trust No Tool: Evaluating and Defending LLM Agents under Untrusted Tool Feedback

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

Machine Unlearning for Masked Diffusion Language Models

Privacy Preserving Reinforcement Learning with One-Sided Feedback

FastOCR: Dynamic Visual Fixation via KV Cache Pruning for Efficient Document Parsing

SomaliWeb v1: A Quality-Filtered Somali Web Corpus with a Matched Tokenizer and a Public Language-Identification Benchmark

Context Memorization for Efficient Long Context Generation

A Simplex Witness Certificate for Constant Collapse in Variational Autoencoders

SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning

Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models

PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries

HyperPersona: A Multi-Level Hypergraph Framework for Text-Based Automatic Personality Prediction

Compress the Context, Keep the Commitments: A Formal Framework for Verifiable LLM Context Compression

Fixed External Cameras as Common Prior Maps for Active 3D Scene Graph Generation

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models

An Empirical Study of Privacy Leakage Chains via Prompt Injection in Black-Box Chatbot Environments

Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

OpenJarvis: Personal AI, On Personal Devices

Parameterized 4-Qubit EWL Quantum Game Circuits with Dirac-Solow-Swan Hamiltonian Integration for Quadruple Helix Disruptive Innovation Recommender Systems

Improving Spatio-Temporal Residual Error Propagation by Mitigating Over-Squashing

UCSF-PDGM-VQA: Visual Question Answering dataset for brain tumor MRI interpretation

Confidence-Gated Robot Autonomy: When Does Uncertainty Actually Help?

Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users

RAGA: Reading-And-Graph-building-Agent for Autonomous Knowledge Graph Construction and Retrieval-Augmented Generation

Quantum Sidecar Architectures for Hybrid AI Training and Inference: Stateful Protected Registers, Stateless Reset-and-Reprepare Circuits and Quantum Weight-State Outlook

FedSDR: Federated Self-Distillation with Rectification

Unveiling Memorization-Generalization Coexistence: A Case Study on Arithmetic Tasks with Label Noise

TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model

SAS: Semantic-aware Sampling for Generative Dataset Distillation

MARR: Module-Adaptive Residual Reconstruction for Low-Bit Post-Training Quantization

Predictive Prefetching for Retrieval-Augmented Generation

Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation

The Alpha Illusion: Reported Alpha from LLM Trading Agents Should Not Be Treated as Deployment Evidence

Training data attribution in diffusion models via mirrored unlearning and noise-consistent skew

Prompt Compression in Diffusion Large Language Models: Evaluating LLMLingua-2 on LLaDA

Confidence Geometry Reveals Trace-Level Correctness in Large Language Model Reasoning

Domain Transfer Becomes Identifiable via a Single Alignment

One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception

DCFold: Efficient Protein Structure Generation with Single Forward Pass

Attention Sinks and Outliers in Attention Residuals

PAREDA: A Multi-Accent Speech Dataset of Natural Language Processing Research Discussions

Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization

Temporal Aware Pruning for Efficient Diffusion-based Video Generation

Efficient Bilevel Optimization for Meta Label Correction in Noisy Label Learning

EmoMind: Decoding Affective Captions from Human Brain fMRI

CounterCount: A Diagnostic Framework for Counting Bias in Vision Language Models

TierCheck: Tiered Checkpointing for Fault Tolerance in Large Language Model Training

Algorithmic Cultivation: How Social Media Feeds Shape User Language

LLMs in Qualitative Research: Opportunities, Limitations, and Practical Considerations

One Model, Two Roles: Emergent Specialization in a Shared Recurrent Transformer

Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation

SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?

Systematic Evaluation of the Quality of Synthetic Clinical Notes Rephrased by LLMs at Million-Note Scale

Generative Artificial Intelligence for Literature Reviews

Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization

Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification

Attention-Guided Fusion of 1D and 2D CNNs for Robust ECG-Based Biometric Recognition

PEIRA: Learning Predictive Encoders through Inter-View Regressor Alignment

Bridging the Version Gap: Multi-version Training Improves ICD Code Prediction, Especially for Rare Codes

Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning

Bayesian-Monte Carlo Schedule Updating for Construction Digital Twins: A Probabilistic Framework for Dynamic Project Forecasting

UniAlign: A Model-Agnostic Framework for Robust Network Traffic Classification under Distribution Shifts

Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports

Visual Sculpting: Visually-Aligned Planning Representations for Long-Horizon Robot Clay Sculpting

Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review

BESplit: Bias-Compensated Split Federated Learning with Evidential Aggregation

A Distributional View for Visual Mechanistic Interpretability: KL-Minimal Soft-Constraint Principle

Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE