Page 71 of 148

5898 articles

LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning

LMAC uses LLM reasoning to design communication protocols for multi-agent reinforcement learning (MARL), enabling agents to reconstruct the underlying state uniformly and accurately. The approach iteratively refines protocols using an explicit state-awareness criterion. Experiments on MARL benchmarks demonstrate substantial performance gains over prior baselines.

Multi-agent · Reinforcement learning · Reasoning

arXiv cs.CL·May 19

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

CodeBind introduces a multimodal alignment framework built on shared-specific compositional codebooks. The method decomposes representations into shared semantic components and modality-specific components. It is validated across 9 modalities (text, image, video, audio, depth, thermal, tactile, 3D point cloud, EEG) and achieves state-of-the-art performance in classification and retrieval tasks.

Embeddings · Vision · Robotics

LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

Divergence-Suppressing Couplings for Rectified Flow

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning

Responsible Federated LLMs via Safety Filtering and Constitutional AI

Factual Inconsistencies in Multilingual Wikipedia Tables

Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models

Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models

Early Stopping Chain-of-thoughts in Large Language Models

When TableQA Meets Noise: A Dual Denoising Framework for Complex Questions and Large-scale Tables

We Think, Therefore We Align LLMs to Helpful, Harmless and Honest Before They Go Wrong

NeuSymMS: A Hybrid Neuro-Symbolic Memory System for Persistent, Self-Curating LLM Agents

Multi-Party Multi-Objective Optimization as Consensus Search: Runtime Analysis of Cross-Party Recombination

TTE-Flash: Accelerating Reasoning-based Multimodal Representations via Think-Then-Embed Tokens

Automated Coding of Communication Data Using ChatGPT: Consistency Across Subgroups

Evaluating Language Models' Evaluations of Games

Unlocking the Potential of Diffusion Language Models through Template Infilling

QQJ: Quantifying Qualitative Judgment for Scalable and Human-Aligned Evaluation of Generative AI

Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning

Reasoning Before Diagnosis: Physician-Inspired Structured Thinking for ECG Classification

LISTEN to Your Preferences: An LLM Framework for Multi-Objective Selection

VolTA-3D: Self-Supervised Learning for Brain MRI using 3D Volumetric Token Alignment

Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning

GraphMind: Theorem Selection and Conclusion Generation Framework with Dynamic GNN for LLM Reasoning

Distinguishable Deletion: Unifying Knowledge Erasure and Refusal for Large Language Model Unlearning

CatalyticMLLM: A Graph-Text Multimodal Large Language Model for Catalytic Materials

CAREBench: Evaluating LLMs' Emotion Understanding by Assessing Cognitive Appraisal Reasoning

Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation

ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language

From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning

"The Whole Is Greater Than the Sum of Its Parts": A Compatibility-Aware Multi-Teacher CoT Distillation Framework

Dynamics of collective creativity in AI art competitions

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

Capturing LLM Capabilities via Evidence-Calibrated Query Clustering

New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions

Prompt Compression in Diffusion Large Language Models: Evaluating LLMLingua-2 on LLaDA

Training data attribution in diffusion models via mirrored unlearning and noise-consistent skew

One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception

Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation