Page 79 of 149

AllHigh signalRecent
5940 articles
arXiv cs.AI·

Hierarchical Two-Stage Framework for Environment-Aware Long-Horizon Vessel Trajectory Prediction

Hierarchical two-stage framework for long-horizon vessel trajectory prediction under real ocean conditions. Combines long-term predictor with short-term Spatio-Temporal Graph Transformer on discretized maritime cells. Environmental module integrates currents, wind, wave height via cross-modal attention. Results: 25% improvement in ADE, 17% in FDE on Australian CTS data.

ReasoningBenchmarksPapers
SIG
72
HYP
15
arXiv cs.AI·

LAST-RAG: Literature-Anchored Stochastic Trajectory Retrieval-Augmented Generation for Knowledge-Conditioned Degradation Model Selection

LAST-RAG proposes a method for selecting stochastic degradation models to estimate remaining useful life (RUL). It combines observed trajectories and domain context via retrieval from a local evidence bank, with RCRUS mechanism to prevent premature model elimination. Experiments show outperformance versus statistical and prognostic baselines.

RAGReasoningBenchmarks
SIG
72
HYP
15
arXiv cs.AI·

Modelling Customer Trajectories with Reinforcement Learning for Practical Retail Insights

Reinforcement learning framework for predicting customer trajectories in retail spaces. RL-based approach outperforms TSP/PNN heuristics (average 28% deviation from shortest paths) by modeling bounded rationality. Validated on real convenience store data: RL predictions better align with observed behavior, more accurate impulse purchase rates and shelf traffic estimates, enabling practical layout optimization.

Reinforcement learningAI AgentsBusiness
SIG
72
HYP
18
arXiv cs.CL·

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

HINT-SD proposes targeted self-distillation for training long-horizon LLM agents. The method uses full-trajectory hindsight to identify failure-relevant actions and applies feedback-conditioned distillation only on targeted action spans. On BFCL v3 and AppWorld, it improves over dense per-turn feedback baselines by up to 18.80% while achieving 2.26× lower time per training step.

AI AgentsReinforcement learningReasoning
SIG
72
HYP
18
arXiv cs.CL·

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

CodeBind introduces a multimodal alignment framework using shared-specific compositional codebooks. The method decomposes representations into semantic shared components and modality-unique components, validated across 9 modalities (text, image, video, audio, depth, thermal, tactile, 3D point cloud, EEG) achieving state-of-the-art performance in classification and retrieval tasks.

EmbeddingsVisionRobotics
SIG
72
HYP
25
arXiv cs.AI·

Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation

CGPO (Curriculum Group Policy Optimization) improves text-to-image model training via adaptive curriculum based on reward variance. Method prioritizes partially-mastered prompts (high variance) and balances categories through proportional fairness optimization. Gains validated on GenEval, T2I-CompBench++, DPG Bench.

Image generationReinforcement learningBenchmarks
SIG
72
HYP
28
arXiv cs.CL·

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning

AdaSwitch proposes a cloud-local collaborative paradigm where a local agent (small LLM) handles simple tasks and requests assistance from a cloud agent (large LLM) for complex reasoning. The adaptive mechanism detects local errors and dynamically switches. Evaluation on 7 benchmarks (mathematical reasoning, complex QA) shows performance improvement with reduced computational overhead.

AI AgentsMulti-agentReasoning
SIG
72
HYP
25
arXiv cs.AI·

CommitDistill: A Lightweight Knowledge-Centric Memory Layer for Software Repositories

CommitDistill is an open-source Python prototype extracting typed knowledge units (Facts, Skills, Patterns) from local git history via deterministic regex and exposing them through a TF-IDF retriever. Tested on 5 repositories (25k commits), it achieves 0.750 hit-rate at 256-character budget versus 0.333 for BM25. No statistically detectable improvement on time-travel bug-fixes in LLM-as-judge evaluation.

Code generationRAGAI Agents
SIG
72
HYP
18