Page 74 of 149

AllHigh signalRecent
5929 articles
arXiv cs.AI·

ReTAMamba: Reliability-Aware Temporal Aggregation with Mamba for Irregular Clinical Time Series Prediction

ReTAMamba is a Mamba-based model for predicting irregular clinical time series. It estimates observation reliability from missingness and elapsed time, integrates multi-resolution information via Chronological Weaving, and uses a budgeted token router. On MIMIC-IV, eICU, and PhysioNet 2012, it improves AUPRC by 7.51%, 7.80%, and 10.15% respectively.

BenchmarksPapersReasoning
SIG
72
HYP
18
arXiv cs.AI·

Fre-Res: Frequency-Residual Video Token Compression for Efficient Video MLLMs

Fre-Res introduces adaptive video-token compression for video MLLMs. The framework separates spatial details (high-fidelity anchors) from temporal evolution (residual-frequency tokens via 1D-DCT). A Spatial-Guided Absorber aligns frequency dynamics with visual embeddings. Results: near full-token performance with substantial reduction in token length across short and long-video benchmarks.

VisionVideo generationEvals
SIG
72
HYP
18
arXiv cs.AI·

Bayesian-Monte Carlo Schedule Updating for Construction Digital Twins: A Probabilistic Framework for Dynamic Project Forecasting

Bayesian-Monte Carlo probabilistic framework for dynamic construction project schedule updating. Models activity durations with lognormal distributions, updates them via Bayesian inference, and propagates uncertainty through Monte Carlo simulation. Demonstrates improved forecasting accuracy over deterministic CPM methods on PSPLIB benchmarks.

ReasoningBenchmarks
SIG
72
HYP
15
arXiv cs.AI·

Train the Trainers -- An Agentic AI Framework for Peer-Based Mental Health Support in Battlefield Environments

Agentic AI framework for peer-based mental health support in military operations. Recovered soldiers trained as peer facilitators supervise specialized AI agents (symptom triage, interventions, documentation) in air-gapped environments. Prototype developed with U.S. Army Health Center. Goal: reduce evacuations, accelerate care, maintain human oversight.

AI AgentsMulti-agentAI safety
SIG
72
HYP
28
arXiv cs.AI·

Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents

Reversa is a reverse documentation engineering framework converting legacy systems into operational specifications for AI agents. A multi-agent pipeline extracts implicit business rules, synthesizes architecture, and generates traceable specifications with confidence marking. Case study: COBOL-to-Go ATM migration producing 517 claims, 10 identified gaps, and 53 Gherkin scenarios.

AI AgentsMulti-agentCode generation
SIG
72
HYP
25
arXiv cs.AI·

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

Vision-OPD introduces regional-to-global self-distillation to improve fine-grained visual understanding in MLLMs. The framework transfers the model's privileged perception on evidence-centered crops to its full-image policy via KL divergence minimization between token distributions. Competitive results on fine-grained visual understanding benchmarks without external models or ground-truth labels.

VisionReinforcement learningBenchmarks
SIG
72
HYP
18
arXiv cs.CL·

Multilingual OCR-Aware Fine-Tuning and Prompt-Guided Chain-of-Thought Reasoning for Multimodal Large Language Models

Multilingual OCR-aware fine-tuning framework for MLLMs combining synthetic OCR-to-translation data generation, LoRA-based SFT, and structured visual chain-of-thought reasoning. Significantly improves extraction of small, blurred, occluded text on receipts, menus, documents under degraded visual conditions. Outperforms GPT-5 and Gemini on OCR grounding and hallucination reduction.

VisionReasoningFine-tuning
SIG
72
HYP
28