Page 47 of 144

AllHigh signalRecent
5735 articles
arXiv cs.AI·

From Long News to Accurate Forecast: Importance-Aware Fusion and PRM-Guided Reflection for Time Series Forecasting

Novel framework combining importance-aware news compression and process-level retrieval supervision for time series forecasting. A reward model estimates each article's forecasting utility for sequential fusion, while a PRM ranks supplementary-news candidates based on error profile. Experiments on finance, energy, traffic, and bitcoin benchmarks show improved accuracy and fewer refinement iterations.

LlamaReasoningRAG
SIG
72
HYP
28
arXiv cs.CL·

The Ghost Annotator: a Framework to Explore Human Label Variation in Content Moderation through Conformal Prediction

Framework combining conformal prediction and collaborative filtering-style annotator representation to analyze LLM behavior against human annotators in content moderation. Introduces Ghost Prediction metric to quantify model-human divergences. Evaluation across 4 LLMs and 4 datasets shows larger models more confident on texts with no human alignment, revealing structural demographic bias.

EvalsAI safetyAlignment
SIG
72
HYP
18
arXiv cs.LG·

Representational Capacity: Geometric Limits on Feature Representation in Transformer Language Models

Theoretical study on geometric limits of feature representation in transformers. Authors establish a framework based on linear representation and superposition hypotheses, showing representational capacity depends on vectors-to-dimensions ratio (k/d) rather than raw count. Analysis of dozens of open-source models reveals two classes based on orthogonality constraint ε.

PapersReasoningBenchmarks
SIG
72
HYP
15
arXiv cs.AI·

ChatHealthAI: Aligning Electronic Health Record Representations with Large Language Models for Grounded Clinical Reasoning

ChatHealthAI aligns structured EHR representations from a pretrained EHR foundation model with a frozen LLM's semantic space via a task-aware resampler. The multimodal framework integrates longitudinal patient representations with refined clinical event descriptions, improving interpretable clinical reasoning while maintaining competitive predictive performance on the EHRSHOT benchmark.

RAGReasoningEvals
SIG
72
HYP
18
arXiv cs.CL·

Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions

Researchers propose an agent economy where AI agents self-coordinate through auctions and payment exchanges without centralized control, inspired by Hayek's economic theory. This approach generates emergent multi-step reasoning strategies and outperforms baselines on five tasks including mathematical reasoning, financial research, and distributed-system optimization.

Multi-agentAI AgentsReasoning
SIG
72
HYP
35
arXiv cs.LG·

Auditable Climate Risk Intelligence from Fragmented ESG Data: Deterministic Orchestration and Imbalance-Aware Learning for Scope 1-3 Validation

Deterministic orchestration framework for validating fragmented ESG data (Scope 1-3) with temporal anomaly detection, imbalance-aware ensemble learning, and audit provenance tracing. Synthetic benchmark calibrated against GHG Protocol, PCAF, ISSB standards. Evaluation on classification, calibration, and provenance chain completeness metrics.

BenchmarksEvalsReinforcement learning
SIG
72
HYP
15