Page 25 of 139

AllHigh signalRecent
5526 articles
arXiv cs.CL·

S3Mem: Structured Spatiotemporal Scene-Event Memory for Long-Horizon Interactive Question Answering

S3MEM introduces a structured scene-event episodic memory framework for long-horizon interactive agents. The system structures trajectories into organized memory units and uses anchor-sensitive retrieval to improve spatiotemporal question answering. Evaluated on Crafter, Jericho, SciWorld, and ALFWorld, S3MEM outperforms Vanilla RAG and Graph-NoReader in accuracy while using fewer evidence tokens.

RAGAI AgentsReasoning
SIG
75
HYP
15
arXiv cs.CL·

Text-Preserving Lossy Text Compression: A Study of Strategic Deletion and LLM Reconstruction

Study of lossy semantic text compression where an encoder strategically deletes text parts and an LLM reconstructs original content. Benchmarks 6 deletion strategies (uniform, frequency, entropy, LP-optimized, hybrid) on BBC News. WordFreq provides best cost/performance ratio; semantic methods excel at moderate compression; QLoRA fine-tuning competes with Gemini 2.0 Flash.

BenchmarksReasoningFine-tuning
SIG
75
HYP
15
Reddit r/LocalLLaMA·

Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

Zai replaced the network architecture on a 1000-GPU cluster running GLM-5.1 from ROFT to ZCube (developed with Tsinghua and HarnetsAI). Results: switch/optical costs down 33%, GPU throughput up 15%, P99 first-token latency down 40.6%. ZCube removes the Spine layer for full bipartite interconnect, eliminating asymmetric traffic hotspots inherent to Prefill-Decode disaggregated inference.

InfrastructureReasoning
SIG
75
HYP
25
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> anthropics /</span> claude-code

Claude Code is an agentic coding tool in the terminal that understands your codebase and executes routine tasks, explains complex code, and handles git workflows through natural language commands.

ClaudeClaude CodeAI Agents
SIG
75
HYP
35
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> microsoft /</span> RAMPART

Microsoft releases RAMPART, a pytest-native safety and security testing framework for agentic AI applications. Enables evaluation of security and safety risks in multi-agent systems.

AI AgentsMulti-agentAI safety
SIG
75
HYP
15
arXiv cs.CL·

Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction

CAROL is a probabilistic framework for test-time hallucination reduction in LLMs. It defines semantic uncertainty based on consistency between generated responses and trusted context, formulating mitigation as a Markov chain accept-reject process with convergence guarantees. Results on QA and multi-agent reasoning benchmarks show significant hallucination reduction.

ReasoningAI safetyAlignment
SIG
75
HYP
15
arXiv cs.AI·

C-MIG: Multi-view Information Gain-based Retrieval-Augmented Generation for Clinical Diagnosis Reasoning

C-MIG introduces a multi-view information gain-based RAG framework for clinical diagnosis reasoning. It replaces exact-match binary rewards with information gain estimation from two views (retrieved documents and document refinement) to better supervise LLM reasoning. Experiments on four medical benchmarks show improvements over RAG-RL baselines in both in-domain and out-of-domain settings.

RAGReinforcement learningReasoning
SIG
75
HYP
15