Page 51 of 144

AllHigh signalRecent
5744 articles
arXiv cs.AI·

HADT: A Heterogeneous Multi-Agent Differential Transformer for Autonomous Earth Observation Satellite Cluster

Novel transformer-based architecture for autonomous resource management in heterogeneous satellite clusters (optical and SAR). Uses model-free reinforcement learning for real-time decision-making in Earth Observation missions. Demonstrates significant performance improvements and transferability across varying cluster sizes.

Multi-agentReinforcement learningReasoning
SIG
72
HYP
15
Reddit r/LocalLLaMA·

Speed difference between Windows 11 and Linux with llama.cpp: a myth when using medium and large MoE models

llama.cpp benchmark comparing Windows 11 and Linux (Ubuntu 26.04) on Nvidia GPU (RTX 5080 + 2× RTX 5060 Ti). No significant performance difference: Qwen 3.5 122B achieves PP 300/TG 28 (Windows) vs PP 290/TG 28.5 (Linux); Qwen 3.5 397B: PP 140/TG 16 vs PP 150/TG 15.2. Tests repeated 4 times with recent llama.cpp including VRAM optimization.

LlamaQwenBenchmarks
SIG
72
HYP
15
Reddit r/MachineLearning·

What I learned building a debugger for PyTorch training loops and how it changed how I think about failure diagnosis [D]

Developer built NeuralDBG, a PyTorch debugger that automatically detects training failures (vanishing/exploding gradients, data anomalies). Key insight: failures are layer-localized, not global. Effective monitoring: gradient norm transitions per layer rather than raw histograms. Open-source tool available on PyPI.

ToolsCode generationOpen source
SIG
72
HYP
28
arXiv cs.AI·

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

Redpanda introduces an Agentic Data Plane architecture using out-of-band metadata channels to enforce security policies, data classifications, and behavioral constraints outside the agent's read/write path. These channels prevent hallucinations and adversarial manipulation while maintaining tamper-proof audit trails. Demonstrated with a multi-agent portfolio rebalancing system.

AI AgentsMulti-agentAI safety
SIG
72
HYP
28
arXiv cs.AI·

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

The Cognitive Categorical Transformer (CCT), a 306M-parameter model augmenting GPT-2 Small, incorporates category-theoretic and cognitive-science-inspired components. On WikiText-103, CCT achieves 21.27 validation perplexity versus 24.19 for GPT-2 Small baseline, a 12% relative reduction (2.92 PPL). Ablations show simplicial message passing accounts for 84% of the improvement.

GPTPapersBenchmarks
SIG
72
HYP
25