Page 26 of 139

AllHigh signalRecent
5530 articles
arXiv cs.CL·

Chain-based Adaptive Reconfiguration Over Lattices for Hallucination Reduction

CAROL is a probabilistic framework for test-time hallucination reduction in LLMs. It defines semantic uncertainty based on consistency between generated responses and trusted context, formulating mitigation as a Markov chain accept-reject process with convergence guarantees. Results on QA and multi-agent reasoning benchmarks show significant hallucination reduction.

ReasoningAI safetyAlignment
SIG
75
HYP
15
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> langfuse /</span> langfuse

Langfuse is an open-source LLM engineering platform providing observability, metrics, evals, prompt management, and playground. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM. Y Combinator W23 graduate.

Open sourceToolsEvals
SIG
75
HYP
25
arXiv cs.AI·

Advancing Creative Physical Intelligence in Large Multimodal Models

MM-CreativityBench, a new benchmark, evaluates large multimodal models' ability to solve creative problems by identifying non-obvious object uses in physically constrained environments. Current LMMs fail due to insufficient grounded exploration and hallucinations. Affordance-grounded alignment via Direct Preference Optimization reduces these errors and improves entity selection.

BenchmarksVisionReasoning
SIG
75
HYP
25
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> mozilla /</span> cargo-vet

Mozilla releases cargo-vet, a supply-chain security tool for Rust. It enables auditing and validating Rust dependencies before production use.

Open sourceAI safetyTools
SIG
75
HYP
15
GitHub Trending·

<svg aria-hidden="true" data-component="Octicon" height="16" viewBox="0 0 16 16" version="1.1" width="16" data-view-component="true" class="octicon octicon-repo mr-1 tmp-mr-1 color-fg-muted"> <path d="M2 2.5A2.5 2.5 0 0 1 4.5 0h8.75a.75.75 0 0 1 .75.75v12.5a.75.75 0 0 1-.75.75h-2.5a.75.75 0 0 1 0-1.5h1.75v-2h-8a1 1 0 0 0-.714 1.7.75.75 0 1 1-1.072 1.05A2.495 2.495 0 0 1 2 11.5Zm10.5-1h-8a1 1 0 0 0-1 1v6.708A2.486 2.486 0 0 1 4.5 9h8ZM5 12.25a.25.25 0 0 1 .25-.25h3.5a.25.25 0 0 1 .25.25v3.25a.25.25 0 0 1-.4.2l-1.45-1.087a.249.249 0 0 0-.3 0L5.4 15.7a.25.25 0 0 1-.4-.2Z"></path> </svg> <span data-view-component="true" class="text-normal"> modelscope /</span> FunASR

FunASR is an industrial-grade speech recognition toolkit supporting 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

VoiceOpen sourceTools
SIG
75
HYP
25
arXiv cs.AI·

Towards trustworthy agentic AI: a comprehensive survey of safety, robustness, privacy, and system security

Comprehensive survey on trustworthy agentic AI systems (LLMs augmented with planning, tool use, memory). Examines safety, robustness, privacy, and system security. Proposes unified metrics, benchmarks, and stage-targeted mitigation strategies across agent workflows. Identifies open challenges: self-evolving agents, runtime verification, privacy-preserving personalization.

AI AgentsAI safetyBenchmarks
SIG
75
HYP
20
arXiv cs.CL·

Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation

Method to improve consistency in automated labeling pipelines for content moderation. Authors propose an AI-driven workflow where an LLM writes detailed per-category constitutions (harassment, hate speech, non-violent crime), then a frontier LLM interprets them to generate golden labels. Result: 57x reduction in cross-model inconsistency vs paragraph definitions.

EvalsAI safetyAlignment
SIG
75
HYP
15