arXiv cs.CL · 1 June 2026

CanLegalRAGBench: Evaluating Retrieval-Augmented Generation on Canadian Case Law

In three lines: CanLegalRAGBench is an evaluation benchmark for RAG systems applied to Canadian case law, built from realistic queries and expert-annotated answers. The study finds that open-source embedding models are competitive with closed-source alternatives, but that 8-29% of generated answers contain hallucinations, i.e. claims not supported by the retrieved documents.

RAG Embeddings Evals Benchmarks AI safety

Summary generated by Claude — human-verified
