arXiv cs.CL

CanLegalRAGBench: Evaluating Retrieval-Augmented Generation on Canadian Case Law

Signal: 78
Hype: 15
In three lines

CanLegalRAGBench is an evaluation benchmark for RAG systems applied to Canadian law, built on realistic queries and expert-annotated answers. The study shows that open-source embedding models are competitive with closed-source alternatives, but it identifies hallucinations (content unsupported by the retrieved documents) in 8-29% of generated answers.
RAG · Embeddings · Evals · Benchmarks · AI safety

Summary generated by Claude — human-verified