Back to feed
arXiv cs.CL·

HKJudge: A Legal Discourse-Annotated Corpus for Interpreting What Courts Find, How They Reason, and What They Rule

Signal
82
Hype
15
In three linesHKJudge is the first sentence-level expert-annotated legal discourse corpus. It contains ~290k sentences and ~6.5M tokens from Hong Kong criminal judgments across all court levels, annotated by legal linguistics experts. Two benchmark tasks: rhetorical role classification (26 categories) and legal element extraction. Evaluation on BERT models, open-source and commercial LLMs.
Read source
Your take?
BenchmarksPapersFine-tuningCode generation

Summary generated by Claude — human-verified