ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language
Signal: 72
Hype: 18
In three lines:
An annotation framework that uses Llama3.1 as a teacher model to tag Polish medical texts. The corpus spans five clinical categories (Radiology, Oncology, Cardiology, Hypertension, Pathology). DistilBERT achieves F1 > 0.80 per category while being 500× smaller than the LLM, needing 300× less GPU VRAM, and running inference several hundred times faster.
Summary generated by Claude — human-verified