Retrieval-Augmented Long-Context Translation for Cultural Image Captioning: Gators submission for AmericasNLP 2026 shared task
Signal
78
Hype
25
In three linesTwo-stage pipeline for captioning cultural images in Indigenous languages: Qwen2.5-VL generates Spanish intermediate caption, then Gemini 2.5 Flash produces target-language caption via retrieval-augmented prompting. Achieves 164.1% (Bribri), 131.7% (Guaraní), 122.6% (Orizaba Nahuatl) improvements over baseline. Overall winner of AmericasNLP 2026 shared task.Read source
Your take?
Summary generated by Claude — human-verified