arXiv cs.AI

StyleText: A Large-Scale Dataset and Benchmark for Stylized Scene Text Inpainting

Signal: 75 · Hype: 25
In three lines: StyleText is a dataset of 28,518 image-mask-prompt triplets for scene text inpainting with style preservation. An automated pipeline combines LLM templating, Flux with KV-cache injection, OCR, polygon mask extraction, and FluxFill augmentation. A FluxFill+LoRA baseline substantially improves OCR accuracy while maintaining scene style consistency.
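To make the "image-mask-prompt triplet" and "polygon mask extraction" ideas concrete, here is a minimal sketch of turning one OCR-style text polygon into a binary inpainting mask and bundling it with an image path and a prompt. It is an illustration under assumptions, not the paper's pipeline: the `Triplet` class, the `polygon_to_mask` and `point_in_polygon` helpers, the file name, and the coordinates are all hypothetical, and the rasterization uses a plain even-odd ray-casting test in place of whatever the authors actually use.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]
Mask = List[List[int]]  # mask[row][col], 1 = region to inpaint


@dataclass
class Triplet:
    """Hypothetical container for one image-mask-prompt sample."""
    image_path: str
    mask: Mask
    prompt: str


def point_in_polygon(x: float, y: float, polygon: List[Point]) -> bool:
    """Even-odd ray casting: count edge crossings of a ray going right from (x, y)."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def polygon_to_mask(polygon: List[Point], width: int, height: int) -> Mask:
    """Rasterize a polygon by testing each pixel center against it."""
    return [
        [1 if point_in_polygon(col + 0.5, row + 0.5, polygon) else 0
         for col in range(width)]
        for row in range(height)
    ]


# Assumed example: a rectangular text region, as an OCR engine might report it.
text_polygon = [(1, 1), (4, 1), (4, 3), (1, 3)]
sample = Triplet(
    image_path="scene_0001.png",  # hypothetical file name
    mask=polygon_to_mask(text_polygon, width=6, height=4),
    prompt='a neon sign that reads "OPEN"',  # hypothetical prompt
)

for row in sample.mask:
    print("".join(str(v) for v in row))
```

Working with polygons rather than axis-aligned boxes matters for scene text, which is often rotated or perspective-distorted; the same helper accepts any simple polygon, not only rectangles.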
Benchmarks · Image generation · Vision · Papers

Summary generated by Claude — human-verified