StyleText: A Large-Scale Dataset and Benchmark for Stylized Scene Text Inpainting
Signal: 75
Hype: 25
In three lines:
StyleText is a dataset of 28,518 image-mask-prompt triplets for scene text inpainting with style preservation.
An automated pipeline combines LLM templating, Flux with KV-cache injection, OCR, polygon mask extraction, and FluxFill augmentation.
A FluxFill+LoRA baseline substantially improves OCR accuracy while maintaining scene style consistency.
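To make the "image-mask-prompt triplet" and "polygon mask extraction" steps concrete, here is a minimal sketch of how an OCR polygon could be rasterized into a binary inpainting mask and bundled with an image path and a prompt. The summary does not describe how StyleText implements this, so everything here is an assumption for illustration: the `Triplet` class, its field names, `polygon_to_mask`, and the even-odd ray-casting rule are all hypothetical and not taken from the source.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Triplet:
    """Hypothetical container for one sample; field names are assumed."""
    image_path: str
    mask: List[List[int]]  # 1 = region to inpaint, 0 = keep
    prompt: str


def point_in_polygon(x: float, y: float, polygon: List[Point]) -> bool:
    """Even-odd ray casting: count edge crossings to the right of (x, y)."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can cross it.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def polygon_to_mask(polygon: List[Point], width: int, height: int) -> List[List[int]]:
    """Rasterize a polygon by testing each pixel centre (col + 0.5, row + 0.5)."""
    return [
        [1 if point_in_polygon(col + 0.5, row + 0.5, polygon) else 0
         for col in range(width)]
        for row in range(height)
    ]


# Example: a rectangular text region, as an OCR detector might report it.
text_polygon = [(1, 1), (4, 1), (4, 3), (1, 3)]
sample = Triplet(
    image_path="scene_0001.png",  # placeholder path, not from the dataset
    mask=polygon_to_mask(text_polygon, width=6, height=4),
    prompt='a shop sign that reads "OPEN"',  # placeholder prompt
)
```

In practice a library rasterizer (for example Pillow's `ImageDraw.polygon`) would be faster. The pure-Python version is used here only to keep the sketch free of dependencies.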
Summary generated by Claude — human-verified