Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Gao, Zuan"'
Visual text generation has significantly advanced through diffusion models aimed at producing images with readable and realistic text. Recent works primarily use a ControlNet-based framework, employing standard font text images to control diffusion m
Externí odkaz:
http://arxiv.org/abs/2407.11502
In text recognition, self-supervised pre-training emerges as a good solution to reduce dependence on expansive annotated real data. Previous studies primarily focus on local visual representation by leveraging mask image modeling or sequence contrast
Externí odkaz:
http://arxiv.org/abs/2405.05841
Scene text images contain not only style information (font, background) but also content information (character, texture). Different scene text tasks need different information, but previous representation learning methods use tightly coupled feature
Externí odkaz:
http://arxiv.org/abs/2405.04377