Diffusion-Based Ukrainian Handwritten Text Generation with Cross-Domain Style Transfer 文章

ArXiv CS.CV2026-05-28NEWSen作者: Andrii Ahitoliev, Pavlo Berezin

摘要

arXiv:2605.27487v1 Announce Type: new Abstract: Handwritten text generation (HTG) conditioned on writer style has been widely studied for Latin scripts, but remains underexplored for low-resource and non-Latin writing systems, leaving open how well existing models generalise beyond the Latin domain. Cyrillic, particularly Ukrainian, lacks both large-scale writer-labeled datasets and empirical evidence of such generalisation. To address this gap, we construct a Ukrainian handwritten word dataset of 126,177 images from 308 writers using connected-component segmentation, quality filtering, and targeted oversampling of underrepresented Ukrainian characters. We retrain DiffusionPen, a MobileNetV2 triplet-loss style encoder with a CANINE-conditioned latent diffusion U-Net, on this dataset without architectural modification, testing direct transfer from Latin to Cyrillic.

相关公司

暂无数据

相关人物

暂无数据