TY - GEN
T1 - A Local Perceptual Approach for Few-Shot Text Effect Transfer
AU - Zhan, Hongjian
AU - Tian, Wei
AU - Lu, Yue
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - Text effect transfer (TET) aims to preserve the content of character images while rendering their style into various forms, including colors, outlines, shadows, textures, and glyphs. However, manually designing a complete font library is a labor-intensive task, making few-shot text effect transfer an increasingly important research focus. Existing methods often suffer from poor generalization, as their models are limited to a small range of text effects. Some approaches attempt to address this issue, but due to the scarcity of reference-style images, they tend to overfit or lack fine details, leading to failures when handling unseen text effects. To overcome these challenges, we propose a novel fine-tuning strategy that integrates Local Perceptual Fusion and Discrimination to enhance few-shot text effect transfer. Specifically, our fine-tuning strategy allows the model to adapt its parameters based on a small set of reference images from previously unseen styles, enabling the generation of realistic text effects. Additionally, we introduce a structure-level fusion mechanism in the style encoder to improve detail fidelity. To mitigate overfitting, we design a global discriminator and a local discriminator: the global discriminator assesses the overall realism of the generated styles, while the local discriminator performs fine-grained evaluation based on localized observations, ensuring both global consistency and fine-detail preservation. Experimental results demonstrate that our approach achieves advanced performance in few-shot text effect transfer, generating high-quality and highly faithful text effects.
AB - Text effect transfer (TET) aims to preserve the content of character images while rendering their style into various forms, including colors, outlines, shadows, textures, and glyphs. However, manually designing a complete font library is a labor-intensive task, making few-shot text effect transfer an increasingly important research focus. Existing methods often suffer from poor generalization, as their models are limited to a small range of text effects. Some approaches attempt to address this issue, but due to the scarcity of reference-style images, they tend to overfit or lack fine details, leading to failures when handling unseen text effects. To overcome these challenges, we propose a novel fine-tuning strategy that integrates Local Perceptual Fusion and Discrimination to enhance few-shot text effect transfer. Specifically, our fine-tuning strategy allows the model to adapt its parameters based on a small set of reference images from previously unseen styles, enabling the generation of realistic text effects. Additionally, we introduce a structure-level fusion mechanism in the style encoder to improve detail fidelity. To mitigate overfitting, we design a global discriminator and a local discriminator: the global discriminator assesses the overall realism of the generated styles, while the local discriminator performs fine-grained evaluation based on localized observations, ensuring both global consistency and fine-detail preservation. Experimental results demonstrate that our approach achieves advanced performance in few-shot text effect transfer, generating high-quality and highly faithful text effects.
KW - Few Shot
KW - Local Perceptual Approach
KW - Text Effect Transfer
UR - https://www.scopus.com/pages/publications/105022171976
U2 - 10.1007/978-981-95-3729-7_21
DO - 10.1007/978-981-95-3729-7_21
M3 - 会议稿件
AN - SCOPUS:105022171976
SN - 9789819537280
T3 - Lecture Notes in Computer Science
SP - 247
EP - 259
BT - Image and Graphics - 13th International Conference, ICIG 2025, Proceedings
A2 - Lin, Zhouchen
A2 - Wang, Liang
A2 - Jiang, Yugang
A2 - Wang, Xuesong
A2 - Liao, Shengcai
A2 - Shan, Shiguang
A2 - Liu, Risheng
A2 - Dong, Jing
A2 - Yu, Xin
PB - Springer Science and Business Media Deutschland GmbH
T2 - 13th International Conference on Image and Graphics, ICIG 2025
Y2 - 31 October 2025 through 2 November 2025
ER -