Abstract: |
This paper proposes a Chinese font generation algorithm that addresses common issues in existing methods, including structural imbalance, stroke errors, and character blending. The proposed algorithm integrates Coordinate Attention (CA) with the Convolutional Block Attention Module (CBAM). To mitigate structural imbalance, CA is utilized to incorporate positional information into channel features, generating direction-aware and position-sensitive attention maps that enhance the stability of font structures. To address stroke errors and character blending, the CBAM module is introduced to improve the generator`s focus on stylistic features while preserving critical information such as glyph style, structure, and stroke details, and suppressing noise and irrelevant data. The CA and CBAM attention mechanisms are combined within a Generative Adversarial Network (GAN). Experiments were conducted using four types of Hanyi character databases, each with 6763 samples. The proposed algorithm was compared with Pix2pix, CycleGAN, Zi2zi, MX-Font and DG-Font. The experimental results indicate that the proposed method outperforms existing approaches in terms of structural stability and stroke detail style, with improvements observed in PSNR, SSIM, and LPIPS metrics. |
|