Multilingual-GAN: A Multilingual GAN-based Approach for Handwritten Generation
Manh-Khanh Ngo Huu, Sy-Tuyen Ho, Vinh-Tiep Nguyen, T. Ngo
2021 International Conference on Multimedia Analysis and Pattern Recognition (MAPR), October 2021. DOI: 10.1109/MAPR53640.2021.9585285
Abstract
Handwritten Text Recognition (HTR) is a difficult problem because of the diversity of calligraphic styles. To achieve high accuracy, HTR systems require a large amount of training data. Previous methods generate handwritten images from input strings via RNN models such as LSTM or GRU. However, these methods require a predefined alphabet for a given language, so they do not adapt well to new languages. To address this problem, we propose an Image2Image-based method named Multilingual-GAN, which translates a printed text image into a handwritten-style one. The main advantage of this approach is that the model does not depend on any language's alphabet; therefore, it can be applied to a new language without re-training on a new dataset. Quantitative results demonstrate that our proposed method outperforms other state-of-the-art models. Code is available at https://github.com/HoSyTuyen/MultilingualGAN
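To make the printed-to-handwritten translation idea concrete, below is a minimal sketch of a generic pix2pix-style image-to-image GAN training step. This is not the authors' Multilingual-GAN architecture (the abstract does not specify layers or losses); all class names, layer sizes, and the L1 weight are illustrative assumptions. The actual implementation is in the linked repository.

```python
# Hypothetical pix2pix-style sketch of printed -> handwritten image translation.
# Layer sizes, losses, and hyperparameters are assumptions, not the paper's design.
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Encoder-decoder mapping a printed text-line image to a handwritten-style image."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 1, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

class Discriminator(nn.Module):
    """PatchGAN-style critic over (printed, candidate-handwritten) image pairs."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 1, 4, stride=1, padding=1),
        )

    def forward(self, printed, candidate):
        # Condition on the printed input by channel-concatenating the pair.
        return self.net(torch.cat([printed, candidate], dim=1))

G, D = Generator(), Discriminator()
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce, l1 = nn.BCEWithLogitsLoss(), nn.L1Loss()

printed = torch.randn(4, 1, 64, 256)      # stand-in batch of printed text-line images
handwritten = torch.randn(4, 1, 64, 256)  # paired handwritten-style targets

# Discriminator step: push real pairs toward 1, generated pairs toward 0.
fake = G(printed)
d_real = D(printed, handwritten)
d_fake = D(printed, fake.detach())
loss_d = bce(d_real, torch.ones_like(d_real)) + bce(d_fake, torch.zeros_like(d_fake))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: fool the discriminator while staying close to the target in L1.
d_fake = D(printed, fake)
loss_g = bce(d_fake, torch.ones_like(d_fake)) + 100.0 * l1(fake, handwritten)
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```

Because the model consumes rendered images rather than character indices, nothing in this setup references an alphabet, which is what lets the approach transfer to a new language's printed text without retraining on language-specific labels.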