Hidden Markov Models for Online Handwritten Tamil Word Recognition

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Pub Date : 2007-09-23 DOI:10.1109/ICDAR.2007.131

A. Bharath, S. Madhvanath

引用次数: 68

Abstract

Hidden Markov models (HMM) have long been a popular choice for Western cursive handwriting recognition following their success in speech recognition. Even for the recognition of Oriental scripts such as Chinese, Japanese and Korean, hidden Markov models are increasingly being used to model substrokes of characters. However, when it comes to Indie script recognition, the published work employing HMMs is limited, and generally focussed on isolated character recognition. In this effort, a data-driven HMM-based online handwritten word recognition system for Tamil, an Indie script, is proposed. The accuracies obtained ranged from 98% to 92.2% with different lexicon sizes (IK to 20 K words). These initial results are promising and warrant further research in this direction. The results are also encouraging to explore possibilities for adopting the approach to other Indie scripts as well.

查看原文本刊更多论文

隐马尔可夫模型用于在线手写泰米尔语单词识别

隐马尔可夫模型(HMM)在语音识别领域取得成功后，一直是西方手写体识别领域的热门选择。即使是汉字、日文、韩文等东方文字的识别，也越来越多地使用隐马尔可夫模型来模拟汉字的笔划。然而，当涉及到独立脚本识别时，使用hmm的出版作品是有限的，并且通常集中在孤立的字符识别上。在此基础上，提出了一种基于数据驱动的独立语言泰米尔语的在线手写单词识别系统。在不同的词汇量(IK到20k单词)下，准确率在98%到92.2%之间。这些初步结果是有希望的，值得在这个方向上进一步研究。结果也鼓励我们探索将这种方法应用于其他独立脚本的可能性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

自引率

0.00%

发文量