Recognising hand-written Japanese sentences

Proceedings of 1993 2nd IEEE International Workshop on Robot and Human Communication Pub Date : 1993-11-03 DOI:10.1109/ROMAN.1993.367736

D. Inman

{"title":"Recognising hand-written Japanese sentences","authors":"D. Inman","doi":"10.1109/ROMAN.1993.367736","DOIUrl":null,"url":null,"abstract":"This paper makes a case for handwriting recognition compared to other input methods for communication with machines. A comparison is made with voice recognition and keyboard input systems for both western languages and for Japanese. Both single word recognition and whole sentence recognition are considered. A case is made for handwriting recognition for a language with a large character set and many homonyms, such as the Japanese language. For such a language, a fundamental problem exists for both keyboard input and for voice recognition. Both these systems need to convert a phonetic representation into Kanji, and this requires extensive knowledge of the meaning of the text if it is to be automatic. AI research has yet to deliver fast, competent text understanding systems. Consequently, both voice and keyboard input methods need to present the user with alternative choices during recognition, and this makes these methods slow and unnatural. A system is described here which is designed for accurate, fast sentence recognition of both western scripts and Japanese. The system is designed for whole sentence recognition, with the user allowed to write in a natural way. There is considerable flexibility allowed in terms of size and shape of the writing. The distinguishing characteristic of the system, is the use of a unified recognition technique applied to character, word and sentence recognition. This technique is an adaptation of chart parsing, used extensively in natural language processing in AI. Here the technique has been developed to allow weighted multiple hypotheses during recognition. This is important for a system that allows the user to write naturally. This approach to sentence recognition, allows mistakes made during low level processing to be corrected at higher levels. Knowledge of the vocabulary and allowable sentence structures are incorporated in the system in a unified way. A useful additional result of this approach, is the ability to produce a syntactic parse of the sentence recognised. Provisional results are presented for recognition of Japanese Hiragana characters and for English capital letters. The users were given considerable freedom on the style of writing used. The results show recognition rates of over 80% at present, for a variety of users. Improvements in this performance are anticipated when lexical and syntactic modules are added. Further improvements are anticipated by incorporating learning into the system, so that the knowledge base will be tuned for each user.<<ETX>>","PeriodicalId":270591,"journal":{"name":"Proceedings of 1993 2nd IEEE International Workshop on Robot and Human Communication","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 1993 2nd IEEE International Workshop on Robot and Human Communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROMAN.1993.367736","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

This paper makes a case for handwriting recognition compared to other input methods for communication with machines. A comparison is made with voice recognition and keyboard input systems for both western languages and for Japanese. Both single word recognition and whole sentence recognition are considered. A case is made for handwriting recognition for a language with a large character set and many homonyms, such as the Japanese language. For such a language, a fundamental problem exists for both keyboard input and for voice recognition. Both these systems need to convert a phonetic representation into Kanji, and this requires extensive knowledge of the meaning of the text if it is to be automatic. AI research has yet to deliver fast, competent text understanding systems. Consequently, both voice and keyboard input methods need to present the user with alternative choices during recognition, and this makes these methods slow and unnatural. A system is described here which is designed for accurate, fast sentence recognition of both western scripts and Japanese. The system is designed for whole sentence recognition, with the user allowed to write in a natural way. There is considerable flexibility allowed in terms of size and shape of the writing. The distinguishing characteristic of the system, is the use of a unified recognition technique applied to character, word and sentence recognition. This technique is an adaptation of chart parsing, used extensively in natural language processing in AI. Here the technique has been developed to allow weighted multiple hypotheses during recognition. This is important for a system that allows the user to write naturally. This approach to sentence recognition, allows mistakes made during low level processing to be corrected at higher levels. Knowledge of the vocabulary and allowable sentence structures are incorporated in the system in a unified way. A useful additional result of this approach, is the ability to produce a syntactic parse of the sentence recognised. Provisional results are presented for recognition of Japanese Hiragana characters and for English capital letters. The users were given considerable freedom on the style of writing used. The results show recognition rates of over 80% at present, for a variety of users. Improvements in this performance are anticipated when lexical and syntactic modules are added. Further improvements are anticipated by incorporating learning into the system, so that the knowledge base will be tuned for each user.<>

查看原文本刊更多论文

识别手写的日语句子

本文将手写体识别与其他与机器通信的输入法进行了比较。并与西方语言和日语的语音识别和键盘输入系统进行了比较。同时考虑了单字识别和整句识别。为具有大字符集和许多同音异义词的语言(如日语)的手写识别提供了一个案例。对于这样一种语言，键盘输入和语音识别都存在一个基本问题。这两种系统都需要将语音表示转换为汉字，如果要实现自动化，则需要对文本的含义有广泛的了解。人工智能研究尚未提供快速、有效的文本理解系统。因此，语音和键盘输入方法都需要在识别过程中向用户提供可供选择的选项，这使得这些方法变得缓慢且不自然。本文介绍了一种能够准确、快速地识别西文和日文句子的系统。该系统是为整句识别而设计的，允许用户以自然的方式书写。字体的大小和形状有相当大的灵活性。该系统的显著特点，是采用统一的识别技术应用于字符、单词和句子的识别。该技术是对图表解析的一种改进，广泛应用于人工智能的自然语言处理。在这里，该技术已经发展到允许在识别过程中加权多个假设。这对于允许用户自然编写的系统非常重要。这种句子识别方法允许低级处理过程中出现的错误在高级处理过程中得到纠正。词汇知识和允许的句子结构被统一地纳入系统。这种方法的另一个有用的结果是能够对所识别的句子进行语法解析。给出了识别日文平假名字符和英文大写字母的临时结果。使用者在使用写作风格上有相当大的自由。结果表明，目前的识别率在80%以上，适用于各种用户。当添加词法和语法模块时，预期性能会有所提高。通过将学习整合到系统中，预期会有进一步的改进，这样知识库将针对每个用户进行调整

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of 1993 2nd IEEE International Workshop on Robot and Human Communication

自引率

0.00%

发文量