{"title":"机器打印阿拉伯数学公式的识别","authors":"Kaouther Khazri Ayeb, A. Echi, A. Belaïd","doi":"10.1109/ICTIA.2014.7883769","DOIUrl":null,"url":null,"abstract":"The proposed system recognizes machine-printed Arabic mathematical formulas, extracted from scanned images of clearly printed documents, and outputs the recognition results in MathML format. Two main stages are followed by the proposed system: symbol recognition and symbol-arrangement analysis. A combination of different statistical features (Run length, central Zernike moments, Bi-level co-occurrence, etc.) and K∗, an instance-based classifier, have been used to achieve high accuracy for the recognition of mathematical symbols. We defined a set of replacement rules by a coordinate grammar to parse mathematical formulas. We used the coordinate grammar with emphasis on symbol recognizer as well as symbol arrangement analysis. Both ascending and descending parsing scheme, based on operator dominance, has been used to parse the formula. The proposed system provides output in MathML and achieves satisfactory results.","PeriodicalId":390925,"journal":{"name":"2014 Information and Communication Technologies Innovation and Application (ICTIA)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Recognition of machine-printed Arabic mathematical formulas\",\"authors\":\"Kaouther Khazri Ayeb, A. Echi, A. Belaïd\",\"doi\":\"10.1109/ICTIA.2014.7883769\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The proposed system recognizes machine-printed Arabic mathematical formulas, extracted from scanned images of clearly printed documents, and outputs the recognition results in MathML format. Two main stages are followed by the proposed system: symbol recognition and symbol-arrangement analysis. A combination of different statistical features (Run length, central Zernike moments, Bi-level co-occurrence, etc.) and K∗, an instance-based classifier, have been used to achieve high accuracy for the recognition of mathematical symbols. We defined a set of replacement rules by a coordinate grammar to parse mathematical formulas. We used the coordinate grammar with emphasis on symbol recognizer as well as symbol arrangement analysis. Both ascending and descending parsing scheme, based on operator dominance, has been used to parse the formula. The proposed system provides output in MathML and achieves satisfactory results.\",\"PeriodicalId\":390925,\"journal\":{\"name\":\"2014 Information and Communication Technologies Innovation and Application (ICTIA)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Information and Communication Technologies Innovation and Application (ICTIA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTIA.2014.7883769\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Information and Communication Technologies Innovation and Application (ICTIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTIA.2014.7883769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recognition of machine-printed Arabic mathematical formulas
The proposed system recognizes machine-printed Arabic mathematical formulas, extracted from scanned images of clearly printed documents, and outputs the recognition results in MathML format. Two main stages are followed by the proposed system: symbol recognition and symbol-arrangement analysis. A combination of different statistical features (Run length, central Zernike moments, Bi-level co-occurrence, etc.) and K∗, an instance-based classifier, have been used to achieve high accuracy for the recognition of mathematical symbols. We defined a set of replacement rules by a coordinate grammar to parse mathematical formulas. We used the coordinate grammar with emphasis on symbol recognizer as well as symbol arrangement analysis. Both ascending and descending parsing scheme, based on operator dominance, has been used to parse the formula. The proposed system provides output in MathML and achieves satisfactory results.