手写数学公式表示和符号分割的视线笔画和Parzen形状上下文特征

Lei Hu, R. Zanibbi
{"title":"手写数学公式表示和符号分割的视线笔画和Parzen形状上下文特征","authors":"Lei Hu, R. Zanibbi","doi":"10.1109/ICFHR.2016.0044","DOIUrl":null,"url":null,"abstract":"This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.","PeriodicalId":194844,"journal":{"name":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","volume":"138 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation\",\"authors\":\"Lei Hu, R. Zanibbi\",\"doi\":\"10.1109/ICFHR.2016.0044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.\",\"PeriodicalId\":194844,\"journal\":{\"name\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"volume\":\"138 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFHR.2016.0044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2016.0044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

摘要

本文提出了一种手写数学公式的新表示:手写笔画上的视距(LOS)图,使用笔画凸包计算。使用CROHME 2012和2014数据集的实验结果表明,LOS图比常用的图(如时间序列图、最小生成树图和k近邻图)更好地捕捉手写公式的视觉结构。然后,我们引入了一种基于形状上下文的特征(Parzen窗口形状上下文(PSC)),它与简单的几何特征和笔画之间的时间距离相结合,以获得最先进的符号分割结果(CROHME 2014的f值为92.43%)。这个结果是用一个简单的方法获得的,没有使用OCR或表达式语法。二进制随机森林分类器确定哪些LOS图边表示应该合并为符号的笔画对,并用合并笔画上的连接组件定义符号。视线图形和Parzen形状上下文很好地表示了视觉结构,并且可以有效地应用于其他符号。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation
This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信