{"title":"手写数学公式表示和符号分割的视线笔画和Parzen形状上下文特征","authors":"Lei Hu, R. Zanibbi","doi":"10.1109/ICFHR.2016.0044","DOIUrl":null,"url":null,"abstract":"This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.","PeriodicalId":194844,"journal":{"name":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","volume":"138 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation\",\"authors\":\"Lei Hu, R. Zanibbi\",\"doi\":\"10.1109/ICFHR.2016.0044\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.\",\"PeriodicalId\":194844,\"journal\":{\"name\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"volume\":\"138 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFHR.2016.0044\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2016.0044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Line-of-Sight Stroke Graphs and Parzen Shape Context Features for Handwritten Math Formula Representation and Symbol Segmentation
This paper presents a new representation for handwritten math formulae: a Line-of-Sight (LOS) graph over handwritten strokes, computed using stroke convex hulls. Experimental results using the CROHME 2012 and 2014 datasets show that LOS graphs capture the visual structure of handwritten formulae better than commonly used graphs such as Time-series, Minimum Spanning Trees, and k-Nearest Neighbor graphs. We then introduce a shape context-based feature (Parzen window Shape Contexts (PSC)) which is combined with simple geometric features and the distance in time between strokes to obtain state-of-the-art symbol segmentation results (92.43% F-measure for CROHME 2014). This result is obtained using a simple method, without use of OCR or an expression grammar. A binary random forest classifier identifies which LOS graph edges represent stroke pairs that should be merged into symbols, with connected components over merged strokes defining symbols. Line-of-Sight graphs and Parzen Shape Contexts represent visual structure well, and might be usefully applied to other notations.