Florent Montreuil, Stéphane Nicolas, E. Grosicki, L. Heutte
{"title":"A New Hierarchical Handwritten Document Layout Extraction Based on Conditional Random Field Modeling","authors":"Florent Montreuil, Stéphane Nicolas, E. Grosicki, L. Heutte","doi":"10.1109/ICFHR.2010.13","DOIUrl":null,"url":null,"abstract":"In this study we describe a new approach to extract layout of unconstrained handwritten letters such as those sent by individuals to companies. The proposed model uses a hierarchical combination of Conditional Random Fields (CRFs) which gives access to various levels of the layout interpretation. The analysis proceeds by decreasing the resolution and increasing the abstraction of the document, starting from high resolution analysis (pixel level), to a low resolution of the layout structure. Informations of high resolution are used to bring a specific prior knowledge of the layout like presence of textual information. Experiments have been performed on the RIMES database composed of more than 5000 handwritten letters. Good results have been reported showing the capacity of our approach to extract simultaneously the physical and logical layouts.","PeriodicalId":335044,"journal":{"name":"2010 12th International Conference on Frontiers in Handwriting Recognition","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 12th International Conference on Frontiers in Handwriting Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2010.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
In this study we describe a new approach to extract layout of unconstrained handwritten letters such as those sent by individuals to companies. The proposed model uses a hierarchical combination of Conditional Random Fields (CRFs) which gives access to various levels of the layout interpretation. The analysis proceeds by decreasing the resolution and increasing the abstraction of the document, starting from high resolution analysis (pixel level), to a low resolution of the layout structure. Informations of high resolution are used to bring a specific prior knowledge of the layout like presence of textual information. Experiments have been performed on the RIMES database composed of more than 5000 handwritten letters. Good results have been reported showing the capacity of our approach to extract simultaneously the physical and logical layouts.