{"title":"从扫描文档中提取多方向手写注释","authors":"M. B. Jlaiel, R. Mullot, A. Alimi","doi":"10.1109/DAS.2014.17","DOIUrl":null,"url":null,"abstract":"In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.","PeriodicalId":220495,"journal":{"name":"2014 11th IAPR International Workshop on Document Analysis Systems","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Multi-oriented Handwritten Annotations Extraction from Scanned Documents\",\"authors\":\"M. B. Jlaiel, R. Mullot, A. Alimi\",\"doi\":\"10.1109/DAS.2014.17\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.\",\"PeriodicalId\":220495,\"journal\":{\"name\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"volume\":\"88 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 11th IAPR International Workshop on Document Analysis Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DAS.2014.17\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 11th IAPR International Workshop on Document Analysis Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DAS.2014.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-oriented Handwritten Annotations Extraction from Scanned Documents
In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.