{"title":"文献图像分析的预注意阅读与选择性注意","authors":"C. Faure","doi":"10.1109/ICDAR.1999.791853","DOIUrl":null,"url":null,"abstract":"PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The approach is based on the combination of perception and symbol reading which are the two processes involved when humans detect the organisation of a document. \"Preattentive reading\" denotes the physical segmentation related to perceptual organisation. \"Selective attention\" means that symbol reading is limited to specific sequences of symbols or to pre-attentively selected locations. An OCR provides the primary structured description of the document. PixED improves the quality of this description, completes the physical segmentation and adds a logical description. A distributed software architecture and an incremental strategy are defined to enable the integration of perception and symbol reading. The approach is tested on a set of documents composed of several pages which are gathered from proceedings of scientific conferences.","PeriodicalId":130039,"journal":{"name":"Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Preattentive reading and selective attention for document image analysis\",\"authors\":\"C. Faure\",\"doi\":\"10.1109/ICDAR.1999.791853\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The approach is based on the combination of perception and symbol reading which are the two processes involved when humans detect the organisation of a document. \\\"Preattentive reading\\\" denotes the physical segmentation related to perceptual organisation. \\\"Selective attention\\\" means that symbol reading is limited to specific sequences of symbols or to pre-attentively selected locations. An OCR provides the primary structured description of the document. PixED improves the quality of this description, completes the physical segmentation and adds a logical description. A distributed software architecture and an incremental strategy are defined to enable the integration of perception and symbol reading. The approach is tested on a set of documents composed of several pages which are gathered from proceedings of scientific conferences.\",\"PeriodicalId\":130039,\"journal\":{\"name\":\"Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.1999.791853\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.1999.791853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Preattentive reading and selective attention for document image analysis
PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The approach is based on the combination of perception and symbol reading which are the two processes involved when humans detect the organisation of a document. "Preattentive reading" denotes the physical segmentation related to perceptual organisation. "Selective attention" means that symbol reading is limited to specific sequences of symbols or to pre-attentively selected locations. An OCR provides the primary structured description of the document. PixED improves the quality of this description, completes the physical segmentation and adds a logical description. A distributed software architecture and an incremental strategy are defined to enable the integration of perception and symbol reading. The approach is tested on a set of documents composed of several pages which are gathered from proceedings of scientific conferences.