用于文档分析和识别的通用元数据存储库

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA) Pub Date : 2016-11-01 DOI:10.1109/AICCSA.2016.7945622

Hassanin M. Al-Barhamtoshy, Maher Khemakhem, K. Jambi, F. Essa, A. Fattouh, A. Al-Ghamdi

{"title":"用于文档分析和识别的通用元数据存储库","authors":"Hassanin M. Al-Barhamtoshy, Maher Khemakhem, K. Jambi, F. Essa, A. Fattouh, A. Al-Ghamdi","doi":"10.1109/AICCSA.2016.7945622","DOIUrl":null,"url":null,"abstract":"Document Analysis and Recognition (DAR) has two main objectives, first the analysis of the physical structure of the input image of the document, which should lead to the correct identification of the corresponding different homogeneous components and their boundaries in terms of XY coordinates. Second, each of these homogeneous components should be recognized in such a way that, if it is a text image, consequently this image should be recognized and translated into an intelligible text. DAR remains one of the most challenging topics in pattern recognition. Indeed, despite the diversity of the proposed approaches, techniques and methods, results remain very weak and away from expectations especially for several categories of documents such as complex, low quality, handwritten and historical documents. The complex structure and/or morphology of such documents are behind the weakness of results of these proposed approaches, techniques and methods. One of the challenging problems related to this topic is the creation of standard datasets that can be used by all stakeholders of this topic such as system developers, expert evaluators, and users. In addition, another challenging problem is how one could take advantages of all existing datasets that unfortunately are dispersed around the world without knowing, most of the times, any information about their locations and the way to reach them. As an attempt to solve the two mentioned above problems, we propose in this paper a Universal Datasets Repository for Document Analysis and Recognition (UMDAR) that has, in fact, a twofold advantage. First, it can help dataset creators to standardize their datasets and making them accessible to the research community once published on the proposed repository. Second, it can be used as a central which bridges in a smart manner between datasets and all DAR stakeholders.","PeriodicalId":448329,"journal":{"name":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Universal metadata repository for document analysis and recognition\",\"authors\":\"Hassanin M. Al-Barhamtoshy, Maher Khemakhem, K. Jambi, F. Essa, A. Fattouh, A. Al-Ghamdi\",\"doi\":\"10.1109/AICCSA.2016.7945622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Document Analysis and Recognition (DAR) has two main objectives, first the analysis of the physical structure of the input image of the document, which should lead to the correct identification of the corresponding different homogeneous components and their boundaries in terms of XY coordinates. Second, each of these homogeneous components should be recognized in such a way that, if it is a text image, consequently this image should be recognized and translated into an intelligible text. DAR remains one of the most challenging topics in pattern recognition. Indeed, despite the diversity of the proposed approaches, techniques and methods, results remain very weak and away from expectations especially for several categories of documents such as complex, low quality, handwritten and historical documents. The complex structure and/or morphology of such documents are behind the weakness of results of these proposed approaches, techniques and methods. One of the challenging problems related to this topic is the creation of standard datasets that can be used by all stakeholders of this topic such as system developers, expert evaluators, and users. In addition, another challenging problem is how one could take advantages of all existing datasets that unfortunately are dispersed around the world without knowing, most of the times, any information about their locations and the way to reach them. As an attempt to solve the two mentioned above problems, we propose in this paper a Universal Datasets Repository for Document Analysis and Recognition (UMDAR) that has, in fact, a twofold advantage. First, it can help dataset creators to standardize their datasets and making them accessible to the research community once published on the proposed repository. Second, it can be used as a central which bridges in a smart manner between datasets and all DAR stakeholders.\",\"PeriodicalId\":448329,\"journal\":{\"name\":\"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICCSA.2016.7945622\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2016.7945622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

文档分析与识别(Document Analysis and Recognition, DAR)有两个主要目标，首先是对文档输入图像的物理结构进行分析，从而正确识别相应的不同同构分量及其在XY坐标下的边界。其次，每一个同质成分都应该以这样一种方式被识别，如果它是一个文本图像，那么这个图像应该被识别并翻译成可理解的文本。DAR仍然是模式识别中最具挑战性的课题之一。事实上，尽管所提议的方法、技术和方法多种多样，但结果仍然非常薄弱，与预期相差甚远，特别是对于复杂、低质量、手写和历史文件等几类文件。这些文件的复杂结构和/或形态是这些提出的方法、技术和方法的结果薄弱的原因。与该主题相关的一个具有挑战性的问题是创建可由该主题的所有利益相关者(如系统开发人员、专家评估人员和用户)使用的标准数据集。此外，另一个具有挑战性的问题是，人们如何利用所有现有的数据集，不幸的是，这些数据集分散在世界各地，大多数时候，人们不知道它们的位置和到达它们的方式的任何信息。作为解决上述两个问题的尝试，我们在本文中提出了一个用于文档分析和识别的通用数据集存储库(UMDAR)，它实际上具有双重优势。首先，它可以帮助数据集创建者标准化他们的数据集，并且一旦在建议的存储库上发布，研究社区就可以访问它们。其次，它可以作为一个中心，以智能的方式在数据集和所有DAR利益相关者之间架起桥梁。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Universal metadata repository for document analysis and recognition

Document Analysis and Recognition (DAR) has two main objectives, first the analysis of the physical structure of the input image of the document, which should lead to the correct identification of the corresponding different homogeneous components and their boundaries in terms of XY coordinates. Second, each of these homogeneous components should be recognized in such a way that, if it is a text image, consequently this image should be recognized and translated into an intelligible text. DAR remains one of the most challenging topics in pattern recognition. Indeed, despite the diversity of the proposed approaches, techniques and methods, results remain very weak and away from expectations especially for several categories of documents such as complex, low quality, handwritten and historical documents. The complex structure and/or morphology of such documents are behind the weakness of results of these proposed approaches, techniques and methods. One of the challenging problems related to this topic is the creation of standard datasets that can be used by all stakeholders of this topic such as system developers, expert evaluators, and users. In addition, another challenging problem is how one could take advantages of all existing datasets that unfortunately are dispersed around the world without knowing, most of the times, any information about their locations and the way to reach them. As an attempt to solve the two mentioned above problems, we propose in this paper a Universal Datasets Repository for Document Analysis and Recognition (UMDAR) that has, in fact, a twofold advantage. First, it can help dataset creators to standardize their datasets and making them accessible to the research community once published on the proposed repository. Second, it can be used as a central which bridges in a smart manner between datasets and all DAR stakeholders.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)

自引率

0.00%

发文量