Semantic analysis of documents workshop (SemADoc): extended abstract

Proceedings of the ACM Symposium on Document Engineering. Pub Date: 2014-09-16. DOI: 10.1145/2644866.2644897

E. Milios, C. Domeniconi

Pages: 209-210


Abstract

A large number of document management problems would benefit from having the semantics of documents explicitly represented. However, manually assigning semantic descriptions to documents is labour-intensive and error-prone. Likewise, manually building domain-specific taxonomies is not only labour-intensive but must also be repeated often, as the domains themselves and their key concepts shift over time. In this workshop we focus on document content analysis and semantic enrichment to generate a layer of semantic description that is useful for document management tasks such as semantic information retrieval, conceptual organization and clustering of document collections for sense-making, semantic expert profiling, and document recommender systems. The aim of the workshop is to bring together researchers and practitioners to discuss different perspectives on the problems, the challenges encountered in various application scenarios, and potential solutions. We have invited submissions in all areas of semantic analysis and enrichment of documents, such as automatic tagging, named entity disambiguation, semantic linking, interactive classification and clustering of documents, document summarization, curation and validation of the analysis process, generation of visualizations of document, author, and document collection semantics, user engagement in the semantic analysis process via suitable annotation and correction tools, and study of the trade-off between the accuracy of the results and user effort. Submissions aimed at solving practical problems in specific application domains, including but not limited to digital libraries, legal document management, personalized online learning systems, and news media, are especially welcome.

The workshop is timely and relevant to the Document Engineering community, as its focus is on semantically enriching documents and document collections to make them more accessible to their readers. The task is nontrivial because of the volume of text data and the rate at which it is accumulated by companies, government, and individuals.


