{"title":"Towards a Model and a Textual Representation for Location-based Games","authors":"Cristiane Ferreira, C. Salles, Luís Santos, Fernando A. M. Trinta, Windson Viana","doi":"10.1145/3103010.3121035","DOIUrl":"https://doi.org/10.1145/3103010.3121035","url":null,"abstract":"Location-Based Mobile Games (LBMGs) are a subclass of pervasive games that make use of location technologies to consider the players' geographic position in the game rules and mechanics. This research presents LEGaL, a language to model and represent the structure and multimedia contents (e.g., video, audio, 3D objects, etc.) of LBMGs. LEGaL is an extension of NCL (Nested Context Language) that allows the modelling and representation of mission-based games by supporting spatial and temporal relationships between game elements.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121069047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Historical Document Processing","authors":"B. Gatos, G. Louloudis, N. Stamatopoulos, Giorgos Sfikas","doi":"10.1145/3103010.3103026","DOIUrl":"https://doi.org/10.1145/3103010.3103026","url":null,"abstract":"This tutorial focuses on recent advances and ongoing developments in historical document processing. It covers the main challenges involved, the different tasks that have to be implemented, and the practices and technologies that currently exist in the literature. Focus is given to the most promising techniques and related projects, as well as to existing datasets and competitions that may prove useful for historical document processing research.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122871129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Knowledge Base Construction from Scholarly Documents","authors":"Rabah A. Al-Zaidy, C. Lee Giles","doi":"10.1145/3103010.3121043","DOIUrl":"https://doi.org/10.1145/3103010.3121043","url":null,"abstract":"The continuing growth of published scholarly content on the web ensures the availability of the most recent scientific findings to researchers. Scholarly documents, such as research articles, are easily accessed through academic search engines built on large repositories of scholarly documents. Extracting scientific information from documents into a structured knowledge graph representation facilitates automated machine understanding of a document's content. Traditional information extraction approaches, which either require training samples or a preexisting knowledge base to assist the extraction, can be challenging when applied to large repositories of digital documents: labeled training examples are difficult to obtain at such scale, and most available knowledge bases are built from web data and lack sufficient coverage of the concepts found in scientific articles. In this paper, we aim to construct a knowledge graph from scholarly documents while addressing both of these issues. We propose a fully automatic, unsupervised system for scientific information extraction that does not build on an existing knowledge base and avoids manually tagged training data. We describe and evaluate a constructed taxonomy that contains over 15k entities, resulting from applying our approach to 10k documents.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114468092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Intangible Nature of Drama Documents: an FRBR View","authors":"V. Lombardo, R. Damiano, Antonio Pizzo, Carmi Terzulli","doi":"10.1145/3103010.3103019","DOIUrl":"https://doi.org/10.1145/3103010.3103019","url":null,"abstract":"As a pervasive form of artistic expression through the ages and media, drama features a twofold nature: its tangible manifestations (theatrical performances, movies, books, etc.) and its intangible abstraction (the story of Cinderella underlying both the Disney movie and Perrault's fable). Encoding the intangible abstraction of drama documents is relevant for the preservation of cultural heritage and for didactics and research on drama. This paper addresses the task of encoding the notion of an intangible story abstraction from drama documents. The reference model is provided by a computational ontology that formally encodes the elements that characterize a drama, for purposes of semantic linking and inclusion in annotation schemata. By providing a formal expression positioned between drama as a work and its manifestations, the ontology-based representation is compliant with the model of Functional Requirements for Bibliographic Records (FRBR).","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114650671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Common Fold: Utilizing the Four-Fold to Dewarp Printed Documents from a Single Image","authors":"Sagnik Das, Gaurav Mishra, A. Sudharshana, Roy Shilkrot","doi":"10.1145/3103010.3121030","DOIUrl":"https://doi.org/10.1145/3103010.3121030","url":null,"abstract":"Handheld cameras are currently the device of choice for document digitization, due to their convenience, ubiquity and high performance at low cost. Software methods process a captured image to rectify distortions and reconstruct the original document. Existing methods struggle to reconstruct a flattened version from a single image of a document distorted by folding. We propose a novel non-parametric page dewarping approach from a single image, based on deep learning, to identify creases due to folds in the paper. Our method then applies a 2D boundary method based on polynomial regression, together with a Coons patch, to obtain a flattened reconstruction. We found that our method improves OCR word accuracy by more than 2.5 times compared to the original distorted image.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122686019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distributing Text Mining tasks with librAIry","authors":"Carlos Badenes-Olmedo, José Luis Redondo García, Óscar Corcho","doi":"10.1145/3103010.3121040","DOIUrl":"https://doi.org/10.1145/3103010.3121040","url":null,"abstract":"We present librAIry, a novel architecture to store, process and analyze large collections of textual resources, integrating existing algorithms and tools into a common, distributed, high-performance workflow. Available text mining techniques can be incorporated into the framework as independent plug-and-play modules working in a collaborative manner. In the absence of a pre-defined flow, librAIry leverages the aggregation of operations executed by different components in response to an emergent chain of events. Extensive use of Linked Data (LD) and Representational State Transfer (REST) principles is made to provide individually addressable resources from textual documents. We describe the architecture design and its implementation, and test its effectiveness in real-world scenarios such as collections of research papers, patents or ICT aids, with the objective of providing solutions for decision makers and experts in those domains. Major advantages of the framework and lessons learned from these experiments are reported.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"1999 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131263435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using Abstract Anchors to Aid The Development of Multimedia Applications With Sensory Effects","authors":"R. Abreu, J. Santos","doi":"10.1145/3103010.3103014","DOIUrl":"https://doi.org/10.1145/3103010.3103014","url":null,"abstract":"Declarative multimedia authoring languages allow authors to combine multiple media objects, generating a range of multimedia presentations. Novel multimedia applications, focused on improving user experience, extend multimedia applications with multisensory content. The idea is to synchronize sensory effects with the audiovisual content being presented. The usual approach to specifying such synchronization is to mark the content of a main media object (e.g. a main video), indicating the moments when a given effect has to be executed. For example, a mark may represent when snow appears in the main video so that a cold wind may be synchronized with it. Declarative multimedia authoring languages provide a way to mark subparts of a media object through anchors. An anchor indicates its begin and end times (video frames or audio samples) in relation to its parent media object. The manual definition of anchors in the above scenario is both inefficient and error-prone (i) when the main media object size increases, (ii) when a given scene component appears several times and (iii) when the application requires marking several scene components. This paper tackles this problem by providing an approach for creating abstract anchors in declarative multimedia documents. An abstract anchor represents (possibly) several media anchors, indicating the moments when a given scene component appears in a media object's content. The author is therefore able to define the application behavior through relationships among, for example, sensory effects and abstract anchors. Prior to execution, abstract anchors are automatically instantiated for each moment a given element appears, and relationships are cloned so that the application behavior is maintained. This paper presents an implementation of the proposed approach using NCL (Nested Context Language) as the target language. The abstract anchor processor is implemented in Lua and uses available APIs for video recognition to identify the begin and end times of abstract anchor instances. We also present an evaluation of our approach using real-world use cases.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115562002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personalized Ubiquitous Data Collection and Intervention as Interactive Multimedia Documents","authors":"C. C. Viel, Kamila R. H. Rodrigues, Isabela Zaine, Bruna C. R. Cunha, L. Scalco, M. G. Pimentel","doi":"10.1145/3103010.3121046","DOIUrl":"https://doi.org/10.1145/3103010.3121046","url":null,"abstract":"The Experience Sampling Method (ESM) has been proposed as a method for collecting data about people's experiences in their everyday and natural environments. ESM-based systems offer limited authoring of interactive documents, which are typically designed to collect text-based responses to text-based questions, integrated with non-intrusive data collection from sensors. From a document engineering perspective, ESM brings new requirements with respect to the authoring of non-trivial interaction and navigation workflows, in particular when multiple media and collaborative tasks are concerned. Tackling these challenges, we modeled the Experience Sampling and Programmed Intervention Method (ESPIM) by combining ESM, individualized teaching procedures and ubiquitous computing to produce interactive, personalized multimedia documents for data collection.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114724602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Post-Processing OCR Text using Web-Scale Corpora","authors":"Jie Mei, Aminul Islam, A. Mohammad, Yajing Wu, E. Milios","doi":"10.1145/3103010.3121032","DOIUrl":"https://doi.org/10.1145/3103010.3121032","url":null,"abstract":"We introduce a (semi-)automatic OCR post-processing system that utilizes web-scale linguistic corpora to provide high-quality corrections. This paper is a comprehensive system overview, with a focus on the computational procedures, the applied linguistic analysis, and processing optimization.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"23 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123438406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Interactive Documents based on Discrete Trials","authors":"A. F. Orlando, Isabela Zaine, M. G. Pimentel, D. G. Souza, C. Teixeira","doi":"10.1145/3103010.3121048","DOIUrl":"https://doi.org/10.1145/3103010.3121048","url":null,"abstract":"Interactive documents offer users alternatives for accessing the available content. In Education, researchers employ Individualized Learning Programs as interactive multimedia documents in order to teach students a variety of subjects. To author such interactive documents, domain experts have to control features such as pace, duration, response-based criteria, and the hierarchy of learning units. We show how individualized learning programs can be modeled as interactive documents based on discrete trials and deterministic finite automata. We also report the main usage figures associated with a companion system deployed in real environments by domain specialists and learners.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126272530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}