Proceedings of the 2017 ACM Symposium on Document Engineering最新文献

筛选
英文 中文
Assessing Binarization Techniques for Document Images 评估文档图像的二值化技术
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3103021
R. Lins, M. Almeida, R. Bernardino, D. Jesus, José Mário Oliveira
{"title":"Assessing Binarization Techniques for Document Images","authors":"R. Lins, M. Almeida, R. Bernardino, D. Jesus, José Mário Oliveira","doi":"10.1145/3103010.3103021","DOIUrl":"https://doi.org/10.1145/3103010.3103021","url":null,"abstract":"Image binarization is a technique widely used for documents as monochromatic documents claim for far less space for storage and computer bandwidth for network transmission than their color or even grayscale equivalent. Paper color, texture, aging, translucidity, kind and color of ink used in handwritting, printing process, digitalization process, etc., are some of the factors that affect binarization. No algorithm is good enough to be a winner in the binarization of all kinds of documents. This paper presents a methodology to assess the performance of binarization algorithms for a wide variety of text documents, allowing a judicious quantitative choice of the best algorithms and their parameters.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125836717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Session details: Document Analysis: Content Analysis 会话详细信息:文档分析:内容分析
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3248711
D. Brailsford
{"title":"Session details: Document Analysis: Content Analysis","authors":"D. Brailsford","doi":"10.1145/3248711","DOIUrl":"https://doi.org/10.1145/3248711","url":null,"abstract":"","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124968937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session details: Document Analysis & #38; Visual Document Analysis 会议详情:文档分析& #38;可视化文档分析
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3248712
S. Simske
{"title":"Session details: Document Analysis & #38; Visual Document Analysis","authors":"S. Simske","doi":"10.1145/3248712","DOIUrl":"https://doi.org/10.1145/3248712","url":null,"abstract":"","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125605128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session details: User Interactions 会话细节:用户交互
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3248708
K. Marriott
{"title":"Session details: User Interactions","authors":"K. Marriott","doi":"10.1145/3248708","DOIUrl":"https://doi.org/10.1145/3248708","url":null,"abstract":"","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"179 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131908330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session details: Keynote II 会议详情:主题演讲二
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3248709
Stefania Cristina
{"title":"Session details: Keynote II","authors":"Stefania Cristina","doi":"10.1145/3248709","DOIUrl":"https://doi.org/10.1145/3248709","url":null,"abstract":"","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126525314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a Transcription System of Sign Language Video Resources via Motion Trajectory Factorisation 基于运动轨迹分解的手语视频资源转录系统研究
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3103020
M. Borg, K. Camilleri
{"title":"Towards a Transcription System of Sign Language Video Resources via Motion Trajectory Factorisation","authors":"M. Borg, K. Camilleri","doi":"10.1145/3103010.3103020","DOIUrl":"https://doi.org/10.1145/3103010.3103020","url":null,"abstract":"Sign languages are visual languages used by the Deaf community for communication purposes. Whilst recent years have seen a high growth in the quantity of sign language video collections available online, much of this material is hard to access and process due to the lack of associated text-based tagging information and because 'extracting' content directly from video is currently still a very challenging problem. Also limited is the support for the representation and documentation of sign language video resources in terms of sign writing systems. In this paper, we start with a brief survey of existing sign language technologies and we assess their state of the art from the perspective of a sign language digital information processing system. We then introduce our work, focusing on vision-based sign language recognition. We apply the factorisation method to sign language videos in order to factor out the signer's motion from the structure of the hands. We then model the motion of the hands in terms of a weighted combination of linear trajectory basis and apply a set of classifiers on the basis weights for the purpose of recognising meaningful phonological elements of sign language. We demonstrate how these classification results can be used for transcribing sign videos into a written representation for annotation and documentation purposes. Results from our evaluation process indicate the validity of our proposed framework.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132664621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Linear Extended Annotation Graphs 线性扩展标注图
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3103011
Vincent Barrellon, P. Portier, S. Calabretto, Olivier Ferret
{"title":"Linear Extended Annotation Graphs","authors":"Vincent Barrellon, P. Portier, S. Calabretto, Olivier Ferret","doi":"10.1145/3103010.3103011","DOIUrl":"https://doi.org/10.1145/3103010.3103011","url":null,"abstract":"Multistructured (M-S) data models were introduced to allow the expression of multilevel, concurrent annotation. However, most models lack either a consistent or an efficient validation mechanism. In a former paper, we introduced extended Annotation Graphs (eAG), a cyclic-graph data model equipped with a novel schema mechanism that, by allowing validation \"by construction\", bypasses the typical algorithmic cost of traditional methods for the validation of graph-structured data. We introduce here LeAG, a markup syntax for eAG annotations over text data. LeAG takes the shape of a classic, inline markup model. A LeAG annotation can then be written, in a human-readable form, in any notepad application, and saved as a text file; the syntax is simple and familiar -- yet LeAG proposes a natural syntax for multilayer annotation with (self-) overlap and links. From a theoretical point of view, LeAG inaugurates a hybrid markup paradigm. Syntactically speaking, it is a full inline model, since the tags are all inserted along the annotated resources; still, we evidence that representing independent elements' co-occurring in an inline manner requires to make the annotation rest upon a notion of reference value, that is typical of stand-off markup. To our knowledge, LeAG is the first inline markup syntax to properly conceptualize the notion of elements' accidental co-occurring, that is yet fundamental in multilevel annotation.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122823585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Baseline Detection on Arabic Handwritten Documents 阿拉伯语手写文档的基线检测
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3121037
Ahmed Fawzi, M. Pastor, C. Martínez-Hinarejos
{"title":"Baseline Detection on Arabic Handwritten Documents","authors":"Ahmed Fawzi, M. Pastor, C. Martínez-Hinarejos","doi":"10.1145/3103010.3121037","DOIUrl":"https://doi.org/10.1145/3103010.3121037","url":null,"abstract":"Document processing comprises different steps depending on the nature of the documents. For text documents, specially for handwritten documents, transcription of their contents is one of the main tasks. Handwritten Text Recognition (HTR) is the process of automatically obtaining the transcription of the content of a handwritten text document. In document processing, the basic unit for the acquisition process is the page image, whilst line image is the basic form for the HTR process. This is a bottle-neck which is holding back the massive industrial document processing. Baseline detection can be used not only to segment page images into line images but also for many other document processing steps. Baseline detection problem can be formulated as a clustering problem over a set of interest points. In this work, we study the use of an automatic baseline detection technique, based on interest point clustering, in Arabic handwritten documents. The experiments reveal that this technique provides promising results for this task.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130829581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
SketchTab3d: A Hybrid Sketch Library using Tablets and Immersive 3D Environments SketchTab3d:使用平板电脑和沉浸式3D环境的混合草图库
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3121029
Charlotte Boddien, Jill Heitmann, Florian Hermuth, Dawid Lokiec, Carlos Tan, Laura Wölbeling, Thomas Jung, J. H. Israel
{"title":"SketchTab3d: A Hybrid Sketch Library using Tablets and Immersive 3D Environments","authors":"Charlotte Boddien, Jill Heitmann, Florian Hermuth, Dawid Lokiec, Carlos Tan, Laura Wölbeling, Thomas Jung, J. H. Israel","doi":"10.1145/3103010.3121029","DOIUrl":"https://doi.org/10.1145/3103010.3121029","url":null,"abstract":"This paper proposes a 2d sketching tool and an immersive 3d sketch library as an approach to easily create and access documents (i.e. sketches). The sketch library allows users to store, arrange and assemble their own sketches and others' in theoretically unlimited space. A user can get an idea about the general activities of all users since the sketch library is updated whenever changes are made. The system provides 2d and 3d means to access the sketch library. Whereas the 2d interfaces offers a standard dash board, the 3d environment provides unrestricted spatial access to the sketch library. Furthermore, a 2d sketching interfaces is provided in order to create sketch-based documents. Possible application areas are in the fields of engineering, design, public displays, shared knowledge applications, and art. The system was evaluated among eight participants regarding its pragmatic and hedonic qualities as well as searching performance. The results suggest that the users appreciate the particular combination of 2d and 3d technologies in SketchTab3d and requested for improvement in the 3d interaction technique. No significant differences were found in the search performance, however the physical demand during searching was perceived significantly higher in the 3d condition than in the 2d condition.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125007485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
High-performance Computational Framework for Phrase Relatedness 短语相关性的高性能计算框架
Proceedings of the 2017 ACM Symposium on Document Engineering Pub Date : 2017-08-31 DOI: 10.1145/3103010.3121039
Zichu Ai, Jie Mei, A. Mohammad, N. Zeh, Meng He, E. Milios
{"title":"High-performance Computational Framework for Phrase Relatedness","authors":"Zichu Ai, Jie Mei, A. Mohammad, N. Zeh, Meng He, E. Milios","doi":"10.1145/3103010.3121039","DOIUrl":"https://doi.org/10.1145/3103010.3121039","url":null,"abstract":"TrWP is a text relatedness measure that computes semantic similarity between words and phrases utilizing aggregated statistics from the Google Web 1T 5-gram corpus. The phrase similarity computation in TrWP is costly in terms of both time and space, making the existing implementation of TrWP impractical for real-world usage. In this work, we present an in-memory computational framework for TrWP, which optimizes the corpus search using perfect hashing and minimizes the required memory cost using variable length encoding. Evaluated using the Google Web 1T 5-gram corpus, we demonstrate that the computational speed of our framework outperforms a file-based implementation by several orders of magnitude.","PeriodicalId":200469,"journal":{"name":"Proceedings of the 2017 ACM Symposium on Document Engineering","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133206163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信