Proceedings of the 2015 ACM Symposium on Document Engineering: Latest Articles (Page 2)

Knuth-Plass Revisited: Flexible Line-Breaking for Automatic Document Layout

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797091

Tamir Hassan, Andrew Hunter

Abstract: There is an inherent flexibility in typesetting a block of text. Traditionally, line breaks would be manually chosen at strategic points in such a way as to minimize the amount of whitespace in each line. Hyphenation would only be used as a last resort. Knuth and Plass automated this optimization procedure, which has been used in various typesetting systems and DTP applications ever since. However, an optimal solution for the line-breaking problem does not necessarily lead us to an optimal document layout on the whole. The flexibility of choosing line breaks enables us, in many cases, to adjust the height of a paragraph by changing the number of lines, without having to make adjustments to font size, leading, etc. In many cases, the word spacing remains within the usual tolerances and visual quality does not noticeably suffer. This paper presents a modification to the Knuth-Plass algorithm to return several results for a given column of text, each corresponding to a different height, and describes steps to quantify the amount of expected flexibility in a given paragraph. We conclude with a discussion on how such "sub-optimal" results can lead to a better overall document layout, particularly in the context of mobile layouts, where flexibility is of key importance.
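The idea of returning one line-breaking result per paragraph height can be illustrated with a much simplified dynamic program. This is only a sketch under stated assumptions, not the authors' algorithm: it assumes fixed-width characters, single inter-word spaces, no hyphenation, a squared-slack cost per line, and a free last line. The function name `break_by_line_count` is hypothetical.

```python
def break_by_line_count(words, width):
    """For each feasible line count k, return (cost, lines) with minimal cost.

    Assumptions (not from the paper): every character has width 1, words are
    joined by one space, the cost of a line is its squared unused width, and
    the last line of the paragraph costs nothing.
    """
    n = len(words)
    # best[i][k] = (cost, j): cheapest way to set words[:i] in exactly k lines,
    # where the last of those lines is words[j:i].
    best = [dict() for _ in range(n + 1)]
    best[0][0] = (0, None)
    for i in range(1, n + 1):
        length = -1  # width of words[j:i] including single spaces
        for j in range(i - 1, -1, -1):
            length += len(words[j]) + 1
            if length > width:
                break  # adding more words only makes the line longer
            slack = width - length
            line_cost = 0 if i == n else slack * slack
            for k, (cost, _) in best[j].items():
                cand = cost + line_cost
                if k + 1 not in best[i] or cand < best[i][k + 1][0]:
                    best[i][k + 1] = (cand, j)
    results = {}
    for k in best[n]:
        # Walk the stored back-pointers to recover the lines for this count.
        lines, i, remaining = [], n, k
        while i > 0:
            j = best[i][remaining][1]
            lines.append(" ".join(words[j:i]))
            i, remaining = j, remaining - 1
        results[k] = (best[n][k][0], lines[::-1])
    return results
```

Calling `break_by_line_count("aa bb cc dd".split(), 5)` yields entries for 2, 3 and 4 lines, so a layout engine could pick the paragraph height that best fits the page and accept a higher cost in exchange.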

Citations: 2

MSoS: A Multi-Screen-Oriented Web Page Segmentation Approach

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797090

Mira Sarkis, C. Concolato, Jean-Claude Dufourd

Citations: 5

Change Classification in Graphics-Intensive Digital Documents

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797079

Jeremy Svendsen, A. Albu

Citations: 0

VEDD: A Visual Editor for Creation and Semi-Automatic Update of Derived Documents

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797075

K. Marriott, Mingzheng Shi, Michael Wybrow

Citations: 0

Automatic Text Document Summarization Based on Machine Learning

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797099

G. Silva, Rafael Ferreira, R. Lins, L. Cabral, Hilário Oliveira, S. Simske, M. Riss

Citations: 15

Searching Live Meeting Documents "Show me the Action"

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797082

Laurent Denoue, S. Carter, Matthew L. Cooper

Citations: 2

Document Engineering Issues in Document Analysis

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2801033

Charles K. Nicholas, Robert Brandon

Citations: 0

The Delaunay Document Layout Descriptor

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797059

Sébastien Eskenazi, Petra Gomez-Krämer, J. Ogier

Abstract: Security applications related to document authentication require an exact match between an authentic copy and the original of a document. This implies that the document analysis algorithms that are used to compare two documents (original and copy) should provide the same output. This kind of algorithm includes the computation of layout descriptors from the segmentation result, as the layout of a document is a part of its semantic content. To this end, this paper presents a new layout descriptor that significantly improves the state of the art. The basis of this descriptor is the use of a Delaunay triangulation of the centroids of the document regions. This triangulation is seen as a graph and the adjacency matrix of the graph forms the descriptor. While most layout descriptors have a stability of 0% with regard to an exact match, our descriptor has a stability of 74%, which can be brought up to 100% with the use of an appropriate matching algorithm. It also achieves 100% accuracy and retrieval in a document retrieval scheme on a database of 960 document images. Furthermore, this descriptor is extremely efficient as it performs a search in constant time with respect to the size of the document database and it reduces the size of the index of the database by a factor of 400.
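The core construction named in the abstract (triangulate the region centroids, then read the triangulation as a graph and take its adjacency matrix) can be sketched as follows. This is an illustrative sketch, not the authors' implementation: it assumes the centroids are given as (x, y) pairs in a fixed region order, uses a brute-force empty-circumcircle test that is only practical for a handful of regions, and ignores degenerate inputs such as four cocircular points. The function name `delaunay_adjacency` and its helpers are hypothetical.

```python
from itertools import combinations


def _orient(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def _in_circumcircle(a, b, c, p):
    """True if p lies strictly inside the circumcircle of ccw triangle abc."""
    (ax, ay), (bx, by), (cx, cy) = [(q[0] - p[0], q[1] - p[1]) for q in (a, b, c)]
    det = ((ax * ax + ay * ay) * (bx * cy - cx * by)
           - (bx * bx + by * by) * (ax * cy - cx * ay)
           + (cx * cx + cy * cy) * (ax * by - bx * ay))
    return det > 0


def delaunay_adjacency(centroids):
    """Adjacency matrix of the Delaunay graph of the given centroids.

    Assumption: the row/column order follows the order of `centroids`; how
    regions are ordered in the paper is not stated in the abstract.
    """
    n = len(centroids)
    adj = [[0] * n for _ in range(n)]
    for i, j, k in combinations(range(n), 3):
        a, b, c = centroids[i], centroids[j], centroids[k]
        o = _orient(a, b, c)
        if o == 0:
            continue  # collinear points do not form a triangle
        if o < 0:
            b, c = c, b  # make the triangle counter-clockwise
        others = (centroids[m] for m in range(n) if m not in (i, j, k))
        if any(_in_circumcircle(a, b, c, p) for p in others):
            continue  # not a Delaunay triangle
        for u, v in ((i, j), (j, k), (i, k)):
            adj[u][v] = adj[v][u] = 1
    return adj
```

For the four centroids `[(-3, 0), (3, 0), (0, 1), (0, -1)]` the sketch connects every pair except the first two points, because the Delaunay criterion picks the short diagonal of the rhombus. For realistic page counts, a library routine such as `scipy.spatial.Delaunay` would be the more sensible choice than this brute-force test.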

Citations: 8

Automatic Extraction of Figures from Scholarly Documents

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797085

Sagnik Ray Choudhury, P. Mitra, C. Lee Giles

Citations: 28

Multimedia Document Structure for Distributed Theatre

Proceedings of the 2015 ACM Symposium on Document Engineering Pub Date: 2015-09-08 DOI: 10.1145/2682571.2797087

Jack Jansen, Michael Frantzis, Pablo César

Citations: 2