{"title":"HP relate: a customer communication system for the SMB market","authors":"S. Pruitt, A. Wiley","doi":"10.1145/2361354.2361387","DOIUrl":"https://doi.org/10.1145/2361354.2361387","url":null,"abstract":"Enterprise businesses rely on variable data publishing solutions to produce customer communications, such as letters, statements, and financial reports, which are tailored to individual recipients. Until now, however, such customer communications systems were out of the reach of the small and medium business (SMB) market for several reasons. In order to produce enterprise-quality documents, businesses needed employees with advanced skills in document design and automated document composition. In addition, customized documents typically require scripted business logic and complicated data integration. To achieve this level of document composition and delivery would require the SMB user to have access to IT systems and staffing that would be prohibitively expensive. HP Relate is an innovative document design system that delivers enterprise-quality documents for a next-generation customer communication system for the SMB market. HP Relate features easy-to-use document design tools that require no more than self-assisted training. Document business logic and data integration is accessible to SMB users through common office tools, such as dragging and dropping and spreadsheets. Instead of requiring software installed on the user's system, HP Relate is provisioned on a cloud-based platform using a software as a service (SaaS) subscription-based model. In addition, the HP Relate platform enables SMBs to deliver documents in the format of a customer's choosing, including traditional print forms, web-based deployment, and mobile devices.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"100 1","pages":"141-144"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81510690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Interactive non-linear video: definition and XML structure","authors":"Britta Meixner, H. Kosch","doi":"10.1145/2361354.2361367","DOIUrl":"https://doi.org/10.1145/2361354.2361367","url":null,"abstract":"A literature review on the term \"interactive video\" and \"interactive non-linear video\" revealed different levels of interaction in varying definitions. We give a formal definition of the term \"interactive non-linear video\" to clarify the elements and possible relations between elements contained in such videos. Furthermore, we introduce a new event-based XML format consisting of four required and two optional elements to describe this form of video. A scene graph consisting of scenes with triggers for annotations builds the core of the format. Formal definition and XML format are both illustrated by a real world example.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"118 1","pages":"49-58"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88029556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deriving document workflows from feature models","authors":"M. Penadés, A. Gómez, J. Canós","doi":"10.1145/2361354.2361405","DOIUrl":"https://doi.org/10.1145/2361354.2361405","url":null,"abstract":"Despite the increasing interest in the Document Engineering community, a formal definition of document workflow is still to come. Often, the term refers to an abstract process consisting in a set of tasks to contribute to some document contents, and some techniques are being developed to support parts of these tasks rather than how to generate the process itself. In most proposals, these tasks are implicit in the business processes running in an organization, lacking an explicit document workflow model that could be analysed and enacted as a coherent unit. In this paper, we propose a document-centric approach to document workflow generation. We have extended the feature-based document meta-model of the Document Product Lines approach with an organizational metamodel. For a given configuration of the feature model, we assign tasks to different members of the organization to contribute to the document contents. Moreover, the relationships between features define an ordering of the tasks, which may be refined to produce a specification of the document workflow model automatically. The generation of customized software manuals is used to illustrate the proposal.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"20 1","pages":"237-240"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73552836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Advene as a tailorable hypervideo authoring tool: a case study","authors":"Olivier Aubert, Yannick Prié, D. Schmitt","doi":"10.1145/2361354.2361370","DOIUrl":"https://doi.org/10.1145/2361354.2361370","url":null,"abstract":"Audiovisual documents provide a great primary material for analysis in multiple domains, such as sociology or interaction studies. Video annotation tools offer new ways of analysing these documents, beyond the conventional transcription. However, these tools are often dedicated to specific domains, putting constraints on the data model or interfaces that may not be convenient for alternative uses. Moreover, most tools serve as exploratory and analysis instruments only, not proposing export formats suitable for publication. We describe in this paper a usage of the Advene software, a versatile video annotation tool that can be tailored for various kinds of analyses: users can define their own analysis structure and visualizations, and share their analyses either as structured annotations with visualization templates, or published on the Web as hypervideo documents. We explain how users can customize the software through the definition of their own data structures and visualizations. We illustrate this adaptability through an actual usage for interview analysis.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"11 1","pages":"79-82"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81650019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sift: an end-user tool for gathering web content on the go","authors":"Matthias Geel, Timothy Church, M. Norrie","doi":"10.1145/2361354.2361395","DOIUrl":"https://doi.org/10.1145/2361354.2361395","url":null,"abstract":"Although web sites have started to embed semantic metadata within their documents, it remains a challenge for non-technical end-users to exploit that markup to extract and store information of interest. To address this challenge, we show how tools can be developed that allow users to identify extractable information while browsing and then control how that information should be extracted and stored in a personal library. The proposed approach is based on an extensible framework capable of using different kinds of markup to aid the extraction process and a unique fusion of several well-established techniques from areas such as the semantic web, data warehousing, web scraping and web feeds. We present the Sift tool which is a proof-of-concept implementation of the approach.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"46 1","pages":"181-190"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81265836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Architecture for hypermedia dynamic applications with content and behavior constraints","authors":"L. Soares, Carlos de Salles Soares Neto, José Geraldo Sousa","doi":"10.1145/2361354.2361403","DOIUrl":"https://doi.org/10.1145/2361354.2361403","url":null,"abstract":"This paper deals with the generation of dynamic hypermedia applications whose content and behavior their authors may not be able to predict a priori, but which must conform to a strict set of explicitly defined constraints. In the paper, we show that it is possible to establish an architecture configuration to be followed by this special kind of dynamic applications. In the proposed architecture, templates are responsible for specifying the design patterns and the constraints to be followed. Some alternatives for distributing (from the client side to the server side) the components that comprise the architecture are discussed, and one of them is used to exemplify an instantiation of the architecture. In the instantiation, TAL (Template Authoring Language) is used to define templates. In TAL, templates are open-compositions, that is, especial set of patterns for compositions, whose content must obey some explicitly defined constraints. The paper also shows how the architecture instantiation could be used to build dynamic digital TV applications.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"29 1","pages":"217-226"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89556848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Charactles: more than characters","authors":"Blanca Mancilla, J. Plaice","doi":"10.1145/2361354.2361406","DOIUrl":"https://doi.org/10.1145/2361354.2361406","url":null,"abstract":"In this paper, we propose a general notion of character which encompasses two concepts: points within a character set, such as Unicode, as well as arbitrary tuples defining structured objects. We call these general characters \"charactles\". Using this model, text can be defined to be a linear sequence of charactles, not requiring the use of hierarchical structures to encode the text. As a result, all sorts of processing, such as searching and typesetting, are potentially simplified.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"27 1","pages":"241-244"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79046715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Document understanding of graphical content in natively digital PDF documents","authors":"Aysylu Gabdulkhakova, Tamir Hassan","doi":"10.1145/2361354.2361385","DOIUrl":"https://doi.org/10.1145/2361354.2361385","url":null,"abstract":"This paper presents an object-based method for analysing the content drawn by graphical operators in natively digital PDF documents. We propose that graphical content in a document can be classified either as structural or non-structural and present an output model for our analysis result. Heuristic techniques are used to group the instructions into regions and determine their logical role in the document's structure. Experimental results demonstrate the effectiveness of the algorithm.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"30 1","pages":"137-140"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73941888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Structural and visual comparisons for web page archiving","authors":"M. Law, Nicolas Thome, Stéphane Gançarski, M. Cord","doi":"10.1145/2361354.2361380","DOIUrl":"https://doi.org/10.1145/2361354.2361380","url":null,"abstract":"In this paper, we propose a Web page archiving system that combines state-of-the-art comparison methods based on the source codes of Web pages, with computer vision techniques. To detect whether successive versions of a Web page are similar or not, our system is based on: (1) a combination of structural and visual comparison methods embedded in a statistical discriminative model, (2) a visual similarity measure designed for Web pages that improves change detection, (3) a supervised feature selection method adapted to Web archiving. We train a Support Vector Machine model with vectors of similarity scores between successive versions of pages. The trained model then determines whether two versions, defined by their vector of similarity scores, are similar or not. Experiments on real archives validate our approach.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"7 1","pages":"117-120"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80274074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A first approach to the automatic recognition of structural patterns in XML documents","authors":"A. Iorio, S. Peroni, Francesco Poggi, F. Vitali","doi":"10.1145/2361354.2361374","DOIUrl":"https://doi.org/10.1145/2361354.2361374","url":null,"abstract":"XML is among the preferred formats for storing the structure of documents such as scientific articles, manuals, documentation, literary works, etc. Sometimes publishers adopt established and well-known vocabularies such as DocBook and TEI, other times they create partially or entirely new ones that better deal with the particular requirements of their documents. The (explicit and implicit) requirements of use in these vocabularies often follow well-established patterns, creating meta-structures (the block, the container, the inline element, etc.) that persist across vocabularies and authors and that describe a truer and more general conceptualization of the documents' building blocks. Addressing such meta-structures not only gives a better insight of what documents really are composed of, but provides abstract and more general mechanisms to work on documents regardless of the availability of specific schemas, tools and presentation stylesheets. In this paper we introduce a schema-independent theory based on eleven structural patterns. We provide a definition of such patterns and how they synthesize characteristics emerging from real markup documents. Additionally, we propose an algorithm that allows us to identify the pattern of each element in a set of homogeneous markup documents.","PeriodicalId":91385,"journal":{"name":"Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering","volume":"15 1","pages":"85-94"},"PeriodicalIF":0.0,"publicationDate":"2012-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83373976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}