{"title":"Separating XHTML content from navigation clutter using DOM-structure block analysis","authors":"Constantine Mantratzis, M. Orgun, S. Cassidy","doi":"10.1145/1083356.1083384","DOIUrl":"https://doi.org/10.1145/1083356.1083384","url":null,"abstract":"This short paper gives an overview of the principles behind an algorithm that separates the core-content of a web document from hyperlinked-clutter such as text advertisements and long links of syndicated references to other resources.Its advantage over other approaches is its ability to identify both loosely as well as tightly defined \"table-like\" or \"list-like\" structures of hyperlinks (from nested tables to simple, bullet-pointed lists) by operating at various levels within the DOM tree.The resulting data can then be used to extract the core-content from a web document for semantic analysis or other information retrieval purposes as well as to aid in the process of \"clipping\" a web document to its bare essentials for use with hardware-limited devices such as PDAs and cell phones.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123216945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semantically annotated hypermedia services","authors":"I. Pandis, Nikos Karousos, T. Tiropanis","doi":"10.1145/1083356.1083406","DOIUrl":"https://doi.org/10.1145/1083356.1083406","url":null,"abstract":"Hypermedia systems' researchers investigate the various approaches in the way documents and resources are linked, navigated and stored in a distributed environment. Unfortunately, those systems fail to provide effortlessly usable discrete services, since it is difficult both to discover and to invoke any of them. This paper proposes the usage of emerging technologies that try to augment the Web resources with semantics in order to provide Hypermedia services that can be easily discovered, and integrated by potential third party developers. In this context, we analyze the benefits for the Hypermedia community upon the adoption of Semantic Web technologies for the description of Hypermedia services, and we implement an initial corresponding ontology.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128369131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generalized semantics-to-document derivation","authors":"L. Rutledge, M. Alberink, L. Hardman, M. Veenstra","doi":"10.1145/1083356.1083422","DOIUrl":"https://doi.org/10.1145/1083356.1083422","url":null,"abstract":"This poster presents a general clustering-based algorithm for deriving presentation structure from semantic structure. Domain-independent presentation generation results from this algorithm.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133440242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Constraints in spatial structures","authors":"Claus Atzenbeck, Peter J. Nürnberg","doi":"10.1145/1083356.1083368","DOIUrl":"https://doi.org/10.1145/1083356.1083368","url":null,"abstract":"People have become used to paper as an information carrier over thousands of years. Paper is usually easy to handle and has been adopted as a metaphor for information structures in computer applications. This article gives a brief overview of our analysis on real world bindings. We further compare those to some metaphor-based spatial structure applications. We conclude that the high abstract implementation level in spatial structure applications takes away additional metainformation that may be useful for the user to find information quicker.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"231 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114693606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"As we may perceive: inferring logical documents from hypertext","authors":"Pavel A. Dmitriev, C. Lagoze, B. Suchkov","doi":"10.1145/1083356.1083370","DOIUrl":"https://doi.org/10.1145/1083356.1083370","url":null,"abstract":"In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggregation of web pages into web communities. Such logical information units improve a variety of web algorithms and provide the building blocks for the construction of organized information spaces such as digital libraries. In this paper, we focus on a type of logical information units called \"compound documents\". We argue that the ability to identify compound documents can improve information retrieval, automatic metadata generation, and navigation on the Web. We propose a unified framework for identifying the boundaries of compound documents, which combines both structural and content features of constituent web pages. The framework is based on a combination of machine learning and clustering algorithms, with the former algorithm supervising the latter one. We also propose a new method for evaluating quality of clusterings, based on a user behavior model. Experiments on a collection of educational web sites show that our approach can reliably identify most of the compound documents on these sites.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117057526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fragment identifiers for plain text files","authors":"Erik Wilde, M. Baschnagel","doi":"10.1145/1083356.1083398","DOIUrl":"https://doi.org/10.1145/1083356.1083398","url":null,"abstract":"Hypermedia systems like the Web heavily depend on their ability to link resources. One of the key features of the Web's URIs is their ability to not only specify a resource, but to also identify a subresource within that resource, by using a fragment identifier. Fragment identification enables user to create better hypermedia. We present a proposal for fragment identifiers for plain text files, which makes it possible to identify character or line ranges, or subresources identified by regular expressions. Using these fragment identifiers, it is possible to create more specific hyperlinks, by not only linking to a complete plain text resource, but only the relevant part of it. Along with this proposal, a prototype implementation is described which can be used both as a server-side testbed and as a client-side extension for the Firefox browser.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115967619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bulk loading large collections of hyperlinked resources","authors":"Davood Rafiei","doi":"10.1145/1083356.1083413","DOIUrl":"https://doi.org/10.1145/1083356.1083413","url":null,"abstract":"The problem of loading large collections of hyperlinked resources into a relational database is complicated with inter-node references when these references cannot be indexed. We show that this scenario can arise in many real life hyperlinked resources and propose several solutions to address the problem. We run some experiments over a graph of the Web with 178 million nodes and around 1 billion edges and report our results.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131877752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Smart content factory: assisting search for digital objects by generic linking concepts to multimedia content","authors":"Tobias Bürger, Erich Gams, Georg Güntner","doi":"10.1145/1083356.1083423","DOIUrl":"https://doi.org/10.1145/1083356.1083423","url":null,"abstract":"Search, retrieval and navigation in audiovisual repositories is a task common to all media asset management systems: Users are supported by a wide range of features which are traditionally based on full text search and metadata queries. In this paper we describe an approach to superimpose a semantic indexing infrastructure over the media assets and the metadata associated with them. The infrastructure is based on formal knowledge models and facilitates the use of further navigation dimensions: By identifying semantic concepts we are able to create a dynamic navigation structure which is based on the underlying knowledge model and the conceptual relations defined therein.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133403847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yasuhiro Yamamoto, K. Nakakoji, Yoshiyuki Nishinaka, Mitsuhiro Asada, Ryouichi Matsuda
{"title":"What is the space for?: the role of space in authoring hypertext representations","authors":"Yasuhiro Yamamoto, K. Nakakoji, Yoshiyuki Nishinaka, Mitsuhiro Asada, Ryouichi Matsuda","doi":"10.1145/1083356.1083378","DOIUrl":"https://doi.org/10.1145/1083356.1083378","url":null,"abstract":"This paper describes our approach of using spatial hypertext as a means separated from an end representation for hypertext authoring. By taking advantage of the power of rich interpretation and constant grounding capabilities of a spatial hypertext representation, ART001, ART006, and ART014 use spatial hypertext as a means for authoring linear, hierarchical, and network structures, respectively. The role of the space of the tools includes controlling a structure and annotating a structure. The three prototyped tools have been developed to demonstrate what visual interaction design concerns need to be taken into account to integrate a spatial hypertext as a means with another structural representation as an end. The paper concludes with a discussion of what it means to separate representations as a means from those as an end in hypertext authoring.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125821720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High-level translation of adaptive hypermedia applications","authors":"Ewald Ramp, P. D. Bra, Peter Brusilovsky","doi":"10.1145/1083356.1083379","DOIUrl":"https://doi.org/10.1145/1083356.1083379","url":null,"abstract":"In the early years of the adaptive hypermedia research a large number of special-purpose adaptive hypermedia systems (AHS) have been developed, to illustrate research ideas, or to serve a single application. Many of these systems are now obsolete. In this paper we propose to bring new life to these applications by means of translation to a general purpose adaptive hypermedia architecture. We illustrate that this approach can work by showing a high-level translation from InterBook [2] to AHA! [5]. Such a translation consists of three parts: the structure of concepts and concept relationships needs to be translated, the adaptive behavior for these concept relationships must be defined, and the layout and presentation of the source application must be \"simulated\". Our high-level translation covers all three parts.","PeriodicalId":134809,"journal":{"name":"UK Conference on Hypertext","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125914169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}