网络档案研究的自适应搜索系统

Proceedings of the 5th Information Interaction in Context Symposium Pub Date : 2014-08-26 DOI:10.1145/2637002.2637063

Hugo C. Huurdeman

{"title":"网络档案研究的自适应搜索系统","authors":"Hugo C. Huurdeman","doi":"10.1145/2637002.2637063","DOIUrl":null,"url":null,"abstract":"The wealth of digital information available in our time has become indispensable for a rich variety of tasks. We use data on the Web for work, leisure, and research, aided by various search systems, allowing us to find small needles in giant haystacks. Despite recent advances in personalization and contextualization, however, various types of tasks, ranging from simple lookup tasks to complex, exploratory and analytical ventures, are mainly supported in elementary, \"one-size-fits-all\" search interfaces. Web archives, keepers of our future cultural heritage, have gathered petabytes of valuable Web data, which characterize our times for future generations. Access to these archives, however, is surprisingly limited: online Web archives usually provide a URL-based Wayback Machine interface, sometimes extended with rudimentary search options. As a result of limited access, Web archives have not been widely used for research so far. For emerging research using Web archives, there is a need to move beyond URL-based and simple search access, towards providing support for complex (re)search tasks. In my thesis, I am exploring ways to move beyond the \"one-size-fits-all\" approach for search systems, and I work on systems which can support the flow of complex search, also in the context of archived Web data. Rich models of search and research can be incorporated into adaptive search systems, supporting search strategies in various stages of complex search tasks. Concretely, I look at the use case of the Humanities researcher, for which the large, Terabyte-scale Web archives can be a valuable addition to existing sources utilized to perform research.","PeriodicalId":447867,"journal":{"name":"Proceedings of the 5th Information Interaction in Context Symposium","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive search systems for web archive research\",\"authors\":\"Hugo C. Huurdeman\",\"doi\":\"10.1145/2637002.2637063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The wealth of digital information available in our time has become indispensable for a rich variety of tasks. We use data on the Web for work, leisure, and research, aided by various search systems, allowing us to find small needles in giant haystacks. Despite recent advances in personalization and contextualization, however, various types of tasks, ranging from simple lookup tasks to complex, exploratory and analytical ventures, are mainly supported in elementary, \\\"one-size-fits-all\\\" search interfaces. Web archives, keepers of our future cultural heritage, have gathered petabytes of valuable Web data, which characterize our times for future generations. Access to these archives, however, is surprisingly limited: online Web archives usually provide a URL-based Wayback Machine interface, sometimes extended with rudimentary search options. As a result of limited access, Web archives have not been widely used for research so far. For emerging research using Web archives, there is a need to move beyond URL-based and simple search access, towards providing support for complex (re)search tasks. In my thesis, I am exploring ways to move beyond the \\\"one-size-fits-all\\\" approach for search systems, and I work on systems which can support the flow of complex search, also in the context of archived Web data. Rich models of search and research can be incorporated into adaptive search systems, supporting search strategies in various stages of complex search tasks. Concretely, I look at the use case of the Humanities researcher, for which the large, Terabyte-scale Web archives can be a valuable addition to existing sources utilized to perform research.\",\"PeriodicalId\":447867,\"journal\":{\"name\":\"Proceedings of the 5th Information Interaction in Context Symposium\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th Information Interaction in Context Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2637002.2637063\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th Information Interaction in Context Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2637002.2637063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在我们这个时代，丰富的数字信息已经成为各种各样的任务所不可或缺的。在各种搜索系统的帮助下，我们将网络上的数据用于工作、休闲和研究，使我们能够在大海捞针中找到小针。尽管最近在个性化和上下文化方面取得了进展，但是，各种类型的任务，从简单的查找任务到复杂的、探索性的和分析性的任务，主要支持基本的、“一刀切”的搜索界面。网络档案馆，我们未来文化遗产的守护者，已经收集了数以拍字节的有价值的网络数据，这些数据将成为我们这个时代的特征。然而，访问这些档案的权限却非常有限:在线Web档案通常提供基于url的Wayback Machine界面，有时还扩展了基本的搜索选项。由于访问的限制，网络档案目前还没有被广泛用于研究。对于使用Web档案的新兴研究，有必要超越基于url的简单搜索访问，转而支持复杂的(重新)搜索任务。在我的论文中，我正在探索超越搜索系统的“一刀切”方法的方法，我研究的系统可以支持复杂搜索流，也可以在存档的Web数据上下文中支持复杂搜索流。丰富的搜索和研究模型可以整合到自适应搜索系统中，支持复杂搜索任务各个阶段的搜索策略。具体地说，我考察了人文学科研究者的用例，对于他们来说，tb规模的大型Web档案可以作为现有资源的一个有价值的补充，用于执行研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Adaptive search systems for web archive research

The wealth of digital information available in our time has become indispensable for a rich variety of tasks. We use data on the Web for work, leisure, and research, aided by various search systems, allowing us to find small needles in giant haystacks. Despite recent advances in personalization and contextualization, however, various types of tasks, ranging from simple lookup tasks to complex, exploratory and analytical ventures, are mainly supported in elementary, "one-size-fits-all" search interfaces. Web archives, keepers of our future cultural heritage, have gathered petabytes of valuable Web data, which characterize our times for future generations. Access to these archives, however, is surprisingly limited: online Web archives usually provide a URL-based Wayback Machine interface, sometimes extended with rudimentary search options. As a result of limited access, Web archives have not been widely used for research so far. For emerging research using Web archives, there is a need to move beyond URL-based and simple search access, towards providing support for complex (re)search tasks. In my thesis, I am exploring ways to move beyond the "one-size-fits-all" approach for search systems, and I work on systems which can support the flow of complex search, also in the context of archived Web data. Rich models of search and research can be incorporated into adaptive search systems, supporting search strategies in various stages of complex search tasks. Concretely, I look at the use case of the Humanities researcher, for which the large, Terabyte-scale Web archives can be a valuable addition to existing sources utilized to perform research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 5th Information Interaction in Context Symposium

自引率

0.00%

发文量