Workshop on Research Advances in Large Digital Book Repositories: Latest Papers

Book search: indexing the valuable parts
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458429
Walid Magdy, Kareem Darwish
{"title":"Book search: indexing the valuable parts","authors":"Walid Magdy, Kareem Darwish","doi":"10.1145/1458412.1458429","DOIUrl":"https://doi.org/10.1145/1458412.1458429","url":null,"abstract":"With massive book digitization efforts underway, there is a need for developing effective book retrieval strategies. This paper explores the relative contribution of different parts of digitized and OCR'ed books towards effective retrieval. The examined parts include the entire content of books, book headings, book titles, and table of contents entries. Results show that indexing the headers and titles of books is nearly as effective as indexing the entire contents of books. These results indicate that certain portions of the books, specifically titles and headers, are more valuable than other parts of books. This is akin to web search where hypertext and page titles are more valuable to index than the rest of the webpage. Also, using a combination-of-evidence approach further improves retrieval effectiveness compared to using any portion of the book in isolation.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"175 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122005331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
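The combination-of-evidence idea in the abstract above can be illustrated as a weighted sum of per-field match scores. This is only a minimal sketch: the field names, the weights, and the simple term-overlap scoring are assumptions chosen for illustration, not the method or values used by the authors.

```python
# Hypothetical field weights; the abstract does not state any values.
FIELD_WEIGHTS = {"title": 3.0, "headings": 2.0, "toc": 1.5, "body": 1.0}


def field_score(query, text):
    """Return the fraction of query terms that occur in the field text."""
    terms = query.lower().split()
    if not terms:
        return 0.0
    words = set(text.lower().split())
    return sum(1 for term in terms if term in words) / len(terms)


def combined_score(query, book):
    """Weighted sum of per-field scores; a missing field contributes zero."""
    return sum(
        weight * field_score(query, book.get(field, ""))
        for field, weight in FIELD_WEIGHTS.items()
    )


def rank(query, books):
    """Sort books by combined score, best match first."""
    return sorted(books, key=lambda b: combined_score(query, b), reverse=True)


if __name__ == "__main__":
    books = [
        {"id": "a", "title": "Roman history", "body": "an overview"},
        {"id": "b", "title": "Cooking", "body": "roman history is mentioned once"},
    ]
    print([b["id"] for b in rank("roman history", books)])
```

With these assumed weights, a match in the title outranks the same match in the body text, which mirrors the abstract's observation that titles and headers carry more retrieval value than other parts of a book.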
Social navigation and annotation for electronic books
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458421
Jae-Kyung Kim, Rosta Farzan, Peter Brusilovsky
{"title":"Social navigation and annotation for electronic books","authors":"Jae-Kyung Kim, Rosta Farzan, Peter Brusilovsky","doi":"10.1145/1458412.1458421","DOIUrl":"https://doi.org/10.1145/1458412.1458421","url":null,"abstract":"Modern efforts on digitizing electronic books focus on preserving an authentic \"spatial\" representation of the original sources. The new format requires new tools to help users access, process, and make sense of digital information. This paper presents an approach that assists users of these new \"spatial\" sources by giving them a combination of annotation and social navigation support. This approach is currently fully implemented and under evaluation in a classroom study.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132876401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 15
Traditional resources help interpret texts
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458418
J. Gelernter, M. Lesk
{"title":"Traditional resources help interpret texts","authors":"J. Gelernter, M. Lesk","doi":"10.1145/1458412.1458418","DOIUrl":"https://doi.org/10.1145/1458412.1458418","url":null,"abstract":"Simple word matching between the user query and document is common, as are the mismatches of meaning and errors in recall that occur as a consequence. These defects in the \"bag of words\" model are well known, and raising the semantic level of representation will improve retrieval. This can be done by expanding words and user queries using traditional reference sources such as gazetteers and synonym lists or ontologies.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123409274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
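The query expansion described in the abstract above can be sketched as a lookup against a synonym list and a gazetteer before matching. The two small dictionaries and the function names below are hypothetical stand-ins for real reference sources, and the whole-word matching is a deliberately simple assumption rather than the authors' implementation.

```python
# Hypothetical toy resources; a real system would load a thesaurus,
# ontology, or gazetteer instead of these hand-written entries.
SYNONYMS = {"car": ["automobile", "motorcar"], "film": ["movie"]}
GAZETTEER = {"nyc": ["new york city"], "holland": ["netherlands"]}


def expand_query(query):
    """Return the query terms plus any synonyms or place-name variants."""
    expanded = []
    for term in query.lower().split():
        candidates = [term] + SYNONYMS.get(term, []) + GAZETTEER.get(term, [])
        for candidate in candidates:
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


def matches(query, document):
    """True if any expanded term occurs as a whole word or phrase.

    Whitespace is normalized and the text is padded with spaces so that
    "car" does not match inside "scar". Punctuation is not handled here.
    """
    padded = " " + " ".join(document.lower().split()) + " "
    return any(f" {term} " in padded for term in expand_query(query))


if __name__ == "__main__":
    print(expand_query("car film"))
    print(matches("car", "A vintage automobile"))
```

Plain word matching would miss "A vintage automobile" for the query "car"; the expanded query recovers it, which is the recall improvement the abstract argues for.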
A unified field theory of publishing in the networked era
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458415
B. Stein
{"title":"A unified field theory of publishing in the networked era","authors":"B. Stein","doi":"10.1145/1458412.1458415","DOIUrl":"https://doi.org/10.1145/1458412.1458415","url":null,"abstract":"I’ve been exploring the potential of “new media” for nearly thirty years. There was an important aha moment early on when I was trying to understand the essential nature of books as a medium. The breakthrough came when I stopped thinking about the physical form or content of books and focused instead on how they are used. At that time print was unique compared to other media, in terms of giving its users complete control of the sequence and pace at which they accessed the contents. The ability to re-read a paragraph until it's understood, to flip back and forth almost instantly between passages, to stop and write in the margins, or just think: this affordance of reflection (in a relatively inexpensive portable package) was the key to understanding why books have been such a powerful vehicle for moving ideas across space and time. I started calling books user-driven media, in contrast to movies, radio, and television, which at the time were producer-driven. Once microprocessors were integrated into audio and video devices, I reasoned, this distinction would disappear. However, and this is crucial, back in 1981 I also reasoned that its permanence was another important defining aspect of a book. The book of the future would be just like the book of the past, except that it might contain audio and video on its frozen \"pages.\" This was the videodisc/CD-ROM era of electronic publishing.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129228225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
How should users access the content of digital books?
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458424
N. Wacholder
{"title":"How should users access the content of digital books?","authors":"N. Wacholder","doi":"10.1145/1458412.1458424","DOIUrl":"https://doi.org/10.1145/1458412.1458424","url":null,"abstract":"I report briefly on some of my own work in each of these areas and elucidate some of the questions that this research has raised. Then I propose as a research agenda the development of a digital library environment containing a suite of inter-related tools specifically designed to facilitate non-sequential access to portions of full-text books and other relatively long documents.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127381922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Feasibility of a primarily digital research library
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458431
G. Henry, Lisa M. Spiro
{"title":"Feasibility of a primarily digital research library","authors":"G. Henry, Lisa M. Spiro","doi":"10.1145/1458412.1458431","DOIUrl":"https://doi.org/10.1145/1458412.1458431","url":null,"abstract":"This position paper explores the issues related to the feasibility of having a primarily digital research library support the teaching and research needs of a university. The Asian University for Women (AUW), a new university in Chittagong, Bangladesh, will open in September 2009. It must make a decision regarding the investment to be made in research resources to support the university. Mass digitization efforts now make it possible to consider establishing a research library that consists primarily of digital resources rather than print. There are, however, many issues that make this consideration quite complex and far from certain. In this paper we explore the issues at a preliminary level. We focus on four broad perspectives in order to begin addressing the complex interactions that must be considered in transitioning to a primarily digital research environment: technical, economic, policy and social issues. The purpose of this paper is to begin to explore a research agenda for transitioning from a model for libraries where resources are primarily print to one that is predominantly digital. Our research in this area is just beginning, so our purpose is to raise the issues rather than offer firm conclusions.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124584869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
E-Books are not books
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458416
Mark Carden
{"title":"E-Books are not books","authors":"Mark Carden","doi":"10.1145/1458412.1458416","DOIUrl":"https://doi.org/10.1145/1458412.1458416","url":null,"abstract":"Currently, in the early days of their development, e-books are essentially following the evolutionary path of physical books, a path that started thousands of years ago. Yet physical books are containers for a wide variety of information types, and are accessed in a wide variety of ways, which offers the possibility of differing electronic manifestations. The evolutionary approach will continue to be reasonably successful in meeting current needs, but the real growth in the adoption of e-books will happen when the traditional book is deconstructed and reconstructed (textually, behaviorally and commercially) in order to create new paradigms for storing and delivering content in electronic forms.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126021000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 19
Multimedia enriched digital books
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458417
Carlos J. C. Teixeira
{"title":"Multimedia enriched digital books","authors":"Carlos J. C. Teixeira","doi":"10.1145/1458412.1458417","DOIUrl":"https://doi.org/10.1145/1458412.1458417","url":null,"abstract":"This paper proposes new extensions of the digital book concept together with the required approaches to support their automatic generation. Best-sellers have often inspired other related products, sometimes in different media. Some of these can be merged into suitable forms to provide consumers with a better view and understanding of the original masterpiece: other texts about the original book, images, audio recordings or even films. Several standards and technologies, such as hypermedia, speech and language processing, and widely accessible PDAs, are nowadays available to make these experiences effective. A prototype of a Multimedia Enriched Digital Book (MEDB) is presented for reading, listening and viewing as well as for a text querying scenario. Within the scope of already available technologies, these are tangible visions for the next few years.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"172 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131108326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Automatic metadata generation for scanned scientific volumes
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458430
Xiaonan Lu, B. Kahle
{"title":"Automatic metadata generation for scanned scientific volumes","authors":"Xiaonan Lu, B. Kahle","doi":"10.1145/1458412.1458430","DOIUrl":"https://doi.org/10.1145/1458412.1458430","url":null,"abstract":"Large scale digitization projects have been conducted at the Internet Archive digital library to preserve cultural artifacts and to provide permanent access. The increasing amount of digitized resources requires advanced tools and methods that will efficiently analyze and manage digitized resources. In this position paper, we identify several issues related to scanned books projects, present our initial work on automatic metadata generation for scanned scientific journals, and suggest potential future actions.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116127558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
A web service for long tail book publishing
Workshop on Research Advances in Large Digital Book Repositories Pub Date : 2008-10-30 DOI: 10.1145/1458412.1458427
Prakash Reddy, Jian Fan, Jim Rowson, S. Rosenberg, A. Bolwell
{"title":"A web service for long tail book publishing","authors":"Prakash Reddy, Jian Fan, Jim Rowson, S. Rosenberg, A. Bolwell","doi":"10.1145/1458412.1458427","DOIUrl":"https://doi.org/10.1145/1458412.1458427","url":null,"abstract":"More than 32M unique book titles are available in US libraries, but Amazon, the biggest retailer, had only 1.2M unique titles available for sale in 2004. Currently there is an effort underway by public libraries, universities, the Open Content Alliance, Google and others, to non-destructively scan these 32M unique books and make them available for on-line viewing and search. Twenty percent (6.4M) of the 32M titles are out of copyright and out of print. A publisher estimates that an average of 40 copies of each title can be sold per year if they could be made available for sale. This long tail opportunity represents a several billion dollar market with the right cost structure. To address this long-tail book market we need to take the cost out of several parts of the value chain: automatic book preparation to minimize publishing setup costs, print-on-demand to remove warehouse and waste costs, and web 2.0 techniques to minimize marketing costs. We have created this system with several partners based on HP technology, and available as an incubation business.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131050385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
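The market-size reasoning in the abstract above can be checked with a few lines of arithmetic. The title count, the 20 percent share, and the 40 copies per title per year come from the abstract; the per-copy price is a purely hypothetical assumption added here, since the abstract gives no price.

```python
def long_tail_estimate(total_titles, share_percent, copies_per_title, price_usd):
    """Estimate the long-tail titles, yearly copies, and yearly revenue."""
    titles = total_titles * share_percent // 100  # integer math avoids float rounding
    copies = titles * copies_per_title
    revenue = copies * price_usd
    return titles, copies, revenue


if __name__ == "__main__":
    ASSUMED_PRICE_USD = 15  # hypothetical; not stated in the abstract
    titles, copies, revenue = long_tail_estimate(32_000_000, 20, 40, ASSUMED_PRICE_USD)
    print(f"{titles:,} titles")          # 6,400,000 titles
    print(f"{copies:,} copies/year")     # 256,000,000 copies/year
    print(f"${revenue:,} revenue/year")  # $3,840,000,000 revenue/year
```

Under the assumed price of 15 USD per copy, 6.4M titles selling 40 copies each would yield about 3.84 billion USD per year, which is consistent with the abstract's "several billion dollar" figure; a different price scales the result proportionally.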