{"title":"The present and future of google books","authors":"James Crawford","doi":"10.1145/1871854.1871866","DOIUrl":null,"url":null,"abstract":"The Google Books project has the modest goal of scanning all of the world’s books, converting them to digital form, and making them searchable and accessible. To date over twelve million books, containing over four billion pages, have been scanned and digitized. This is an impressive number but it turns out that scanning is only the beginning of the challenge. One part of the challenge in making books searchable and accessible is that a scan produces an image of a page, and often a blurred or partially obscured one at that, but searching requires a digital representation of the text on the page. Converting the image to text is also critical to creating a good reading experience since the text can then be reformatted to match the display size and the user can control the font size and layout. This is especially important for tablet devices and smart phones.","PeriodicalId":258166,"journal":{"name":"Workshop on Research Advances in Large Digital Book Repositories","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Research Advances in Large Digital Book Repositories","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1871854.1871866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The Google Books project has the modest goal of scanning all of the world’s books, converting them to digital form, and making them searchable and accessible. To date over twelve million books, containing over four billion pages, have been scanned and digitized. This is an impressive number but it turns out that scanning is only the beginning of the challenge. One part of the challenge in making books searchable and accessible is that a scan produces an image of a page, and often a blurred or partially obscured one at that, but searching requires a digital representation of the text on the page. Converting the image to text is also critical to creating a good reading experience since the text can then be reformatted to match the display size and the user can control the font size and layout. This is especially important for tablet devices and smart phones.