Recovery of vanished URLs: Comparing the efficiency of Internet Archive and Google

IF 0.5 | Zone 4, Management | Q3, INFORMATION SCIENCE & LIBRARY SCIENCE
D. V. Kumar, B. Kumar
DOI: 10.22452/MJLIS.VOL22NO2.3 (https://doi.org/10.22452/MJLIS.VOL22NO2.3)
Malaysian Journal of Library & Information Science, vol. 22, pp. 31-43. Journal article, published 2017-05-31.
Citations: 1

Abstract

This article examines the vanishing nature of URLs and the recovery of vanished URLs through the Internet Archive and the Google search engine. To that end, the study investigates the URLs cited in articles published in two open access LIS journals during 2009-2013. A total of 226 articles were selected. Of the 5197 citations in these articles, 1094 (21.05 percent) were URLs. When checked with the W3C link checker, 417 of the 1094 URLs (38.12 percent) were missing and the remaining 61.88 percent were active. The HTTP 404 error (“page not found”) was by far the most common message encountered, accounting for 54.2 percent of all HTTP error messages. The Internet Archive and the Google search engine were then used to recover the vanished URLs. The Internet Archive recovered 66.19 percent of them, whereas Google recovered only 30.70 percent. Recovery through the Internet Archive and Google raised the active URL rate from 61.88 percent to 87.11 percent and 73.58 percent, respectively. The study concludes that the Internet Archive is a more efficient tool than the Google search engine for recovering vanished URLs.
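The percentages in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check, not the authors' method: the three counts come from the abstract, while the recovered-URL counts are derived here by rounding the reported recovery shares and are therefore assumptions. The helper names `pct` and `rate_after_recovery` are illustrative. The check also suggests that the 38.12 percent missing rate is a share of the 1094 URL citations rather than of all 5197 citations.

```python
# Counts reported in the abstract.
TOTAL_CITATIONS = 5197
URL_CITATIONS = 1094
MISSING_URLS = 417


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals."""
    return round(100 * part / whole, 2)


def rate_after_recovery(recovery_share_pct):
    """Active-URL rate once a given share of the missing URLs is recovered.

    The recovered count is not reported in the abstract; it is inferred
    here by rounding share * MISSING_URLS to the nearest whole URL.
    """
    active = URL_CITATIONS - MISSING_URLS
    recovered = round(MISSING_URLS * recovery_share_pct / 100)
    return pct(active + recovered, URL_CITATIONS)


if __name__ == "__main__":
    print("URL share of citations:", pct(URL_CITATIONS, TOTAL_CITATIONS))
    print("Missing share of URLs:", pct(MISSING_URLS, URL_CITATIONS))
    print("Active share of URLs:", pct(URL_CITATIONS - MISSING_URLS, URL_CITATIONS))
    print("Active rate after Internet Archive recovery:", rate_after_recovery(66.19))
    print("Active rate after Google recovery:", rate_after_recovery(30.70))
```

Under these assumptions the computed values agree with the figures reported in the abstract, which implies roughly 276 URLs recovered through the Internet Archive and 128 through Google.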
Source journal: Malaysian Journal of Library & Information Science (INFORMATION SCIENCE & LIBRARY SCIENCE)
CiteScore: 2.00
Self-citation rate: 7.70%
Articles published: 8