Empirical Study on Web Content Consistency
Chi-Hung Chi, Lin Liu, Choon-Keng Chua
2006 IEEE International Conference on Information Reuse & Integration, published 2006-12-04
DOI: 10.1109/IRI.2006.252412 (https://doi.org/10.1109/IRI.2006.252412)
Citation count: 0
Abstract
This paper presents a detailed analysis of the consistency of current Web content. Both the data and the associated attributes of Web objects served from replicas and CDNs (content delivery networks) are monitored over the Internet, and the correctness and appropriateness of various headers are discussed. Comparing the original copy of the content with the retrieved copy reveals many discrepancies in both the data objects and their attributes. This result is important to content delivery and distribution because incorrect headers can easily lead to wrong decisions in network and content-presentation functions such as caching, content adaptation, and personalization.
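
The comparison described in the abstract, an original copy checked against a retrieved copy, can be illustrated with a minimal sketch. This is not the authors' measurement tool: the function names, the list of headers in `HEADERS_TO_CHECK`, and the use of a SHA-256 digest for the body are all assumptions made for illustration, and the sketch works on headers and bodies that have already been fetched rather than contacting any server.

```python
import hashlib

# Hypothetical selection of headers relevant to caching and presentation;
# the abstract does not say which headers the study examined.
HEADERS_TO_CHECK = (
    "content-type",
    "content-length",
    "last-modified",
    "etag",
    "cache-control",
    "expires",
)


def normalize(headers):
    """Lower-case header names and strip surrounding whitespace from values."""
    return {name.lower(): value.strip() for name, value in headers.items()}


def compare_copies(origin_headers, origin_body, replica_headers, replica_body):
    """Report how a replica copy differs from the origin copy.

    Returns a dict with:
      body_matches: True if both bodies have the same SHA-256 digest
      header_diffs: {header name: (origin value, replica value)} for each
                    checked header whose values differ (None means absent)
    """
    origin = normalize(origin_headers)
    replica = normalize(replica_headers)

    header_diffs = {}
    for name in HEADERS_TO_CHECK:
        origin_value = origin.get(name)
        replica_value = replica.get(name)
        if origin_value != replica_value:
            header_diffs[name] = (origin_value, replica_value)

    body_matches = (
        hashlib.sha256(origin_body).digest()
        == hashlib.sha256(replica_body).digest()
    )
    return {"body_matches": body_matches, "header_diffs": header_diffs}
```

A replica that returns identical bytes but a different `ETag`, for example, would be reported with `body_matches` set to `True` and one entry in `header_diffs`. That is the kind of attribute discrepancy that could mislead a cache even though the data is unchanged.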