Resolution of static clones in dynamic Web pages

Nikita Synytskyy, J. Cordy, T. Dean
Published in: Fifth IEEE International Workshop on Web Site Evolution, 2003. Theme: Architecture. Proceedings.
Publication date: 2003-09-29
DOI: 10.1109/WSE.2003.1234008
URL: https://doi.org/10.1109/WSE.2003.1234008
Citations: 37

Abstract

Cloning is extremely likely to occur in Web sites, much more so than in other software. While some clones exist for valid reasons, or are too small to eliminate, cloning percentages of 30% or higher (not uncommon in Web sites) suggest that some improvements can be made. Finding and resolving the clones in Web documents is rather challenging, however: syntax errors and routine use of multiple languages complicate parsing the documents and finding clones, while the lack of native code reuse tools forces the analyst to rely on other technologies for resolution. Here we present a way to find clones in multilanguage Web documents and resolve them using one of several code reuse techniques that are available in a dynamic Web site. Rather than picking a single resolution technique and relying on it exclusively, we choose the technique based on the clone in question, to minimize disruption to the structure of the original documents.