{"title":"MXML: Implementation of a web-based application for merging XML documents using XML-SIM","authors":"Waraporn Viyanon","doi":"10.1109/ICTKE.2015.7368462","DOIUrl":null,"url":null,"abstract":"This paper presents a design, implementation and evaluation of a web-based application called MergeXML (MXML). MXML was developed to integrate XML documents that are similar in terms of structure and content to complete information which can be used for information retrieval. XML documents are clustered into subtrees representing as instances using leaf-node parents as clustering points. The system finds subtree keys from unique values at leaf-node levels. Subtree keys play an important role for mapping subtrees between two documents. Matched subtrees are merged to complete the information on the base XML document. The result shows that MXML is able to cluster subtrees as proper instances. It can merge additional (different) information to the base XML document. MXML is recommended to run with not too large size of XML documents due to time-out settings on the web server.","PeriodicalId":128925,"journal":{"name":"2015 13th International Conference on ICT and Knowledge Engineering (ICT & Knowledge Engineering 2015)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 13th International Conference on ICT and Knowledge Engineering (ICT & Knowledge Engineering 2015)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTKE.2015.7368462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a design, implementation and evaluation of a web-based application called MergeXML (MXML). MXML was developed to integrate XML documents that are similar in terms of structure and content to complete information which can be used for information retrieval. XML documents are clustered into subtrees representing as instances using leaf-node parents as clustering points. The system finds subtree keys from unique values at leaf-node levels. Subtree keys play an important role for mapping subtrees between two documents. Matched subtrees are merged to complete the information on the base XML document. The result shows that MXML is able to cluster subtrees as proper instances. It can merge additional (different) information to the base XML document. MXML is recommended to run with not too large size of XML documents due to time-out settings on the web server.