{"title":"Multi-resolution indexing for XML data","authors":"Antoine Maghamez, Gongzhu Hu","doi":"10.1109/SERA.2005.52","DOIUrl":null,"url":null,"abstract":"As the Extendible Markup Language (XML) becoming a de facto standard for representing and exchanging data over the Internet, it is critical to be able to retrieve XML data efficiently. One way to achieve this is to use indexing, just like we index data stored in relational databases. In this paper, we present a multi-resolution structural index (MRI) method to facilitate fast retrieval of XML data. The indexing method is based on a new coding scheme that assigns unique numbers to the elements on all possible paths of the tree representing the XML document. The coding scheme is based on the DTD (data type definition) of the XML file. Elements are stored in internal data structures in such a way that they can be directly accessed via the unique coding. The ancestor-descendant relationships among the tree elements are easily identified to help fast process of user's queries. Our experiments show that the MRI indexing approach is effective, both in time and space.","PeriodicalId":424175,"journal":{"name":"Third ACIS Int'l Conference on Software Engineering Research, Management and Applications (SERA'05)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Third ACIS Int'l Conference on Software Engineering Research, Management and Applications (SERA'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERA.2005.52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
As the Extendible Markup Language (XML) becoming a de facto standard for representing and exchanging data over the Internet, it is critical to be able to retrieve XML data efficiently. One way to achieve this is to use indexing, just like we index data stored in relational databases. In this paper, we present a multi-resolution structural index (MRI) method to facilitate fast retrieval of XML data. The indexing method is based on a new coding scheme that assigns unique numbers to the elements on all possible paths of the tree representing the XML document. The coding scheme is based on the DTD (data type definition) of the XML file. Elements are stored in internal data structures in such a way that they can be directly accessed via the unique coding. The ancestor-descendant relationships among the tree elements are easily identified to help fast process of user's queries. Our experiments show that the MRI indexing approach is effective, both in time and space.