Optimizing path expression queries of XML data
Yang Li, P. Yi, Qiyan Li
IEEE International Conference on e-Business Engineering (ICEBE'05), 2005-10-12
DOI: 10.1109/ICEBE.2005.93 (https://doi.org/10.1109/ICEBE.2005.93)
Citations: 8
Abstract
With the advent of XML as a standard for data representation and exchange on the Internet, storing and querying XML data has become increasingly important. This poses a new challenge for indexing and searching XML data, because conventional approaches based on the relational model may not meet the processing requirements of XML data. In this paper, we propose a path index based on Patricia tries, called the PT index. The PT index structure offers several novel features. First, its Patricia-trie-based structure supports fast search. Second, the path indexes are compressed so that they can be stored in memory. Third, because the PT index includes both the structure and the text of the XML data, query results can be obtained from the index without reading the original XML data from disk. We further analyze the time and space complexity of the PT index. Experimental results from our prototype implementation show that the PT index can outperform some representative index approaches, such as DataGuide, B+ tree index, and Index Fabric.
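To illustrate the general idea of a compressed path index, here is a minimal sketch of a Patricia-style (radix) trie keyed by path strings. It is not the PT index from the paper. The abstract does not describe how paths or text are encoded, so the class names `Node` and `PathIndex`, the flattening of each element path into a string such as `/book/title`, and the storage of text values directly in trie nodes are all assumptions made for illustration.

```python
def common_prefix_len(a, b):
    """Length of the longest common prefix of strings a and b."""
    n = 0
    while n < len(a) and n < len(b) and a[n] == b[n]:
        n += 1
    return n


class Node:
    def __init__(self):
        self.edges = {}   # first character of edge label -> (label, child)
        self.values = []  # text values whose path ends exactly at this node


class PathIndex:
    """Hypothetical path index: a radix trie over flattened XML paths."""

    def __init__(self):
        self.root = Node()

    def insert(self, path, text):
        node, key = self.root, path
        while key:
            entry = node.edges.get(key[0])
            if entry is None:
                # No edge shares a prefix with the rest of the key:
                # store the whole remainder on a single compressed edge.
                leaf = Node()
                node.edges[key[0]] = (key, leaf)
                node, key = leaf, ""
                continue
            label, child = entry
            common = common_prefix_len(label, key)
            if common < len(label):
                # The key diverges inside the edge label: split the edge.
                mid = Node()
                mid.edges[label[common]] = (label[common:], child)
                node.edges[key[0]] = (label[:common], mid)
                child = mid
            node, key = child, key[common:]
        node.values.append(text)

    def lookup(self, path):
        node, key = self.root, path
        while key:
            entry = node.edges.get(key[0])
            if entry is None:
                return []
            label, child = entry
            if not key.startswith(label):
                return []
            node, key = child, key[len(label):]
        return list(node.values)


index = PathIndex()
index.insert("/book/title", "XML")
index.insert("/book/author", "Li")
index.insert("/book/title", "Tries")
```

In this sketch, shared path prefixes such as `/book/` are stored once on a single edge, and the text values come straight from the trie nodes, so a lookup such as `index.lookup("/book/title")` does not need to consult the original document.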