Optimizing path expression queries of XML data

Yang Li, P. Yi, Qiyan Li
IEEE International Conference on e-Business Engineering (ICEBE'05), published 2005-10-12. DOI: 10.1109/ICEBE.2005.93 (https://doi.org/10.1109/ICEBE.2005.93)
Citations: 8

Abstract

With the advent of XML as a standard for data representation and exchange on the Internet, storing and querying XML data has become increasingly important. This poses a new challenge for indexing and searching XML data, because conventional approaches based on the relational model may not meet the processing requirements of XML data. In this paper, we propose a path index based on Patricia tries, called the PT index. The PT index structure offers several novel features. First, its Patricia-trie structure supports fast search. Second, the path indexes are compressed so that they can be stored in memory. Third, because the PT index includes both the structure and the text of the XML data, results can be obtained from the index without reading the original XML data from disk. We further analyze the time complexity and space complexity of the PT index. Experimental results from our prototype system implementation show that the PT index can outperform some representative index approaches, such as DataGuide, B+ tree index, and Index Fabric.
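The abstract does not give the PT index layout, so the following is only a minimal sketch of the general idea, not the authors' structure: a Patricia-style (radix) trie keyed by root-to-element paths, with the element text stored at the node where the path ends, so a simple path query is answered from the trie alone. The names `PathTrie`, `Node`, and `index_xml` are hypothetical, and the only compression shown is the merging of single-child chains into one edge label.

```python
import xml.etree.ElementTree as ET


class Node:
    def __init__(self):
        self.edges = {}  # first character of label -> (label, child Node)
        self.hits = []   # text values of elements whose path ends here


class PathTrie:
    """Radix (Patricia-style) trie over path strings such as '/lib/book/title'."""

    def __init__(self):
        self.root = Node()

    def insert(self, key, value):
        node = self.root
        while key:
            edge = node.edges.get(key[0])
            if edge is None:
                # No edge shares a first character: hang the rest on one edge.
                leaf = Node()
                leaf.hits.append(value)
                node.edges[key[0]] = (key, leaf)
                return
            label, child = edge
            # Length of the common prefix of the edge label and the key.
            n = 0
            while n < len(label) and n < len(key) and label[n] == key[n]:
                n += 1
            if n < len(label):
                # Split the edge at the point where label and key diverge.
                mid = Node()
                mid.edges[label[n]] = (label[n:], child)
                node.edges[key[0]] = (label[:n], mid)
                child = mid
            node, key = child, key[n:]
        node.hits.append(value)

    def lookup(self, key):
        node = self.root
        while key:
            edge = node.edges.get(key[0])
            if edge is None or not key.startswith(edge[0]):
                return []
            label, node = edge
            key = key[len(label):]
        return list(node.hits)


def index_xml(xml_text):
    """Index every element that has non-blank text under its full path."""
    trie = PathTrie()

    def walk(elem, prefix):
        path = prefix + "/" + elem.tag
        text = (elem.text or "").strip()
        if text:
            trie.insert(path, text)
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return trie


doc = ("<lib><book><title>XML</title><year>2005</year></book>"
       "<book><title>Tries</title></book></lib>")
trie = index_xml(doc)
print(trie.lookup("/lib/book/title"))  # ['XML', 'Tries']
print(trie.lookup("/lib/book/year"))   # ['2005']
```

Because the shared prefix `/lib/book/` is stored once on a single edge, a lookup walks a few edges rather than one node per character, and the text values come back without touching the source document. A real index would also need branching path expressions, attributes, and a compact in-memory encoding, none of which this sketch attempts.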