A Twig Join Algorithm for a Query with ID References

Dong Li, Lin Zhao, Jing Li
{"title":"A Twig Join Algorithm for a Query with ID References","authors":"Dong Li, Lin Zhao, Jing Li","doi":"10.1109/APSCC.2014.26","DOIUrl":null,"url":null,"abstract":"ID/IDREF feature makes XML document model become graph structure rather than tree structure, while traditional Twig join algorithms can just process simple queries without ID references. Those queries with ID references often involve attribute node or predicates with expressions which do not exist in traditional Twig pattern, so it is necessary to design the Twig join algorithm for the implement of queries involving ID references. There are several typical Twig join algorithms like Twig2Stack, Twig List, Twig Mix. Twig2Stack use over-complicated data structures with large memory overhead. Twig List uses simple lists but lack efficient filtering of useless elements. Twig Mix simply introduces the getNext() function into Twig List to avoid manipulation of useless elements for the ancestor-descendent (AD) relationship in the stack and lists, but it will filter some useful elements when process the queries involving attribute node or predicates within expressions. To this end, we propose a new algorithm, called Twig Expand, which can process the queries involving attribute node or predicates within expressions by avoiding the manipulation of useless elements for both the parent-child (PC) relationship and AD relationship. In addition, Twig Graph is proposed by expanding Twig Expand, which can process the queries involving ID references, and it's much faster than binary structural join proved by experimental study.","PeriodicalId":393593,"journal":{"name":"2014 Asia-Pacific Services Computing Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Asia-Pacific Services Computing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSCC.2014.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

ID/IDREF feature makes XML document model become graph structure rather than tree structure, while traditional Twig join algorithms can just process simple queries without ID references. Those queries with ID references often involve attribute node or predicates with expressions which do not exist in traditional Twig pattern, so it is necessary to design the Twig join algorithm for the implement of queries involving ID references. There are several typical Twig join algorithms like Twig2Stack, Twig List, Twig Mix. Twig2Stack use over-complicated data structures with large memory overhead. Twig List uses simple lists but lack efficient filtering of useless elements. Twig Mix simply introduces the getNext() function into Twig List to avoid manipulation of useless elements for the ancestor-descendent (AD) relationship in the stack and lists, but it will filter some useful elements when process the queries involving attribute node or predicates within expressions. To this end, we propose a new algorithm, called Twig Expand, which can process the queries involving attribute node or predicates within expressions by avoiding the manipulation of useless elements for both the parent-child (PC) relationship and AD relationship. In addition, Twig Graph is proposed by expanding Twig Expand, which can process the queries involving ID references, and it's much faster than binary structural join proved by experimental study.
基于ID引用查询的小枝连接算法
ID/IDREF特性使XML文档模型从树形结构变成了图形结构,而传统的Twig join算法只能处理简单的查询,不需要ID引用。这些带有ID引用的查询往往涉及属性节点或带有表达式的谓词,这在传统的Twig模式中是不存在的,因此有必要设计Twig连接算法来实现包含ID引用的查询。有几种典型的Twig连接算法,如Twig2Stack, Twig List, Twig Mix。Twig2Stack使用了过于复杂的数据结构和大量的内存开销。Twig List使用简单的列表,但缺乏对无用元素的有效过滤。Twig Mix只是在Twig List中引入了getNext()函数,以避免对堆栈和列表中祖先-后代(AD)关系的无用元素进行操作,但是在处理涉及属性节点或表达式中的谓词的查询时,它会过滤一些有用的元素。为此,我们提出了一种名为Twig Expand的新算法,该算法可以处理表达式中涉及属性节点或谓词的查询,避免了对父子(PC)关系和AD关系的无用元素的操作。此外,通过扩展Twig Expand提出了Twig Graph,它可以处理涉及ID引用的查询,并且比实验研究证明的二元结构连接快得多。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信