{"title":"过滤器:一个有效的过滤器,支持XML流中的小枝查询模式","authors":"I-En Liao, Wen-Chiao Hsu, Chia-Fang Li","doi":"10.5220/0004527500750082","DOIUrl":null,"url":null,"abstract":"With the rapid development of the Internet, XML (eXtensible Markup Language) has become the standard data format for representation and exchange on the Internet. In many applications, XML files are transferred in a form of continuous streams. For example, in publishing- subscription systems, data is recorded in an XML format and the conditions of the user's subscription are expressed as queries. Thus, how to filter a continuous stream of XML documents against a large number of queries is an important issue. In this paper, we proposed an efficient filter called EFilter to support twig query patterns in XML streams. Users' queries are recorded in a compressed tree structure called Query Guide and a hash table called QLinkedList. Through a bottom-up search of Query Guide, the XML documents are processed only once as they arrive. Experimental results show that EFilter is more efficient than FiST (Kwon et al., 2005) and SFilter (Nizar, Babu and Kumar, 2009) in terms of filtering speed.","PeriodicalId":194465,"journal":{"name":"DCNET/ICE-B/OPTICS","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"EFilter: An Efficient Filter for Supporting Twig Query Patterns in XML Streams\",\"authors\":\"I-En Liao, Wen-Chiao Hsu, Chia-Fang Li\",\"doi\":\"10.5220/0004527500750082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of the Internet, XML (eXtensible Markup Language) has become the standard data format for representation and exchange on the Internet. In many applications, XML files are transferred in a form of continuous streams. For example, in publishing- subscription systems, data is recorded in an XML format and the conditions of the user's subscription are expressed as queries. Thus, how to filter a continuous stream of XML documents against a large number of queries is an important issue. In this paper, we proposed an efficient filter called EFilter to support twig query patterns in XML streams. Users' queries are recorded in a compressed tree structure called Query Guide and a hash table called QLinkedList. Through a bottom-up search of Query Guide, the XML documents are processed only once as they arrive. Experimental results show that EFilter is more efficient than FiST (Kwon et al., 2005) and SFilter (Nizar, Babu and Kumar, 2009) in terms of filtering speed.\",\"PeriodicalId\":194465,\"journal\":{\"name\":\"DCNET/ICE-B/OPTICS\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DCNET/ICE-B/OPTICS\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5220/0004527500750082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DCNET/ICE-B/OPTICS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0004527500750082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
随着Internet的迅速发展,XML (eXtensible Markup Language,可扩展标记语言)已经成为Internet上表示和交换的标准数据格式。在许多应用程序中,XML文件以连续流的形式传输。例如,在发布-订阅系统中,数据以XML格式记录,用户订阅的条件表示为查询。因此,如何针对大量查询过滤连续的XML文档流是一个重要的问题。在本文中,我们提出了一个名为effilter的高效过滤器来支持XML流中的分支查询模式。用户的查询记录在一个名为Query Guide的压缩树结构和一个名为QLinkedList的哈希表中。通过Query Guide的自底向上搜索,XML文档只在到达时处理一次。实验结果表明,在过滤速度方面,effilter比FiST (Kwon et al., 2005)和SFilter (Nizar, Babu and Kumar, 2009)效率更高。
EFilter: An Efficient Filter for Supporting Twig Query Patterns in XML Streams
With the rapid development of the Internet, XML (eXtensible Markup Language) has become the standard data format for representation and exchange on the Internet. In many applications, XML files are transferred in a form of continuous streams. For example, in publishing- subscription systems, data is recorded in an XML format and the conditions of the user's subscription are expressed as queries. Thus, how to filter a continuous stream of XML documents against a large number of queries is an important issue. In this paper, we proposed an efficient filter called EFilter to support twig query patterns in XML streams. Users' queries are recorded in a compressed tree structure called Query Guide and a hash table called QLinkedList. Through a bottom-up search of Query Guide, the XML documents are processed only once as they arrive. Experimental results show that EFilter is more efficient than FiST (Kwon et al., 2005) and SFilter (Nizar, Babu and Kumar, 2009) in terms of filtering speed.