22nd International Conference on Data Engineering Workshops (ICDEW'06): Latest Papers, Page 7

Category-based Functional Information Modeling for eChronicles

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.38

Pilho Kim, R. Jain

Citations: 0

Clustering Multidimensional Trajectories based on Shape and Velocity

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.39

Y. Yanagisawa, T. Satoh

Citations: 25

Estimating Top N Hosts in Cardinality Using Small Memory Resources

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.56

K. Ishibashi, Tatsuya Mori, R. Kawahara, Yutaka Hirokawa, A. Kobayashi, K. Yamamoto, H. Sakamoto

Abstract: We propose a method to find the N hosts that have the N highest cardinalities, where cardinality is the number of distinct items such as flows, ports, or peer hosts. The method also estimates their cardinalities. While existing algorithms for finding the top N frequent items can be applied directly to find the N hosts that send the N largest numbers of packets in a packet data stream, finding the hosts with the N highest cardinalities requires a table of previously seen items for each host, to check whether an item in an arriving packet is new, which requires a lot of memory. Even if we use existing cardinality estimation methods, we still need to keep cardinality information about each host. In this paper, we use a property of cardinality estimation: the cardinality of the intersection of multiple data sets can be estimated from the cardinality information of each data set. Using this property, we propose an algorithm that does not need to maintain tables for each host, but only for partitioned addresses of a host, and that estimates the cardinality of a host as the intersection of the cardinalities of its partitioned addresses. We also propose a method to find the top N hosts in cardinality, which are to be monitored to detect anomalous behavior in networks. We evaluate our algorithm on actual backbone traffic data. While the estimation accuracy of our scheme degrades for small cardinalities, for the top 100 hosts the accuracy of our algorithm with 4,096 tables is almost the same as that of keeping tables for every host.
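The intersection property mentioned in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it assumes linear-counting bitmaps as the cardinality estimator, a split of a 32-bit host address into its upper and lower 16 bits as the "partitioned addresses", and inclusion-exclusion for the intersection. The names `Bitmap`, `PartitionedCounter`, and the bitmap size `M` are all hypothetical.

```python
import hashlib
import math
from collections import defaultdict

M = 1 << 13  # bits per bitmap; an assumed size, not taken from the paper


def _bit(item: str) -> int:
    """Map an item to a bit position with a deterministic hash."""
    digest = hashlib.sha256(item.encode()).digest()
    return int.from_bytes(digest[:8], "big") % M


class Bitmap:
    """Linear-counting sketch: one bit per hash bucket."""

    def __init__(self, bits: int = 0):
        self.bits = bits

    def add(self, item: str) -> None:
        self.bits |= 1 << _bit(item)

    def union(self, other: "Bitmap") -> "Bitmap":
        # The sketch of a union is the bitwise OR of the two sketches.
        return Bitmap(self.bits | other.bits)

    def estimate(self) -> float:
        zeros = M - bin(self.bits).count("1")
        if zeros == 0:
            return float("inf")  # bitmap saturated, estimate undefined
        return -M * math.log(zeros / M)


def intersection_estimate(a: Bitmap, b: Bitmap) -> float:
    # Inclusion-exclusion: |A ∩ B| = |A| + |B| - |A ∪ B|
    return a.estimate() + b.estimate() - a.union(b).estimate()


class PartitionedCounter:
    """Keeps one bitmap per address partition instead of one per host."""

    def __init__(self):
        self.hi = defaultdict(Bitmap)  # keyed by upper 16 address bits
        self.lo = defaultdict(Bitmap)  # keyed by lower 16 address bits

    def observe(self, host: int, item: str) -> None:
        self.hi[host >> 16].add(item)
        self.lo[host & 0xFFFF].add(item)

    def host_cardinality(self, host: int) -> float:
        return intersection_estimate(self.hi[host >> 16], self.lo[host & 0xFFFF])

    def top_n(self, candidates, n):
        # Rank the given candidate hosts by estimated cardinality.
        return sorted(candidates, key=self.host_cardinality, reverse=True)[:n]
```

In this sketch an item is counted for a host when it appears in both of that host's partition tables, so items seen only by other hosts that share one partition cancel out, while an item seen by neighbors in both partitions would inflate the estimate. How candidate hosts are chosen for `top_n` is left open here.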

Citations: 6

Mining Executive Compensation Data from SEC Filings

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.89

Chengmin Ding, Ping Chen

Citations: 4

Pragmatics and Open Problems for Inter-schema Constraint Theory

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.111

A. Rosenthal, Leonard J. Seligman

Citations: 2

A Self-Organizing Search Engine for RSS Syndicated Web Contents

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.19

Ying Zhou, Xin Chen, Chen Wang

Abstract: The exponentially growing information published on the Web nowadays relies largely on a few major search engines like Google to be brought to the public. This raises issues such as: 1. what percentage of the content shared over the Internet do these search engines cover? 2. how easy is it to find less popular content on the Web through the page ranking systems of these search engines? In fact, the increasing dynamics of the information distributed on the Internet challenge the flexibility of these centralized search engines. As the amount of structured and semi-structured data on the Internet increases, self-organizing search engines that can provide sufficient coverage for data following certain structures become more and more attractive. In this paper, we propose soSpace, a self-organizing search engine for RSS syndicated web data. soSpace is built on structured peer-to-peer technology. It enables indexing and searching of frequently updated web information described by RSS feeds. Our experimental results show that it scales well as the contents increase. The recall and precision of the results are satisfactory as well.
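The abstract does not say how soSpace distributes its index, so the following is only a generic illustration of keyword indexing over a structured peer-to-peer overlay. It assumes a consistent-hashing ring in which each term is owned by one peer; the class `Ring`, its methods, and the peer names are hypothetical and simulate all peers in one process.

```python
import bisect
import hashlib
from collections import defaultdict


def _h(key: str) -> int:
    """Deterministic position of a key on the hash ring."""
    return int.from_bytes(hashlib.sha1(key.encode()).digest()[:8], "big")


class Ring:
    """Toy structured overlay: every term is stored on exactly one peer."""

    def __init__(self, peers):
        self._ring = sorted((_h(p), p) for p in peers)
        self._keys = [k for k, _ in self._ring]
        # Per-peer inverted index: term -> set of feed item URLs.
        self.index = {p: defaultdict(set) for p in peers}

    def owner(self, term: str) -> str:
        # First peer clockwise from the term's position, wrapping around.
        i = bisect.bisect_left(self._keys, _h(term)) % len(self._ring)
        return self._ring[i][1]

    def publish(self, item_url: str, title: str) -> None:
        # Route each distinct title term to the peer that owns it.
        for term in set(title.lower().split()):
            self.index[self.owner(term)][term].add(item_url)

    def search(self, query: str) -> set:
        # Fetch each term's postings from its owner and intersect them.
        result = None
        for term in query.lower().split():
            hits = self.index[self.owner(term)].get(term, set())
            result = set(hits) if result is None else result & hits
        return result or set()
```

For example, after `ring = Ring(["peer-a", "peer-b", "peer-c"])` and a few `ring.publish(url, title)` calls, `ring.search("data engineering")` returns the URLs whose titles contain both terms, whichever peers hold the postings.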

Citations: 10

Semantic Model to Integrate Biological Resources

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.133

Z. Lacroix, L. Raschid, Maria-Esther Vidal

Citations: 16

Quality Estimation of Local Contents Based on PageRank Values of Web Pages

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.121

Y. Kabutoya, T. Yumoto, S. Oyama, Keishi Tajima, Katsumi Tanaka

Citations: 3

Toward a Query Language for Network Attack Data

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.149

Bee-Chung Chen, V. Yegneswaran, P. Barford, R. Ramakrishnan

Abstract: The growing sophistication and diversity of malicious activity on the Internet present a serious challenge for network security analysts. In this paper, we describe our efforts to develop a database and query language for network attack data from firewalls, intrusion detection systems and honeynets. Our first step toward this objective is to develop a prototype database and query interface to identify coordinated scanning activity in network attack data. We have created a set of aggregate views and templatized SQL queries that consider timing, persistence, targeted services, spatial dispersion and temporal dispersion, thereby enabling us to evaluate coordinated scanning along these dimensions. We demonstrate the utility of the interface by conducting a case study on a set of firewall and intrusion detection system logs from Dshield.org. We show that the interface is able to identify general characteristics of coordinated activity as well as instances of unusual activity that would otherwise be difficult to mine from the data. These results highlight the potential for developing a more generalized query language for a broad class of network intrusion data. The case study also exposes some of the challenges we face in extending our system to more generalized queries over potentially vast quantities of data.
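As a rough illustration of what a templatized SQL query over attack logs might look like, the sketch below uses Python's built-in `sqlite3` module with named placeholders. The table `attack_log`, its columns, and the function `find_scanners` are hypothetical; the paper's actual schema, views, and templates are not given in the abstract.

```python
import sqlite3

# Hypothetical schema: one row per firewall or IDS log entry.
SCHEMA = """
CREATE TABLE attack_log (
    ts        INTEGER,  -- event time, seconds since epoch
    src_ip    TEXT,     -- source address
    dst_ip    TEXT,     -- targeted address
    dst_port  INTEGER   -- targeted service port
)
"""

# Template: sources that probed at least :min_targets distinct
# destinations on one port inside the window [:t_start, :t_end).
SCAN_TEMPLATE = """
SELECT src_ip,
       COUNT(DISTINCT dst_ip) AS targets,    -- spatial dispersion
       MAX(ts) - MIN(ts)      AS active_for  -- persistence in seconds
FROM attack_log
WHERE dst_port = :port AND ts >= :t_start AND ts < :t_end
GROUP BY src_ip
HAVING COUNT(DISTINCT dst_ip) >= :min_targets
ORDER BY targets DESC, src_ip
"""


def find_scanners(conn, port, t_start, t_end, min_targets):
    """Fill in the template parameters and run the query."""
    params = {
        "port": port,
        "t_start": t_start,
        "t_end": t_end,
        "min_targets": min_targets,
    }
    return conn.execute(SCAN_TEMPLATE, params).fetchall()


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    # One source sweeps five addresses; another hits a single address twice.
    rows = [(i, "198.51.100.7", f"10.0.0.{i}", 445) for i in range(5)]
    rows += [(1, "203.0.113.9", "10.0.0.1", 445),
             (2, "203.0.113.9", "10.0.0.1", 445)]
    conn.executemany("INSERT INTO attack_log VALUES (?, ?, ?, ?)", rows)
    return find_scanners(conn, port=445, t_start=0, t_end=100, min_targets=3)
```

Here `demo()` reports only the sweeping source. Binding values through placeholders rather than string formatting keeps the template reusable across ports and time windows and avoids SQL injection from log-derived values.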

Citations: 14

Towards a Quality Model for Effective Data Selection in Collaboratories

22nd International Conference on Data Engineering Workshops (ICDEW'06) Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.150

Yogesh L. Simmhan, Beth Plale, Dennis Gannon

Citations: 31