2012 Seventh ChinaGrid Annual Conference最新文献_第2页

Extracting Domain-Relevant Term Using Wikipedia Based on Random Walk Model 基于随机游走模型的维基百科领域相关术语提取

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/CHINAGRID.2012.20

Wenjuan Wu, Tao Liu, H. Hu, Xiaoyong Du

引用次数: 5

Inverted Grid-Based kNN Query Processing with MapReduce 基于倒网格的kNN查询处理与MapReduce

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.19

Changqing Ji, Tingting Dong, Yu Li, Yanming Shen, Keqiu Li, Wenming Qiu, W. Qu, M. Guo

{"title":"Inverted Grid-Based kNN Query Processing with MapReduce","authors":"Changqing Ji, Tingting Dong, Yu Li, Yanming Shen, Keqiu Li, Wenming Qiu, W. Qu, M. Guo","doi":"10.1109/ChinaGrid.2012.19","DOIUrl":"https://doi.org/10.1109/ChinaGrid.2012.19","url":null,"abstract":"With the increasing availability of LBS (Location Based Services) and mobile internet, the amount of spatial data is growing larger and larger. It poses new requirements and challenges towards cloud environments, such as how to accomplish efficient index and query processing on large scale spatial data. A scalable and distributed spatial data index is a best choice for the effective processing of the spatial data analysis and query. There are several approaches that implement distributed indices and query processing with MapReduce, such as R-tree and Voronoi-based index. However, R-tree is unsuitable for parallelization and query processing on Voronoi-based index needs extra computation for localization or local index reconstruction. The regularity of grid partition is much easier to scale and parallel comparing with the above two approaches. Inverted Index utilizes limited index entries to index unlimited data points. In this paper, we propose a new distributed spatial data index: Inverted Grid Index, which is a combination of inverted index and grid partition. Our index structure is more simple and suitable for large-scale parallel spatial query application. We present MapReduce-based approaches that both construct Inverted Grid Index and process kNN query over large spatial data sets. Extensive experiments have been done to evaluate the scalability and the performance of kNN query processing on our index structure. The results demonstrate the efficiency and scalability of our kNN query algorithm based on Inverted Grid Index.","PeriodicalId":371382,"journal":{"name":"2012 Seventh ChinaGrid Annual Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125934458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 38

A Heterogeneity-aware Data Distribution and Rebalance Method in Hadoop Cluster Hadoop集群中异构感知的数据分布与再平衡方法

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.22

Yuanquan Fan, Weiguo Wu, Haijun Cao, Huo Zhu, Xu Zhao, Wei Wei

引用次数: 21

EMA: Turning Multiple Address Spaces Transparent to CUDA Programming EMA:将多个地址空间透明到CUDA编程

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.23

Kun Tang, Yulong Yu, Yuxin Wang, Yong Zhou, He Guo

引用次数: 3

Improving the Effective IO Throughput by Adaptive Read-Ahead Strategy for Private Cloud Storage Service 利用自适应预读策略提高私有云存储服务的有效IO吞吐量

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.9

Qiuping Wang, Kang Chen, Yongwei Wu, Weimin Zheng

引用次数: 3

Security SLAs for IMS-based Cloud Services 基于ims的云服务的安全sla

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/CHINAGRID.2012.14

Guo Zhien, Dai Yi-qi

引用次数: 3

Improving the System Capacity by Client Cooperation in Distributed File Service 分布式文件服务中客户端协作提高系统容量

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.11

Gang Huang, Kang Chen, Yongwei Wu, Weimin Zheng, Q. Yue

引用次数: 1

A Parameter Dynamic-Tuning Scheduling Algorithm Based on History in Heterogeneous Environments 异构环境下基于历史的参数动态调优调度算法

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.24

Xu Zhao, Xiaoshe Dong, Haijun Cao, Yuanquan Fan, Huo Zhu

引用次数: 5

WaxElephant: A Realistic Hadoop Simulator for Parameters Tuning and Scalability Analysis WaxElephant:一个现实的Hadoop模拟器，用于参数调优和可伸缩性分析

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.25

Zujie Ren, Zhijun Liu, Xianghua Xu, Jian Wan, Weisong Shi, Min Zhou

引用次数: 10

A Heuristic Algorithm for Scheduling on Grid Computing Environment 网格计算环境下的启发式调度算法

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.13

Jing Wang, Gongqing Wu, Bin Zhang, Xuegang Hu

{"title":"A Heuristic Algorithm for Scheduling on Grid Computing Environment","authors":"Jing Wang, Gongqing Wu, Bin Zhang, Xuegang Hu","doi":"10.1109/ChinaGrid.2012.13","DOIUrl":"https://doi.org/10.1109/ChinaGrid.2012.13","url":null,"abstract":"With the conglomeration of large-scale heterogeneous systems, the grid computing environment makes the whole network into a powerful and reliable resource available nearly everywhere. Resource scheduling is a fundamental issue in grid computing. For this NP-hard problem, we take into account of the geographic distribution of resources and the requirement of job entity in the scheduling algorithm. To do so, we first consider the parameters of job entity and resource entity. Then the key characteristics as release time, processing time and delivery time determine the rules about the scheduling. We present HF (Harder First) strategy and DF (Larger Distance First) strategy. Let the H value denotes the sum of release time, length and delivery time of the job, the job with a higher H value is considered to be harder and should be assigned to a faster resource according to the HF strategy. Secondly, when the number of jobs is larger than the number of resources, the DF strategy makes sure that the job with a higher difference (distance) between the delivery time and the release time should be processed first. Based on the stated strategies, we provide a heuristic algorithm HFFP (Harder First Faster Prior) for resource scheduling on the grid computing environment. The experiment data of jobs scale from 10k to 80k, while the number of resources ranges from 2 to 6. The algorithm performance is demonstrated by simulation on the platform of GridSim. Our experiment results show that the algorithm HFFP can minimize the completion time of jobs especially when the number of jobs is much larger than the number of resources. By comparing our algorithm with classical scheduling algorithm as Min-min algorithm, we can see that our algorithm can assign the jobs to the resources reasonably from the criteria of make span. To better compare the performance of our algorithm with Max-min, we do some medication to the traditional Max-min algorithm and presents Max-min-L (Max-min-Local). Max-min-L chooses the local maximization instead of overall maximization, suitable for jobs with similar length. By comparing experiments with Max-min-L and Min-min, we can still get that our algorithm is better than Min-min and Max-min-L by the metrics of make span.","PeriodicalId":371382,"journal":{"name":"2012 Seventh ChinaGrid Annual Conference","volume":"285 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123727858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10