2012 Seventh ChinaGrid Annual Conference: Latest Papers

A Hadoop-based Massive Molecular Data Storage Solution for Virtual Screening

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.26

Yan Zhang, Ruisheng Zhang, Qiuqiang Chen, Xiaopan Gao, Rongjing Hu, Y. Zhang, Guangcai Liu

Abstract: Virtual screening involves massive computing tasks in which millions of molecules are docked against a target protein. Such data-intensive science faces the challenge of managing datasets of tens of terabytes, which creates a need for large-scale storage. Efficient query and transmission of these datasets are further key requirements during the virtual screening process. A massive data storage solution is therefore expected to improve the efficiency of storing and accessing large numbers of molecules and their docking results, and to facilitate the data preparation and analysis phases of virtual screening. To address these requirements, we propose a novel Hadoop-based storage solution for virtual screening. HBase serves as a distributed database that persists the properties of the molecules and the docking results, and HDFS stores the molecule source files. A comparison of system performance is also presented. We conclude that the proposed solution can be considered an alternative for efficient storage of and access to large-scale molecule and docking data in virtual screening research.

Citations: 6

Integrating Heterogeneous Grid Middleware to Support Large-Scale Bag-of-Tasks Applications

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.29

Zhili Zhao, Lian Li, Ruisheng Zhang, A. Paschke, Jiazao Lin

Citations: 0

A Dispatching-Rule-Based Task Scheduling Policy for MapReduce with Multi-type Jobs in Heterogeneous Environments

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.27

Xiang Gao, Qinghua Chen, Yurong Chen, Qingwei Sun, Yan Liu, Mingzhu Li

Abstract: MapReduce has emerged as an important and widely used programming model for distributed and parallel computing because of its ease of use, generality, and scalability. The model was proposed mainly for large-scale data processing, i.e. data-intensive jobs, and it is optimized for homogeneous environments in which computing nodes are identical and dedicated. Today's enterprise IT systems preserve massive historical management and operational data, which require both data-intensive and computation-intensive analysis on heterogeneous computing resources. To support enterprise data analysis applications with the MapReduce model, it is important to improve MapReduce's task scheduling algorithm so that it reduces the overall completion time for multi-type jobs in heterogeneous environments. This paper formulates the scheduling problem as an optimization problem. Based on job shop scheduling theory and existing approximation algorithms, we propose a new dispatching-rule-based online scheduling policy, LPT-θ. Under LPT-θ, tasks with larger processing times that fall within a θ-space are assigned higher priorities. Numerical results show that LPT-θ achieves a 12% to 45% performance gain over the original scheduling algorithm in MapReduce.
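The abstract above does not define the θ-space, so the following is only a minimal sketch of the classic longest-processing-time-first (LPT) dispatching rule on nodes of different speeds, which the name LPT-θ suggests as a starting point. It is not the authors' policy. The function name, the task sizes, and the node speeds are all hypothetical and chosen for illustration.

```python
from typing import Dict, List, Tuple


def lpt_schedule(
    task_sizes: Dict[str, float], node_speeds: List[float]
) -> Tuple[float, Dict[str, int]]:
    """Greedy LPT list scheduling on heterogeneous nodes (illustrative only).

    task_sizes:  hypothetical amount of work per task (work units).
    node_speeds: hypothetical speed of each node (work units per second).
    Returns the makespan and a mapping from task name to node index.
    """
    finish = [0.0] * len(node_speeds)  # time at which each node becomes idle
    assignment: Dict[str, int] = {}

    # Dispatching rule: consider the largest task first (ties broken by name).
    ordered = sorted(task_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for name, size in ordered:
        # Place the task on the node that would complete it earliest.
        best = min(
            range(len(node_speeds)),
            key=lambda i: (finish[i] + size / node_speeds[i], i),
        )
        finish[best] += size / node_speeds[best]
        assignment[name] = best

    return max(finish), assignment


if __name__ == "__main__":
    makespan, plan = lpt_schedule({"a": 8, "b": 6, "c": 4, "d": 2}, [2.0, 1.0])
    print(makespan, plan)
```

Sorting by descending size keeps large tasks from being scheduled last, where they would dominate the completion time. Any θ-based restriction on which tasks are eligible for priority would have to be added on top of this according to the paper's own definition.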

Citations: 9

Construct SaaS Applications from Multi-abstract-level: Method and System

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.12

Lei Wu, Ying Pan, Shijun Liu, Qian Li

Citations: 1

EasyDeploy: Automatic Application Deployment in Virtual Clusters

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.28

Tao Gao, Yanjun Xu, Xiaoying Wang, Jinlei Jiang, Yongwei Wu

Abstract: With the fast development of cloud computing, using virtual clusters for scientific and business work has become a trend. Nevertheless, setting up a virtual cluster that meets user-specific requirements, such as the applications to be used, remains a big challenge. In this paper we design and implement EasyDeploy, a system that automatically sets up virtual clusters with user-specified applications in a cloud computing environment. EasyDeploy implements its own automatic application deployment method for virtual clusters without relying on external tools built for traditional clusters. It decouples application packages from virtual machine images to save storage space. To reduce application package transfer time, it provides a cache and a prefetching mechanism. Experimental results show that, in our settings, an eighteen-node virtual cluster with a Hadoop environment can be created in less than 50 seconds. The cache and prefetching mechanisms do reduce the transfer time of application packages: when both are used to create a virtual cluster, the transfer time is reduced by a factor of three compared with the case without any optimization.

Citations: 1

Load Balancing Routing for Wireless Sensor Network in 2D Mesh

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.16

Yipiao Chen, Yi Yang, Yubo Deng, Lian Li

Citations: 0

Translating Chemical Scripting Languages to Unified Job-Description Language on Chemical-Grid

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.30

Min Zhang, Ruisheng Zhang, Jiajun Xie, Shuping Li, Rongjing Hu, Jingfei Hou, Shuyi Zhang

Citations: 0

Ontology Based Data Conversion from Spreadsheet to OWL

2012 Seventh ChinaGrid Annual Conference Pub Date : 2012-09-20 DOI: 10.1109/ChinaGrid.2012.17

Xiaohui Zhang, Ruihua Di, Xiaochen Feng

Citations: 3