2013 IEEE 9th International Conference on e-Science: Latest Papers, Page 2

Mining Common Spatial-Temporal Periodic Patterns of Animal Movement

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.11

Yuwei Wang, Ze Luo, Gang Qin, Yuanchun Zhou, Danhuai Guo, Baoping Yan

Abstract: Advanced satellite tracking technologies enable biologists to track animal movements at finer spatial and temporal scales. The resulting long-term movement data are valuable for understanding animal activities. Periodic pattern analysis can provide an insightful approach to revealing animal activity patterns. However, individual GPS data are usually incomplete and cover a limited lifespan. In addition, individual periodic behaviors are inherently complicated, with many uncertainties. In this paper, we address the problem of mining periodic patterns of animal movements by combining multiple individuals with similar periodicities. We formally define the problem of mining common periodicity and propose a novel periodicity measure. We introduce information entropy into the proposed measure to detect the common period. Data incompleteness, noise, and ambiguity of individual periodicity are considered in our method. Furthermore, we mine multiple common periodic patterns by grouping periodic segments with respect to the detected period, and provide a visualization method for common periodic patterns by designing a cyclical filled line chart. To assess the effectiveness of our proposed method, we provide an experimental study using a real GPS dataset collected on 29 birds in Qinghai Lake, China.

Citations: 7
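
The abstract of the animal movement paper mentions using information entropy to detect a common period but does not give the measure itself. The following is only a generic sketch of the idea, not the authors' method: it assumes each animal's track has already been reduced to a sequence of discrete location labels, folds that sequence by a candidate period, and averages the Shannon entropy observed at each phase. The names `folded_entropy` and `best_period` are hypothetical.

```python
import math
from collections import Counter


def folded_entropy(seq, period):
    """Mean Shannon entropy (bits) of the labels seen at each phase
    when `seq` is folded by `period`. Lower means more regular."""
    total = 0.0
    for phase in range(period):
        counts = Counter(seq[phase::period])
        n = sum(counts.values())
        if n == 0:
            continue  # no observations at this phase
        total += -sum(c / n * math.log2(c / n) for c in counts.values())
    return total / period


def best_period(seq, candidates):
    """Return the candidate period with the lowest folded entropy."""
    return min(candidates, key=lambda p: folded_entropy(seq, p))


# Toy sequence (assumed data): the animal cycles through sites A, B, C.
track = list("ABC" * 4)
print(best_period(track, range(2, 6)))
```

In this toy case the period 3 gives zero entropy at every phase. On real data, long candidate periods leave few samples per phase and can look artificially regular, so a practical measure would need some correction for that, as well as for gaps and noise.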

Sharing Australia's Nationally Significant Terrestrial Ecosystem Data: A Collaboration between TERN and ANDS

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/ESCIENCE.2013.28

S. Guru, Xiaobin Shen, C. Love, A. Treloar, S. Phinn, Ross Wilkinson, Cathrine Brady, P. Isaac, T. Clancy

Citations: 7

An Algorithm for Cost-Effectively Storing Scientific Datasets with Multiple Service Providers in the Cloud

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.34

Dong Yuan, X. Liu, Li-zhen Cui, Tiantian Zhang, Wenhao Li, Dahai Cao, Yun Yang

Abstract: The proliferation of cloud computing allows scientists to deploy computation- and data-intensive applications without infrastructure investment, where large generated datasets can be flexibly stored with multiple cloud service providers. Due to the pay-as-you-go model, the total application cost largely depends on the usage of computation, storage and bandwidth resources, and cutting the cost of cloud-based data storage becomes a major concern when deploying scientific applications in the cloud. In this paper, we propose a novel algorithm that can automatically decide whether a generated dataset should be 1) stored in the current cloud, 2) deleted and re-generated whenever reused, or 3) transferred to a cheaper cloud service for storage. The algorithm finds the trade-off among computation, storage and bandwidth costs in the cloud, which are three key factors for the cost of storing generated application datasets with multiple cloud service providers. Simulations conducted with popular cloud service providers' pricing models show that the proposed algorithm is highly cost-effective for use in the cloud.

Citations: 17
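
The three-way decision described in the cloud storage abstract (store, delete and re-generate, or transfer) can be illustrated with a simple per-dataset cost comparison. This is a minimal sketch under assumed prices and a deliberately naive cost model, not the algorithm from the paper: every field name, price and the monthly accounting are hypothetical, and dependencies between datasets are ignored.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    size_gb: float         # size of the generated dataset
    regen_cost: float      # computation cost to regenerate it once
    uses_per_month: float  # how often the dataset is reused


def monthly_costs(ds, current_price, cheaper_price, transfer_price):
    """Estimated monthly cost of each option.

    current_price / cheaper_price: storage price per GB-month (assumed).
    transfer_price: bandwidth price per GB moved on each reuse (assumed).
    """
    return {
        # Keep the data where it was generated: storage cost only.
        "store": ds.size_gb * current_price,
        # Delete it and pay computation again on every reuse.
        "regenerate": ds.regen_cost * ds.uses_per_month,
        # Store it with a cheaper provider and pay bandwidth per reuse.
        "transfer": ds.size_gb * cheaper_price
        + ds.size_gb * transfer_price * ds.uses_per_month,
    }


def cheapest_option(ds, current_price, cheaper_price, transfer_price):
    costs = monthly_costs(ds, current_price, cheaper_price, transfer_price)
    return min(costs, key=costs.get)


rarely_used = Dataset(size_gb=100, regen_cost=5.0, uses_per_month=0.1)
print(cheapest_option(rarely_used, 0.10, 0.03, 0.05))
```

With these made-up numbers a rarely reused dataset is cheapest to regenerate on demand, while raising `uses_per_month` shifts the balance toward keeping it stored.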

Decentralized Prioritization-Based Management Systems for Distributed Computing

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.44

Per-Olov Östberg, E. Elmroth

Citations: 4

A Geographical Approach for Metadata Quality Improvement in Biological Observation Databases

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.14

D. C. Cugler, C. B. Medeiros, S. Shekhar, L. F. Toledo

Abstract: This paper addresses the problem of improving the quality of metadata in biological observation databases, in particular those associated with observations of living beings, which are often used as a starting point for biodiversity analyses. Poor-quality metadata lead to incorrect scientific conclusions and can mislead experts. Thus, it is important to design and develop methods to detect and correct metadata quality problems. This is a challenging problem because of the variety of issues concerning such metadata, e.g., misnaming of species, and uncertainty and imprecision about the locations where observations were recorded. Related work is limited because it does not adequately model such issues. We propose a geographic approach based on expert-led classification of place and/or range mismatch anomalies detected by our algorithms. Our approach enables detection of anomalies both in species' reported geographic distributions and in species' identification. Our main contribution is our geographic algorithm that deals with uncertain/imprecise locations. Our work is tested using a case study with the Fonoteca Neotropical Jacques Vielliard, one of the 10 largest animal sound collections in the world.

Citations: 5

Biocharts: Unifying Biological Hypotheses with Models and Experiments

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.41

H. Kugler

Citations: 2

Protein Structure Modeling in a Grid Computing Environment

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.15

Daniel Li, B. Tsui, Charles Xue, J. Haga, Koheix Ichikawa, S. Date

Abstract: Advances in sequencing technology have resulted in an exponential increase in the availability of protein sequence information. In order to fully utilize this information, it is important to translate the primary sequences into high-resolution tertiary protein structures. MODELLER is a leading homology modeling method that produces high-quality protein structures. In this study, the function of MODELLER was expanded by configuring and deploying it on a parallel grid computing platform using a custom four-step workflow. The workflow consisted of template selection through a protein BLAST algorithm, target-template protein sequence alignment, distribution of model generation jobs among the compute clusters, and final protein model optimization. To test the validity of this workflow, we used the Dual Specificity Phosphatase (DSP) protein family, whose members share high homology. Comparison of the DSP member SSH-2 with its model counterpart revealed a minimal 1.3% difference in output energy scores. Furthermore, the Dali Pairwise Comparison Program demonstrated a 98% match among amino acid features and a Z-score of 26.6, indicating very significant similarities between the model and the actual protein structure. After confirming the accuracy of our workflow, we generated 23 previously unknown DSP family protein structure models. Over 40,000 models were generated 30 times faster than with conventional computing. Virtual receptor-ligand screening results of modeled protein DSP21 were compared with two known structures that had either higher or lower structural homology to DSP21. There was a significant difference (p<0.001) between the average ligand ranking discrepancy of a more homologous protein pair and a less homologous protein pair, suggesting that the protein models generated were sufficiently accurate for virtual screening. These results demonstrate the accuracy and usability of a grid-enabled MODELLER program and the increased efficiency of processing protein structure models. This workflow will help increase the speed of future drug development pipelines.

Citations: 0

Parallelizing Astronomical Source Extraction on the GPU

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.10

B. Zhao, Qiong Luo, Chao Wu

Abstract: In astronomical observatory projects, raw images are processed so that information about the celestial objects in the images is extracted into catalogs. As such, this source extraction is the basis for the various analysis tasks that are subsequently performed on the catalog products. With the rapid progress of new, large astronomical projects, observational images will be produced every few seconds. This high speed of image production requires fast source extraction. Unfortunately, current source extraction tools cannot meet the speed requirement. To address this problem, we propose to use the GPU (Graphics Processing Unit) to accelerate source extraction. Specifically, we start from SExtractor, an astronomical source extraction tool widely used in astronomy projects, and study its parallelization on the GPU. We identify the object detection and deblending components as the most complex and time-consuming, and design a parallel connected component labelling algorithm for detection and a parallel object tree pruning method for deblending, respectively, on the GPU. We further parallelize other components, including cleaning, background subtraction, and measurement, effectively on the GPU, such that the entire source extraction is done on the GPU. We have evaluated our GPU-SExtractor in comparison with the original SExtractor on a desktop with an Intel i7 CPU and an NVIDIA GTX670 GPU on a set of real-world and synthetic astronomical images of different sizes. Our results show that the GPU-SExtractor outperforms the original SExtractor by a factor of 6, taking merely 1.9 seconds to process a typical 4K×4K image containing 167 thousand objects.

Citations: 2

Dependency Provenance in Agent Based Modeling

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.39

Peng Chen, Beth Plale, Tom Evans

Citations: 11

Balanced Task Clustering in Scientific Workflows

2013 IEEE 9th International Conference on e-Science Pub Date : 2013-10-22 DOI: 10.1109/eScience.2013.40

Weiwei Chen, Rafael Ferreira da Silva, E. Deelman, R. Sakellariou

Abstract: Scientific workflows can be composed of many tasks of fine computational granularity. The runtime of these tasks may be shorter than the duration of system overheads, for example, when using multiple resources of a cloud infrastructure. Task clustering is a runtime optimization technique that merges multiple short tasks into a single job such that the scheduling overhead is reduced and the overall runtime performance is improved. However, existing task clustering strategies only provide a coarse-grained approach that relies on an over-simplified workflow model. In our work, we examine the reasons that cause Runtime Imbalance and Dependency Imbalance in task clustering. Next, we propose quantitative metrics to evaluate the severity of the two imbalance problems respectively. Furthermore, we propose a series of task balancing methods to address these imbalance problems. Finally, we analyze the relationship between these metrics and the performance of the task balancing methods. A trace-based simulation shows our methods can significantly improve the runtime performance of two widely used workflows compared to the actual implementation of task clustering.

Citations: 53
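
To make the notion of runtime-balanced task clustering in the last abstract concrete, here is a small sketch that merges short tasks into a fixed number of jobs while keeping total job runtimes close. It uses the generic longest-processing-time-first greedy heuristic, which is a stand-in chosen for illustration and not one of the balancing methods proposed in the paper. It also ignores task dependencies, so it says nothing about Dependency Imbalance. The function name `balanced_clusters` is hypothetical.

```python
import heapq


def balanced_clusters(runtimes, num_jobs):
    """Group task indices into `num_jobs` clusters with similar total runtime.

    Greedy heuristic: take tasks from longest to shortest and always add
    the next task to the cluster with the smallest accumulated runtime.
    """
    heap = [(0.0, j) for j in range(num_jobs)]  # (accumulated runtime, cluster id)
    clusters = [[] for _ in range(num_jobs)]
    order = sorted(range(len(runtimes)), key=lambda i: -runtimes[i])
    for idx in order:
        load, j = heapq.heappop(heap)
        clusters[j].append(idx)
        heapq.heappush(heap, (load + runtimes[idx], j))
    return clusters


# Assumed task runtimes in seconds for one level of a workflow.
runtimes = [5, 3, 3, 2, 2, 1]
clusters = balanced_clusters(runtimes, num_jobs=2)
loads = [sum(runtimes[i] for i in c) for c in clusters]
print(clusters, loads)
```

For this toy input both jobs end up with a total runtime of 8 seconds, so neither job delays the other once scheduling overhead is paid once per job instead of once per task.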