2017 IEEE 13th International Conference on e-Science (e-Science)最新文献

筛选
英文 中文
Massive OceanColor Data Processing and Analysis System: TuPiX-OC 海量OceanColor数据处理与分析系统:TuPiX-OC
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.66
Jung-Ho Um, Sunggeun Han, Hyunwoo Kim, Kyongseok Park
{"title":"Massive OceanColor Data Processing and Analysis System: TuPiX-OC","authors":"Jung-Ho Um, Sunggeun Han, Hyunwoo Kim, Kyongseok Park","doi":"10.1109/eScience.2017.66","DOIUrl":"https://doi.org/10.1109/eScience.2017.66","url":null,"abstract":"Satellite image data generated from remote sensors around the world have different resolutions and are processed at varying levels from Level 0 to Level 3, with each level containing vast amounts of information. Due to the problem of data size, many ocean science researchers use L3 images, which have a spatial resolution of 4 km or 9 km. However, in order to overcome problems such as red tides or to analyze the marine ecosystem based on ocean color satellite research, researchers must generate data by changing the parameters of images at various levels. There is also a need for immediate access to satellite image information using analytical and visualization tools. Considering those requirements, TuPiX-OC (Turning PiXels into knowledge and science-OceanColor) provides an environment to design and execute algorithms for data processing and analysis of satellite image data by data type. TuPix-OC, which has a distributed architecture, is an analytical platform that supports data import, level conversion, DB integration, analysis and processing, and visualization. TuPiX-OC stores satellite data in a massive storage device, and provides an online platform for satellite data conversion/analysis/ visualization. For satellite data processing, TuPiX-OC converts NASA-provided binary files into files that can be analyzed by users. Moreover, TuPiX-OC includes various algorithms for satellite data selection and utilization of satellite images. Preliminary Experiments of TuPiX-OC's satellite image data processing capability showed that it was able to process 35 times as many images as the open source software SeaDAS.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133150808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Review of Privacy and Consent Management in Healthcare: A Focus on Emerging Data Sources 医疗保健中的隐私和同意管理综述:关注新兴数据源
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.84
M. R. Asghar, T. Lee, M. Baig, E. Ullah, G. Russello, G. Dobbie
{"title":"A Review of Privacy and Consent Management in Healthcare: A Focus on Emerging Data Sources","authors":"M. R. Asghar, T. Lee, M. Baig, E. Ullah, G. Russello, G. Dobbie","doi":"10.1109/eScience.2017.84","DOIUrl":"https://doi.org/10.1109/eScience.2017.84","url":null,"abstract":"The emergence of New Data Sources (NDS) in healthcare is revolutionising traditional electronic health records in terms of data availability, storage, and access. Increasingly, clinicians are using NDS to build a virtual holistic image of a patients health condition. This research is focused on a review and analysis of the current legislation and privacy rules available for healthcare professionals. NDS in this project refers to and includes patient-generated health data, consumer device data, wearable health and fitness data, and data from social media. This project reviewed legal and regulatory requirements for New Zealand, Australia, the European Union, and the United States to establish the ground reality of existing mechanisms in place concerning the use of NDS. The outcome of our research is to recommend changes and enhancements required to better prepare for the ’tsunami’ of NDS and applications in the currently evolving data-driven healthcare area and precision or personalised health initiatives such as Precision Driven Health (PDH) in New Zealand.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"122 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122545792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
A Novel Data Mining Testbed for User Centred Modelling and Personalisation of Digital Library Services 一种以用户为中心的数字图书馆服务建模与个性化的新型数据挖掘试验台
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.58
M. Almaghrabi, G. Chetty
{"title":"A Novel Data Mining Testbed for User Centred Modelling and Personalisation of Digital Library Services","authors":"M. Almaghrabi, G. Chetty","doi":"10.1109/eScience.2017.58","DOIUrl":"https://doi.org/10.1109/eScience.2017.58","url":null,"abstract":"Digital libraries can provide information services for users with diverse needs. Due to a large amount of data that exists in digital library systems, including text and multimedia resources, with different cohorts of users, and the challenges with existing digital library systems in terms of maintaining privacy and confidentiality, it is very difficult to provide personalised library services and improved user experience. However, novel data mining algorithms based on automatic user segmentation and borrowing behaviour modelling can leverage the relationship between users and borrowing records, to improve the library services. In this paper, we present an automatic approach for personalising the resources by segmenting the users and their preferences, based on a data mining strategy, involving, the classification based on Naïve Bayes, J48 and K-Nearest Neighbours Classification (K-NN) and using open source technology tools for evaluating the personalisation and improved user experience with digital library services.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128669879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Online Decision-Making Using Edge Resources for Content-Driven Stream Processing 使用边缘资源进行内容驱动流处理的在线决策
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.52
E. G. Renart, Daniel Balouek-Thomert, Xuan Hu, J. Gong, M. Parashar
{"title":"Online Decision-Making Using Edge Resources for Content-Driven Stream Processing","authors":"E. G. Renart, Daniel Balouek-Thomert, Xuan Hu, J. Gong, M. Parashar","doi":"10.1109/eScience.2017.52","DOIUrl":"https://doi.org/10.1109/eScience.2017.52","url":null,"abstract":"The Internet of Things (IoT) describes the emerging paradigm that connects sensors, often located at the edge of the network, to stream processing engines located at the core of the network to enable online data-driven monitoring, management, and control. As IoT applications require increasing volumes of streaming data to be processed by complex workflows in a timely manner, it is becoming important to also leverage resources closer to the edge. Furthermore, the topology of these workflows and where theyare executed is determined not only by application objectives and available resources, but also by the content of the data streams, however, current stream processing engines do not provide this flexibility. In this paper, we present a programming framework that enables applications to specify data-driven, location- and resource-aware processing of data streams. Specifically, it provides abstractions for specifying where and how a data stream is processed based on its content, spatial and temporal characteristics. We also present an implementation of the framework using an event-driven runtime, where events are associatively described. Finally, we demonstrate the effectiveness of the solution by an evaluation of scalability and performance using a disaster response application usecase.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116385257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Adapting Enterprise Architecture for eScience 为eScience调整企业架构
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.55
Richard Palmer, Kheeran Dharmawardena, H. Holewa
{"title":"Adapting Enterprise Architecture for eScience","authors":"Richard Palmer, Kheeran Dharmawardena, H. Holewa","doi":"10.1109/eScience.2017.55","DOIUrl":"https://doi.org/10.1109/eScience.2017.55","url":null,"abstract":"Enterprise Architecture thinking and techniques, like disciplined project management, translate well into the eScience domain - both in decomposing the problem to be solved and maintaining an optimal solution path. This holds true across the many perspectives that constitute eScience: discipline leaders, governments, institutions, research groups and investigators. We apply this thinking to the emerging EcoCloud which is representative of the complex, national, multi-stakeholder and collaborative e-science infrastructure projects currently being implemented within Australia.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117095165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mass Digitization of Individual Pinned Insects Using Conveyor-Driven Imaging 利用传送带驱动成像技术对单个被钉昆虫进行大规模数字化
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.85
Riitta Tegelberg, J. Kahanpaa, Janne Karppinen, T. Mononen, Zhe Wu, H. Saarenmaa
{"title":"Mass Digitization of Individual Pinned Insects Using Conveyor-Driven Imaging","authors":"Riitta Tegelberg, J. Kahanpaa, Janne Karppinen, T. Mononen, Zhe Wu, H. Saarenmaa","doi":"10.1109/eScience.2017.85","DOIUrl":"https://doi.org/10.1109/eScience.2017.85","url":null,"abstract":"Natural history museums of the world house hundreds of millions of insect specimens. Digitization of individual, pinned specimens is a challenging task, especially if the pinned data labels are expected to be imaged as well. Two automatic digitization lines for imaging of small objects were designed and built by the Digitarium team. Lines were used to test digitization efficiency of several subgroups of pinned insects: butterflies, beetles, flies and bees. Results showed that speed of digitization varied between 41-125 specimens an hour, depending on the number of operators, pre-treatments needed, and the number of cameras used. The amount of transcription work during the process also affected the pace of work by the operators. After testing it was calculated that basic in-house costs of automatized digitization of a specimen were around 0.40 - 0.60 €, while commercial prices varied between 1.20 - 1.40 €.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"204 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126128171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Developing a Volunteer Computing Project to Evolve Convolutional Neural Networks and Their Hyperparameters 开发一个志愿者计算项目来进化卷积神经网络及其超参数
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.14
Travis Desell
{"title":"Developing a Volunteer Computing Project to Evolve Convolutional Neural Networks and Their Hyperparameters","authors":"Travis Desell","doi":"10.1109/eScience.2017.14","DOIUrl":"https://doi.org/10.1109/eScience.2017.14","url":null,"abstract":"This work presents improvements to a neuroevolution algorithm called Evolutionary eXploration of Augmenting Convolutional Topologies (EXACT), which is capable of evolving the structure of convolutional neural networks (CNNs). While EXACT has multithreaded and parallel implementations, it has also been implemented as part of a volunteer computing project at the Citizen Science Grid to provide truly large scale computing resources through over 5,500 volunteered computers. Improvements include the development of a new mutation operator, which increased the evolution rate by over an order of magnitude and was also shown to be significantly more reliable in generating new CNNs than the traditional method. Further, EXACT has been extended with a simplex hyperparameter optimization (SHO) method which allows for the co-evolution of hyperparameters, simplifying the task of their selection while generating smaller CNNs with similar predictive ability to those generated with fixed hyperparameters. Lastly, the backpropagation method has been updated with batch normalization and dropout. Compared to previous work, which only achieved prediction rates of 98.32% on the MNIST handwritten digits testing data after 60,000 evolved CNNs, these new advances allowed EXACT to achieve prediction rates of 99.43% within only 12,500 evolved CNNs - rates which are comparable to some of the best human designed CNNs.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121543769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Supporting Data-Driven Workflows Enabled by Large Scale Observatories 支持大规模天文台启用的数据驱动工作流
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.95
A. Zamani, Moustafa AbdelBaky, Daniel Balouek-Thomert, I. Rodero, M. Parashar
{"title":"Supporting Data-Driven Workflows Enabled by Large Scale Observatories","authors":"A. Zamani, Moustafa AbdelBaky, Daniel Balouek-Thomert, I. Rodero, M. Parashar","doi":"10.1109/eScience.2017.95","DOIUrl":"https://doi.org/10.1109/eScience.2017.95","url":null,"abstract":"Large scale observatories are shared-use resources that provide open access to data from geographically distributed sensors and instruments. This data has the potential to accelerate scientific discovery. However, seamlessly integrating the data into scientific workflows remains a challenge. In this paper, we summarize our ongoing work in supporting data-driven and data-intensive workflows and outline our vision for how these observatories can improve large-scale science. Specifically, we present programming abstractions and runtime management services to enable the automatic integration of data in scientific workflows. Further, we show how approximation techniques can be used to address network and processing variations by studying constraint limitations and their associated latencies. We use the Ocean Observatories Initiative (OOI) as a driving use case for this work.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131384230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
SELFIE: Self-Aware Information Extraction from Digitized Biocollections 自拍:从数字化生物馆藏中提取自我意识信息
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.19
I. Alzuru, Andréa M. Matsunaga, Maurício O. Tsugawa, J. Fortes
{"title":"SELFIE: Self-Aware Information Extraction from Digitized Biocollections","authors":"I. Alzuru, Andréa M. Matsunaga, Maurício O. Tsugawa, J. Fortes","doi":"10.1109/eScience.2017.19","DOIUrl":"https://doi.org/10.1109/eScience.2017.19","url":null,"abstract":"Biological collections store information with broad societal and environmental impact. In the last 15 years, after worldwide investments and crowdsourcing efforts, 25% of the collected specimens have been digitized; a process that includes the imaging of text attached to specimens and subsequent extraction of information from the resulting image. This information extraction (IE) process is complex, thus slow and typically involving human tasks. We propose a hybrid (Human-Machine) information extraction model that efficiently uses resources of different cost (machines, volunteers and/or experts) and speeds up the biocollections' digitization process, while striving to maintain the same quality as human-only IE processes. In the proposed model, called SELFIE, self-aware IE processes determine whether their output quality is satisfactory. If the quality is unsatisfactory, additional or alternative processes that yield higher quality output at higher cost are triggered. The effectiveness of this model is demonstrated by three SELFIE workflows for the extraction of Darwin-core terms from specimens' images. Compared to the traditional human-driven IE approach, SELFIE workflows showed, on average, a reduction of 27% in the information-capture time and a decrease of 32% in the required number of humans and their associated cost, while the quality of the results was negligibly reduced by 0.27%.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"423 2-3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131707838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Task-Based Budget Distribution Strategies for Scientific Workflows with Coarse-Grained Billing Periods in IaaS Clouds IaaS云中具有粗粒度计费周期的科学工作流的基于任务的预算分配策略
2017 IEEE 13th International Conference on e-Science (e-Science) Pub Date : 2017-10-01 DOI: 10.1109/eScience.2017.25
M. Hilman, M. A. Rodriguez, R. Buyya
{"title":"Task-Based Budget Distribution Strategies for Scientific Workflows with Coarse-Grained Billing Periods in IaaS Clouds","authors":"M. Hilman, M. A. Rodriguez, R. Buyya","doi":"10.1109/eScience.2017.25","DOIUrl":"https://doi.org/10.1109/eScience.2017.25","url":null,"abstract":"The use of cloud computing, particularly of Infrastructure as a Service clouds, for the execution of largescale scientific workflows has been a topic of interest in recent years. These environments offer on-demand access to all of the infrastructure required for the deployment of workflows, allowing users to pay only for what they use. This leads to schedulers having to find a trade-off between two conflicting quality of service requirements: time and cost. The majority of research in this area has focused on developing scheduling algorithms that have as objective minimizing the infrastructure cost while meeting a deadline constraint. Few algorithms, however, have addressed the problem of minimizing the execution time of the workflow while meeting a budget constraint. This paper focuses on the latter case. We propose a budget-distribution algorithm that assigns a portion of the overall workflow budget to the individual tasks. This task-level budget then guides the dynamic scheduling process and is continuously refined to reflect any unexpected costs. When compared to the state-of-the-art algorithm, the performance evaluation results demonstrate that in 88% of the cases, our proposal achieves equal or better performance in terms of meeting the budget constraint and achieves lower execution times in 84% of the cases.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134381446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信