An IoT-Cloud Based Solution for Real-Time and Batch Processing of Big Data: Application in Healthcare

N. Taher, Imane Mallat, N. Agoulmine, Nour El-Mawass
{"title":"An IoT-Cloud Based Solution for Real-Time and Batch Processing of Big Data: Application in Healthcare","authors":"N. Taher, Imane Mallat, N. Agoulmine, Nour El-Mawass","doi":"10.1109/BIOSMART.2019.8734185","DOIUrl":null,"url":null,"abstract":"With the large use of Internet of Things (IoT) today, everything around us seems to generate data. The ever increasing number of connected things or objects (IoT) is coupled with a growing volume of data generated at a continually increasing rate. Especially where data is big or there is a need to process it, cloud infrastructures, with their scalability and easy access, are becoming the solution of choice for storage and processing. In the context of healthcare applications, where medical sensors collect health data from patients and send it to the cloud, two issues frequently appear in relation to “Big Data”. The first issue is related to real-time analysis introduced by the increasing velocity at which data is generated especially from connected devices (IoT). This data should be analyzed continuously in real-time in order to take appropriate actions regarding the patient’s care plan. Moreover, medical data accumulated from different patients over time constitutes an important training dataset that can be used to train machine learning models in order to perform smarter disease prediction and treatment. This gives rise to another issue regarding long-term batch processing of often huge volumes of stored data. To deal with these issues, we propose an IoT-Cloud based framework for real-time and batch processing of Big Data in the healthcare domain. We implement the proposed solution on Amazon Cloud operator known as Amazon Web Services (AWS) and use a Raspberry pi as an IoT device to generate data in real time. We test the solution with the specific application of ECG monitoring and abnormality reporting. We analyze the performance of the implemented system in terms of response time by varying the velocity and volume of the analyzed data. We also discuss how the cloud resources should be provisioned in order to guarantee processing performance for both long-term and real-time scenarios. To ensure a good tradeoff between cost and processing performance, resources provision should be adapted to the exact needs and characteristics of the considered application.","PeriodicalId":113918,"journal":{"name":"2019 3rd International Conference on Bio-engineering for Smart Technologies (BioSMART)","volume":"448 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 3rd International Conference on Bio-engineering for Smart Technologies (BioSMART)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIOSMART.2019.8734185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

With the large use of Internet of Things (IoT) today, everything around us seems to generate data. The ever increasing number of connected things or objects (IoT) is coupled with a growing volume of data generated at a continually increasing rate. Especially where data is big or there is a need to process it, cloud infrastructures, with their scalability and easy access, are becoming the solution of choice for storage and processing. In the context of healthcare applications, where medical sensors collect health data from patients and send it to the cloud, two issues frequently appear in relation to “Big Data”. The first issue is related to real-time analysis introduced by the increasing velocity at which data is generated especially from connected devices (IoT). This data should be analyzed continuously in real-time in order to take appropriate actions regarding the patient’s care plan. Moreover, medical data accumulated from different patients over time constitutes an important training dataset that can be used to train machine learning models in order to perform smarter disease prediction and treatment. This gives rise to another issue regarding long-term batch processing of often huge volumes of stored data. To deal with these issues, we propose an IoT-Cloud based framework for real-time and batch processing of Big Data in the healthcare domain. We implement the proposed solution on Amazon Cloud operator known as Amazon Web Services (AWS) and use a Raspberry pi as an IoT device to generate data in real time. We test the solution with the specific application of ECG monitoring and abnormality reporting. We analyze the performance of the implemented system in terms of response time by varying the velocity and volume of the analyzed data. We also discuss how the cloud resources should be provisioned in order to guarantee processing performance for both long-term and real-time scenarios. To ensure a good tradeoff between cost and processing performance, resources provision should be adapted to the exact needs and characteristics of the considered application.
基于物联网云的大数据实时批量处理解决方案:在医疗保健领域的应用
随着物联网(IoT)的广泛使用,我们周围的一切似乎都在产生数据。连接的事物或对象(IoT)数量的不断增加与以不断增长的速度生成的数据量相结合。特别是在数据量大或需要处理数据的情况下,云基础设施具有可扩展性和易于访问性,正在成为存储和处理的首选解决方案。在医疗保健应用的背景下,医疗传感器从患者那里收集健康数据并将其发送到云端,与“大数据”相关的两个问题经常出现。第一个问题与实时分析有关,这是由于数据生成的速度越来越快,特别是从连接设备(IoT)生成的数据。这些数据应该持续实时分析,以便针对患者的护理计划采取适当的行动。此外,随着时间的推移,从不同患者身上积累的医疗数据构成了一个重要的训练数据集,可以用来训练机器学习模型,以便进行更智能的疾病预测和治疗。这就产生了另一个关于长期批量处理通常是海量存储数据的问题。为了解决这些问题,我们提出了一个基于物联网云的框架,用于实时和批量处理医疗保健领域的大数据。我们在亚马逊云运营商亚马逊网络服务(AWS)上实现了提出的解决方案,并使用树莓派作为物联网设备实时生成数据。我们以心电监测和异常报告的具体应用为例对该方案进行了测试。我们通过改变所分析数据的速度和容量,从响应时间的角度来分析所实现系统的性能。我们还讨论了应该如何配置云资源,以保证长期和实时场景的处理性能。为了确保在成本和处理性能之间取得良好的平衡,应根据所考虑的应用程序的确切需求和特征调整资源供应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信