{"title":"心电图实时数据分析体系结构","authors":"Nur Banu Oğur, C. Çeken","doi":"10.1109/UBMK.2018.8566300","DOIUrl":null,"url":null,"abstract":"The concept of big data emerging from the expansion of data volumes on the Internet has begun to talk about its name in medicine as well as in many fields of life. Big data analytics, which also require the use of machine learning methods, enable the use of decision-making processes by extracting useful information from large and complex data sets. Implementing machine learning strategies on data sets within big data is an expensive process because it requires extensive use of resources such as CPU and memory. For this reason, platforms specially developed for big data analytics are designed. One of these systems, Apache Spark, has built-in machine learning algorithms ranging from regression to classification and clustering, and is a very powerful engine for real time stream processing. In this study, the first results of a system that provides real-time disease diagnosis from ECG data using Logistic Regression are presented. The first findings obtained show that this architecture, built with Apache Kafka and Apache Spark, can be an important design option in real time processing of ECG data.","PeriodicalId":293249,"journal":{"name":"2018 3rd International Conference on Computer Science and Engineering (UBMK)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Real Time Data Analytics Architecture for ECG\",\"authors\":\"Nur Banu Oğur, C. Çeken\",\"doi\":\"10.1109/UBMK.2018.8566300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The concept of big data emerging from the expansion of data volumes on the Internet has begun to talk about its name in medicine as well as in many fields of life. Big data analytics, which also require the use of machine learning methods, enable the use of decision-making processes by extracting useful information from large and complex data sets. Implementing machine learning strategies on data sets within big data is an expensive process because it requires extensive use of resources such as CPU and memory. For this reason, platforms specially developed for big data analytics are designed. One of these systems, Apache Spark, has built-in machine learning algorithms ranging from regression to classification and clustering, and is a very powerful engine for real time stream processing. In this study, the first results of a system that provides real-time disease diagnosis from ECG data using Logistic Regression are presented. The first findings obtained show that this architecture, built with Apache Kafka and Apache Spark, can be an important design option in real time processing of ECG data.\",\"PeriodicalId\":293249,\"journal\":{\"name\":\"2018 3rd International Conference on Computer Science and Engineering (UBMK)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 3rd International Conference on Computer Science and Engineering (UBMK)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UBMK.2018.8566300\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Computer Science and Engineering (UBMK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UBMK.2018.8566300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The concept of big data emerging from the expansion of data volumes on the Internet has begun to talk about its name in medicine as well as in many fields of life. Big data analytics, which also require the use of machine learning methods, enable the use of decision-making processes by extracting useful information from large and complex data sets. Implementing machine learning strategies on data sets within big data is an expensive process because it requires extensive use of resources such as CPU and memory. For this reason, platforms specially developed for big data analytics are designed. One of these systems, Apache Spark, has built-in machine learning algorithms ranging from regression to classification and clustering, and is a very powerful engine for real time stream processing. In this study, the first results of a system that provides real-time disease diagnosis from ECG data using Logistic Regression are presented. The first findings obtained show that this architecture, built with Apache Kafka and Apache Spark, can be an important design option in real time processing of ECG data.