生物医学瞬态信号的变量聚类方法

Int. J. Comput. Model. Algorithms Medicine Pub Date : 1900-01-01 DOI:10.4018/jcmam.2012010103

Pimwadee Chaovalit

{"title":"生物医学瞬态信号的变量聚类方法","authors":"Pimwadee Chaovalit","doi":"10.4018/jcmam.2012010103","DOIUrl":null,"url":null,"abstract":"Biomedical signals which help monitor patients’ physical conditions are a crucial part of the healthcare industry. The healthcare professionals’ ability to monitor patients and detect early signs of conditions such as blocked arteries and abnormal heart rhythms can be accomplished by performing data clustering on biomedical signals. More importantly, clustering on streams of biomedical signals make it possible to look for patterns that may indicate developing conditions. While there are a number of clustering algorithms that perform data streams clustering by example, few algorithms exist that perform clustering by variable. This paper presents POD-Clus, a clustering method which uses a model-based clustering principle and, in addition to clustering by example, also cluster data streams by variable. The clustering result from POD-Clus was superior to the result from ODAC, a baseline algorithm, for both with and without cluster evolutions. DOI: 10.4018/jcmam.2012010103 International Journal of Computational Models and Algorithms in Medicine, 3(1), 32-71, January-March 2012 33 Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. leading to a total of 1.2 million data points. This is an enormous amount of data collected within a short period of time. As various heart conditions can be revealed from the analysis of heartbeats. For example, a heart-related condition such as blocked blood supply will generate tissue death and reflect in the abnormal height of heartbeat waves. Therefore, analyzing these large amounts of data in a timely manner for a quick diagnosis is challenging, as the data may become too large to either deliver over a network (Sun & Sclabassi, 1999) or store in the main memory of the device. For this reason, data streams processing needs to happen real-time while data streams arrive. As various types of biomedical signals can be considered data streams, there exists a need for effective data streams mining techniques that can handle such data streams efficiently. Data streams’ characteristics (Domingos & Hulten, 2000; Gama, Rodrigues, & Aguilar-Ruiz, 2007) can be described as follows: • Data from the streams usually come in at a detailed level, e.g., 1000 Hz. • Streaming data arrives at a fast pace, therefore agile data management and utilization is key. • Observations of data are potentially unbounded. • Storage and memory resources for processing data streams are possibly limited. Data streams clustering can be incorporated into a computer-aided analysis used by physicians to cluster biomedical signals for diagnosis on the patients. By grouping biomedical signals into homogeneous clusters, we learn about data characteristics which may indicate developing conditions. Results from clustering can then be developed into classification models or predictive models useful in healthcare diagnoses. As Chaovalit (2010) has proposed the POD-Clus algorithm (Probability and Distribution-based Clustering) for data streams clustering, the algorithm was for only its ability to cluster data streams by example. This paper focuses on the ability of POD-Clus to cluster by variable. Clustering by variable is another clustering perspective for data streams. POD-Cus shows a significant improvement on clustering results compared to the baseline competing algorithm from the same category.","PeriodicalId":162417,"journal":{"name":"Int. J. Comput. Model. Algorithms Medicine","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Methodology for Clustering Transient Biomedical Signals by Variable\",\"authors\":\"Pimwadee Chaovalit\",\"doi\":\"10.4018/jcmam.2012010103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Biomedical signals which help monitor patients’ physical conditions are a crucial part of the healthcare industry. The healthcare professionals’ ability to monitor patients and detect early signs of conditions such as blocked arteries and abnormal heart rhythms can be accomplished by performing data clustering on biomedical signals. More importantly, clustering on streams of biomedical signals make it possible to look for patterns that may indicate developing conditions. While there are a number of clustering algorithms that perform data streams clustering by example, few algorithms exist that perform clustering by variable. This paper presents POD-Clus, a clustering method which uses a model-based clustering principle and, in addition to clustering by example, also cluster data streams by variable. The clustering result from POD-Clus was superior to the result from ODAC, a baseline algorithm, for both with and without cluster evolutions. DOI: 10.4018/jcmam.2012010103 International Journal of Computational Models and Algorithms in Medicine, 3(1), 32-71, January-March 2012 33 Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. leading to a total of 1.2 million data points. This is an enormous amount of data collected within a short period of time. As various heart conditions can be revealed from the analysis of heartbeats. For example, a heart-related condition such as blocked blood supply will generate tissue death and reflect in the abnormal height of heartbeat waves. Therefore, analyzing these large amounts of data in a timely manner for a quick diagnosis is challenging, as the data may become too large to either deliver over a network (Sun & Sclabassi, 1999) or store in the main memory of the device. For this reason, data streams processing needs to happen real-time while data streams arrive. As various types of biomedical signals can be considered data streams, there exists a need for effective data streams mining techniques that can handle such data streams efficiently. Data streams’ characteristics (Domingos & Hulten, 2000; Gama, Rodrigues, & Aguilar-Ruiz, 2007) can be described as follows: • Data from the streams usually come in at a detailed level, e.g., 1000 Hz. • Streaming data arrives at a fast pace, therefore agile data management and utilization is key. • Observations of data are potentially unbounded. • Storage and memory resources for processing data streams are possibly limited. Data streams clustering can be incorporated into a computer-aided analysis used by physicians to cluster biomedical signals for diagnosis on the patients. By grouping biomedical signals into homogeneous clusters, we learn about data characteristics which may indicate developing conditions. Results from clustering can then be developed into classification models or predictive models useful in healthcare diagnoses. As Chaovalit (2010) has proposed the POD-Clus algorithm (Probability and Distribution-based Clustering) for data streams clustering, the algorithm was for only its ability to cluster data streams by example. This paper focuses on the ability of POD-Clus to cluster by variable. Clustering by variable is another clustering perspective for data streams. POD-Cus shows a significant improvement on clustering results compared to the baseline competing algorithm from the same category.\",\"PeriodicalId\":162417,\"journal\":{\"name\":\"Int. J. Comput. Model. Algorithms Medicine\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Comput. Model. Algorithms Medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/jcmam.2012010103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Comput. Model. Algorithms Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/jcmam.2012010103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

帮助监测患者身体状况的生物医学信号是医疗保健行业的重要组成部分。医疗保健专业人员可以通过对生物医学信号进行数据聚类来监测患者并检测动脉阻塞和心律异常等疾病的早期迹象。更重要的是，在生物医学信号流上的聚类使得寻找可能指示发育状况的模式成为可能。虽然有许多聚类算法可以通过示例执行数据流聚类，但很少有算法可以执行变量聚类。本文提出了一种基于模型聚类原理的聚类方法POD-Clus，该方法除按实例聚类外，还按变量对数据流进行聚类。无论是否进行聚类进化，POD-Clus的聚类结果都优于基线算法ODAC。DOI: 10.4018 / jcmam.2012010103国际医学计算模型与算法杂志，3(1)，32-71,jan - march 2012 33版权所有©2012,IGI Global。未经IGI Global书面许可，禁止以印刷或电子形式复制或分发。总共有120万个数据点。这是在短时间内收集到的大量数据。因为通过对心跳的分析可以揭示各种心脏状况。例如，与心脏有关的疾病，如血液供应受阻，会导致组织死亡，并反映在心跳波的异常高度上。因此，及时分析这些大量数据以进行快速诊断是具有挑战性的，因为数据可能变得太大，无法通过网络传输(Sun & Sclabassi, 1999)或存储在设备的主存储器中。由于这个原因，数据流处理需要在数据流到达时实时进行。由于各种类型的生物医学信号可以被视为数据流，因此需要有效的数据流挖掘技术来有效地处理这些数据流。数据流的特征(Domingos & Hulten, 2000;Gama, Rodrigues， & Aguilar-Ruiz, 2007)可以描述如下:•来自流的数据通常以详细级别进入，例如1000 Hz。•流数据到达的速度很快，因此灵活的数据管理和利用是关键。•对数据的观察可能是无界的。•用于处理数据流的存储和内存资源可能有限。数据流聚类可以整合到计算机辅助分析中，用于医生对患者进行诊断的生物医学信号聚类。通过将生物医学信号分组为同质簇，我们可以了解可能指示发展条件的数据特征。然后，聚类的结果可以开发成分类模型或预测模型，用于医疗保健诊断。Chaovalit(2010)提出了用于数据流聚类的POD-Clus算法(Probability and Distribution-based Clustering)，该算法仅针对其通过实例对数据流进行聚类的能力。本文主要研究POD-Clus按变量聚类的能力。按变量聚类是数据流的另一种聚类视角。与来自同一类别的基线竞争算法相比，POD-Cus在聚类结果上有显着改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Methodology for Clustering Transient Biomedical Signals by Variable

Biomedical signals which help monitor patients’ physical conditions are a crucial part of the healthcare industry. The healthcare professionals’ ability to monitor patients and detect early signs of conditions such as blocked arteries and abnormal heart rhythms can be accomplished by performing data clustering on biomedical signals. More importantly, clustering on streams of biomedical signals make it possible to look for patterns that may indicate developing conditions. While there are a number of clustering algorithms that perform data streams clustering by example, few algorithms exist that perform clustering by variable. This paper presents POD-Clus, a clustering method which uses a model-based clustering principle and, in addition to clustering by example, also cluster data streams by variable. The clustering result from POD-Clus was superior to the result from ODAC, a baseline algorithm, for both with and without cluster evolutions. DOI: 10.4018/jcmam.2012010103 International Journal of Computational Models and Algorithms in Medicine, 3(1), 32-71, January-March 2012 33 Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. leading to a total of 1.2 million data points. This is an enormous amount of data collected within a short period of time. As various heart conditions can be revealed from the analysis of heartbeats. For example, a heart-related condition such as blocked blood supply will generate tissue death and reflect in the abnormal height of heartbeat waves. Therefore, analyzing these large amounts of data in a timely manner for a quick diagnosis is challenging, as the data may become too large to either deliver over a network (Sun & Sclabassi, 1999) or store in the main memory of the device. For this reason, data streams processing needs to happen real-time while data streams arrive. As various types of biomedical signals can be considered data streams, there exists a need for effective data streams mining techniques that can handle such data streams efficiently. Data streams’ characteristics (Domingos & Hulten, 2000; Gama, Rodrigues, & Aguilar-Ruiz, 2007) can be described as follows: • Data from the streams usually come in at a detailed level, e.g., 1000 Hz. • Streaming data arrives at a fast pace, therefore agile data management and utilization is key. • Observations of data are potentially unbounded. • Storage and memory resources for processing data streams are possibly limited. Data streams clustering can be incorporated into a computer-aided analysis used by physicians to cluster biomedical signals for diagnosis on the patients. By grouping biomedical signals into homogeneous clusters, we learn about data characteristics which may indicate developing conditions. Results from clustering can then be developed into classification models or predictive models useful in healthcare diagnoses. As Chaovalit (2010) has proposed the POD-Clus algorithm (Probability and Distribution-based Clustering) for data streams clustering, the algorithm was for only its ability to cluster data streams by example. This paper focuses on the ability of POD-Clus to cluster by variable. Clustering by variable is another clustering perspective for data streams. POD-Cus shows a significant improvement on clustering results compared to the baseline competing algorithm from the same category.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Int. J. Comput. Model. Algorithms Medicine

自引率

0.00%

发文量