Samuel Ruipérez-Campillo , Michael Reiss , Elisa Ramírez , Antonio Cebrián , José Millet , Francisco Castells
{"title":"用于医学时间序列分类的聚类和机器学习框架","authors":"Samuel Ruipérez-Campillo , Michael Reiss , Elisa Ramírez , Antonio Cebrián , José Millet , Francisco Castells","doi":"10.1016/j.bbe.2024.07.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and motivation:</h3><p>The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals.</p></div><div><h3>Methods:</h3><p>To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes.</p></div><div><h3>Results:</h3><p>The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations.</p></div><div><h3>Conclusion:</h3><p>Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.</p></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"44 3","pages":"Pages 521-533"},"PeriodicalIF":5.3000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clustering and machine learning framework for medical time series classification\",\"authors\":\"Samuel Ruipérez-Campillo , Michael Reiss , Elisa Ramírez , Antonio Cebrián , José Millet , Francisco Castells\",\"doi\":\"10.1016/j.bbe.2024.07.005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and motivation:</h3><p>The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals.</p></div><div><h3>Methods:</h3><p>To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes.</p></div><div><h3>Results:</h3><p>The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations.</p></div><div><h3>Conclusion:</h3><p>Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.</p></div>\",\"PeriodicalId\":55381,\"journal\":{\"name\":\"Biocybernetics and Biomedical Engineering\",\"volume\":\"44 3\",\"pages\":\"Pages 521-533\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biocybernetics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0208521624000524\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521624000524","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
Clustering and machine learning framework for medical time series classification
Background and motivation:
The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals.
Methods:
To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes.
Results:
The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations.
Conclusion:
Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.
期刊介绍:
Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.