阿尔茨海默病分类中缺失异构数据的自动编码器输入。

IF 3.3 Q3 ENGINEERING, BIOMEDICAL

Healthcare Technology Letters Pub Date : 2024-09-15 DOI:10.1049/htl2.12091

Namitha Thalekkara Haridas, Jose M. Sanchez-Bornot, Paula L. McClean, KongFatt Wong-Lin, Alzheimer's Disease Neuroimaging Initiative (ADNI)

{"title":"阿尔茨海默病分类中缺失异构数据的自动编码器输入。","authors":"Namitha Thalekkara Haridas, Jose M. Sanchez-Bornot, Paula L. McClean, KongFatt Wong-Lin, Alzheimer's Disease Neuroimaging Initiative (ADNI)","doi":"10.1049/htl2.12091","DOIUrl":null,"url":null,"abstract":"<p>Missing Alzheimer's disease (AD) data is prevalent and poses significant challenges for AD diagnosis. Previous studies have explored various data imputation approaches on AD data, but the systematic evaluation of deep learning algorithms for imputing heterogeneous and comprehensive AD data is limited. This study investigates the efficacy of denoising autoencoder-based imputation of missing key features of heterogeneous data that comprised tau-PET, MRI, cognitive and functional assessments, genotype, sociodemographic, and medical history. The authors focused on extreme (≥40%) missing at random of key features which depend on AD progression; identified as the history of a mother having AD, APoE ε4 alleles, and clinical dementia rating. Along with features selected using traditional feature selection methods, latent features extracted from the denoising autoencoder are incorporated for subsequent classification. Using random forest classification with 10-fold cross-validation, robust AD predictive performance of imputed datasets (accuracy: 79%–85%; precision: 71%–85%) across missingness levels, and high recall values with 40% missingness are found. Further, the feature-selected dataset using feature selection methods, including autoencoder, demonstrated higher classification score than that of the original complete dataset. These results highlight the effectiveness and robustness of autoencoder in imputing crucial information for reliable AD prediction in AI-based clinical decision support systems.</p>","PeriodicalId":37474,"journal":{"name":"Healthcare Technology Letters","volume":"11 6","pages":"452-460"},"PeriodicalIF":3.3000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665783/pdf/","citationCount":"0","resultStr":"{\"title\":\"Autoencoder imputation of missing heterogeneous data for Alzheimer's disease classification\",\"authors\":\"Namitha Thalekkara Haridas, Jose M. Sanchez-Bornot, Paula L. McClean, KongFatt Wong-Lin, Alzheimer's Disease Neuroimaging Initiative (ADNI)\",\"doi\":\"10.1049/htl2.12091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Missing Alzheimer's disease (AD) data is prevalent and poses significant challenges for AD diagnosis. Previous studies have explored various data imputation approaches on AD data, but the systematic evaluation of deep learning algorithms for imputing heterogeneous and comprehensive AD data is limited. This study investigates the efficacy of denoising autoencoder-based imputation of missing key features of heterogeneous data that comprised tau-PET, MRI, cognitive and functional assessments, genotype, sociodemographic, and medical history. The authors focused on extreme (≥40%) missing at random of key features which depend on AD progression; identified as the history of a mother having AD, APoE ε4 alleles, and clinical dementia rating. Along with features selected using traditional feature selection methods, latent features extracted from the denoising autoencoder are incorporated for subsequent classification. Using random forest classification with 10-fold cross-validation, robust AD predictive performance of imputed datasets (accuracy: 79%–85%; precision: 71%–85%) across missingness levels, and high recall values with 40% missingness are found. Further, the feature-selected dataset using feature selection methods, including autoencoder, demonstrated higher classification score than that of the original complete dataset. These results highlight the effectiveness and robustness of autoencoder in imputing crucial information for reliable AD prediction in AI-based clinical decision support systems.</p>\",\"PeriodicalId\":37474,\"journal\":{\"name\":\"Healthcare Technology Letters\",\"volume\":\"11 6\",\"pages\":\"452-460\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11665783/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Healthcare Technology Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/htl2.12091\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/htl2.12091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

阿尔茨海默病（AD）数据的缺失是普遍存在的，这对阿尔茨海默病的诊断提出了重大挑战。以往的研究已经探索了各种AD数据的数据输入方法，但对深度学习算法对异构和综合AD数据的输入的系统评估有限。本研究探讨了基于自编码器的去噪方法对包括tau-PET、MRI、认知和功能评估、基因型、社会人口统计学和病史在内的异构数据缺失关键特征的归算效果。作者关注的是与AD进展相关的关键特征随机极度缺失（≥40%）；母亲有阿尔茨海默病病史、APoE ε4等位基因和临床痴呆评分。在传统特征选择方法选取特征的基础上，结合去噪自编码器提取的潜在特征进行后续分类。使用具有10倍交叉验证的随机森林分类，输入数据集具有鲁棒的AD预测性能(准确率：79%-85%；在缺失水平上，准确率为71%-85%)，并且在缺失率为40%时发现了高召回值。此外，使用特征选择方法（包括自动编码器）的特征选择数据集比原始完整数据集表现出更高的分类分数。这些结果突出了自编码器在输入关键信息方面的有效性和鲁棒性，从而为基于人工智能的临床决策支持系统提供可靠的AD预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Autoencoder imputation of missing heterogeneous data for Alzheimer's disease classification

查看原文本刊更多论文

Autoencoder imputation of missing heterogeneous data for Alzheimer's disease classification

Missing Alzheimer's disease (AD) data is prevalent and poses significant challenges for AD diagnosis. Previous studies have explored various data imputation approaches on AD data, but the systematic evaluation of deep learning algorithms for imputing heterogeneous and comprehensive AD data is limited. This study investigates the efficacy of denoising autoencoder-based imputation of missing key features of heterogeneous data that comprised tau-PET, MRI, cognitive and functional assessments, genotype, sociodemographic, and medical history. The authors focused on extreme (≥40%) missing at random of key features which depend on AD progression; identified as the history of a mother having AD, APoE ε4 alleles, and clinical dementia rating. Along with features selected using traditional feature selection methods, latent features extracted from the denoising autoencoder are incorporated for subsequent classification. Using random forest classification with 10-fold cross-validation, robust AD predictive performance of imputed datasets (accuracy: 79%–85%; precision: 71%–85%) across missingness levels, and high recall values with 40% missingness are found. Further, the feature-selected dataset using feature selection methods, including autoencoder, demonstrated higher classification score than that of the original complete dataset. These results highlight the effectiveness and robustness of autoencoder in imputing crucial information for reliable AD prediction in AI-based clinical decision support systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Healthcare Technology Letters Health Professions-Health Information Management

CiteScore

6.10

自引率

4.80%

发文量

审稿时长

22 weeks

期刊介绍： Healthcare Technology Letters aims to bring together an audience of biomedical and electrical engineers, physical and computer scientists, and mathematicians to enable the exchange of the latest ideas and advances through rapid online publication of original healthcare technology research. Major themes of the journal include (but are not limited to): Major technological/methodological areas: Biomedical signal processing Biomedical imaging and image processing Bioinstrumentation (sensors, wearable technologies, etc) Biomedical informatics Major application areas: Cardiovascular and respiratory systems engineering Neural engineering, neuromuscular systems Rehabilitation engineering Bio-robotics, surgical planning and biomechanics Therapeutic and diagnostic systems, devices and technologies Clinical engineering Healthcare information systems, telemedicine, mHealth.