PET淀粉样蛋白成像研究中基于重采样的稀疏聚类分类。

IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium Pub Date : 2011-10-01 DOI:10.1109/NSSMIC.2011.6152564

Wenzhu Bi, George C Tseng, Lisa A Weissfeld, Julie C Price

{"title":"PET淀粉样蛋白成像研究中基于重采样的稀疏聚类分类。","authors":"Wenzhu Bi, George C Tseng, Lisa A Weissfeld, Julie C Price","doi":"10.1109/NSSMIC.2011.6152564","DOIUrl":null,"url":null,"abstract":"Sparse k-means clustering (Sparse_kM) can exclude uninformative variables and yield reliable parsimonious clustering results, especially for p≫n. In this work, Sparse_kM and data resampling were combined to identify variables of greatest interest and define confidence levels for the clustering. The method was evaluated by statistical simulation and applied to PiB PET amyloid imaging data to identify normal control (NC) subjects with (+) or without (-) evidence of amyloid, i.e., PiB(+/-).Simulations: A dataset of n=60 observations (3 groups of 20) and p=500 variables was generated for each simulation run; only 50 variables were truly different across groups. The dataset was resampled 20 times, Sparse_kM was applied to each sample and average variable weights were calculated. Probabilities of cluster membership, also called confidence levels, were computed (n=60). Simulations were performed 250 times. The 50 truly different variables were identified by variable weights that were 13-32 times greater than those for the 450 uninformative variables.Human data: For the PiB PET dataset, images (ECAT HR+, 10-15 mCi, 90 min) were acquired for 64 cognitively normal subjects (74.1±5.4 yrs). Parametric PiB distribution volume ratio images were generated (Logan method, cerebellum reference) and normalized to the MNI template (SPM8) to produce a dataset of n=64 subjects and p=343,099 voxels/image. The dataset was resampled 10 times and Sparse_kM was applied. An average voxel weight image was computed that indicated cortical areas of greatest interest that included precuneus and frontal cortex; these are key areas linked to early amyloid deposition. Seven of 64 subjects were identified as PiB(+) and 47 as PiB(-) with confidence ≥ 90%, where another subject was PiB(+) at lower confidence (80%) and the other 9 subjects were PiB(-) at confidence in the range of 50-70%. In conclusion, Sparse_kM with resampling can help to establish confidence levels for clustering when p≫n and may be a promising method for revealing informative voxels/spatial patterns that distinguish levels of amyloid load, including that at the transitional amyloid +/- boundary.","PeriodicalId":73298,"journal":{"name":"IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium","volume":"2011 ","pages":"3108-3111"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/NSSMIC.2011.6152564","citationCount":"3","resultStr":"{\"title\":\"Sparse Clustering with Resampling for Subject Classification in PET Amyloid Imaging Studies.\",\"authors\":\"Wenzhu Bi, George C Tseng, Lisa A Weissfeld, Julie C Price\",\"doi\":\"10.1109/NSSMIC.2011.6152564\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sparse k-means clustering (Sparse_kM) can exclude uninformative variables and yield reliable parsimonious clustering results, especially for p≫n. In this work, Sparse_kM and data resampling were combined to identify variables of greatest interest and define confidence levels for the clustering. The method was evaluated by statistical simulation and applied to PiB PET amyloid imaging data to identify normal control (NC) subjects with (+) or without (-) evidence of amyloid, i.e., PiB(+/-).Simulations: A dataset of n=60 observations (3 groups of 20) and p=500 variables was generated for each simulation run; only 50 variables were truly different across groups. The dataset was resampled 20 times, Sparse_kM was applied to each sample and average variable weights were calculated. Probabilities of cluster membership, also called confidence levels, were computed (n=60). Simulations were performed 250 times. The 50 truly different variables were identified by variable weights that were 13-32 times greater than those for the 450 uninformative variables.Human data: For the PiB PET dataset, images (ECAT HR+, 10-15 mCi, 90 min) were acquired for 64 cognitively normal subjects (74.1±5.4 yrs). Parametric PiB distribution volume ratio images were generated (Logan method, cerebellum reference) and normalized to the MNI template (SPM8) to produce a dataset of n=64 subjects and p=343,099 voxels/image. The dataset was resampled 10 times and Sparse_kM was applied. An average voxel weight image was computed that indicated cortical areas of greatest interest that included precuneus and frontal cortex; these are key areas linked to early amyloid deposition. Seven of 64 subjects were identified as PiB(+) and 47 as PiB(-) with confidence ≥ 90%, where another subject was PiB(+) at lower confidence (80%) and the other 9 subjects were PiB(-) at confidence in the range of 50-70%. In conclusion, Sparse_kM with resampling can help to establish confidence levels for clustering when p≫n and may be a promising method for revealing informative voxels/spatial patterns that distinguish levels of amyloid load, including that at the transitional amyloid +/- boundary.\",\"PeriodicalId\":73298,\"journal\":{\"name\":\"IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium\",\"volume\":\"2011 \",\"pages\":\"3108-3111\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1109/NSSMIC.2011.6152564\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NSSMIC.2011.6152564\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NSSMIC.2011.6152564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

稀疏k-均值聚类(Sparse_kM)可以排除无信息变量并产生可靠的简约聚类结果，特别是对于p > n。在这项工作中，Sparse_kM和数据重采样相结合，以确定最感兴趣的变量，并定义聚类的置信水平。通过统计模拟对该方法进行评估，并将其应用于PiB PET淀粉样蛋白成像数据，以识别具有(+)或不具有(-)淀粉样蛋白证据的正常对照(NC)受试者，即PiB(+/-)。模拟:每次模拟运行生成n=60个观测值(3组，每组20个)和p=500个变量的数据集;两组之间只有50个变量是真正不同的。数据集重采样20次，对每个样本应用Sparse_kM并计算平均变量权。计算集群隶属度的概率，也称为置信水平(n=60)。模拟进行了250次。50个真正不同的变量的变量权重是450个无信息变量的13-32倍。人类数据:对于PiB PET数据集，获得64名认知正常受试者(74.1±5.4岁)的图像(ECAT HR+， 10-15 mCi, 90分钟)。生成参数PiB分布体积比图像(Logan方法，小脑参考)并归一化到MNI模板(SPM8)，得到n=64受试者，p=343,099体素/图像的数据集。数据集重新采样了10次，并应用了Sparse_kM。计算平均体素权重图像，显示最感兴趣的皮质区域，包括楔前叶和额叶皮质;这些是与早期淀粉样蛋白沉积有关的关键区域。64例受试者中有7例为PiB(+)， 47例为PiB(-)，置信度≥90%，其中1例为PiB(+)，置信度较低(80%)，其余9例为PiB(-)，置信度在50-70%范围内。综上所述，当p > n时，Sparse_kM与重采样可以帮助建立聚类的置信水平，并且可能是一种有前途的方法，用于揭示区分淀粉样蛋白负荷水平的信息体素/空间模式，包括过渡淀粉样蛋白+/-边界的水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Sparse Clustering with Resampling for Subject Classification in PET Amyloid Imaging Studies.

查看原文本刊更多论文

Sparse Clustering with Resampling for Subject Classification in PET Amyloid Imaging Studies.

Sparse k-means clustering (Sparse_kM) can exclude uninformative variables and yield reliable parsimonious clustering results, especially for p≫n. In this work, Sparse_kM and data resampling were combined to identify variables of greatest interest and define confidence levels for the clustering. The method was evaluated by statistical simulation and applied to PiB PET amyloid imaging data to identify normal control (NC) subjects with (+) or without (-) evidence of amyloid, i.e., PiB(+/-).

Simulations: A dataset of n=60 observations (3 groups of 20) and p=500 variables was generated for each simulation run; only 50 variables were truly different across groups. The dataset was resampled 20 times, Sparse_kM was applied to each sample and average variable weights were calculated. Probabilities of cluster membership, also called confidence levels, were computed (n=60). Simulations were performed 250 times. The 50 truly different variables were identified by variable weights that were 13-32 times greater than those for the 450 uninformative variables.

Human data: For the PiB PET dataset, images (ECAT HR+, 10-15 mCi, 90 min) were acquired for 64 cognitively normal subjects (74.1±5.4 yrs). Parametric PiB distribution volume ratio images were generated (Logan method, cerebellum reference) and normalized to the MNI template (SPM8) to produce a dataset of n=64 subjects and p=343,099 voxels/image. The dataset was resampled 10 times and Sparse_kM was applied. An average voxel weight image was computed that indicated cortical areas of greatest interest that included precuneus and frontal cortex; these are key areas linked to early amyloid deposition. Seven of 64 subjects were identified as PiB(+) and 47 as PiB(-) with confidence ≥ 90%, where another subject was PiB(+) at lower confidence (80%) and the other 9 subjects were PiB(-) at confidence in the range of 50-70%. In conclusion, Sparse_kM with resampling can help to establish confidence levels for clustering when p≫n and may be a promising method for revealing informative voxels/spatial patterns that distinguish levels of amyloid load, including that at the transitional amyloid +/- boundary.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium

自引率

0.00%

发文量