Nonparametric clustering for longitudinal functional data with the application to H-NMR spectra of kidney transplant patients. Longitudinal functional data clustering.

IF 1.2 4区生物学 Q4 BIOLOGY

Theoretical Biology Forum Pub Date : 2021-01-01 DOI:10.19272/202111401003

Minzhen Xie, Haiyan Liu, Jeanine Houwing-Duistermaat

{"title":"Nonparametric clustering for longitudinal functional data with the application to H-NMR spectra of kidney transplant patients. Longitudinal functional data clustering.","authors":"Minzhen Xie, Haiyan Liu, Jeanine Houwing-Duistermaat","doi":"10.19272/202111401003","DOIUrl":null,"url":null,"abstract":"Longitudinal functional data are increasingly common in the health domain. The motivated dataset for this paper comprises H-NMR spectra of kidney transplant patients [8]. Our aim is to cluster patients into different clinical outcome subgoups to reveal the success of the transplantation. The NMR spectra of each patient at each time point are functional data and the data are longitudinally collected at up to nine different time points. Existing methods are available for functional data collected at one time point, but not for longitudinal functional data collected at a grid of time points subject to missingness. We therefore first apply a method to extract the same number of functional feactures for each subject. Next we propose a novel nonparametric clustering method for mulitivariate functional data. We applied our proposed clustering method to the kidney transplant dataset both to a subset of the raw data with only two time points and the extacted functional features. It appeared that the proposed method achieves better clustering performance on the extracted functional features than on the subset of raw data. A data simulation study was performed to further evaluate the method. The design mimiced the kidney transplant dataset but with a larger sample size. Scenarios which have different levels of noise were considered. The simulation study showed the accuarcy of our proposed method.","PeriodicalId":55980,"journal":{"name":"Theoretical Biology Forum","volume":"114 1-2 1","pages":"15-28"},"PeriodicalIF":1.2000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Biology Forum","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.19272/202111401003","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 2

Abstract

Longitudinal functional data are increasingly common in the health domain. The motivated dataset for this paper comprises H-NMR spectra of kidney transplant patients [8]. Our aim is to cluster patients into different clinical outcome subgoups to reveal the success of the transplantation. The NMR spectra of each patient at each time point are functional data and the data are longitudinally collected at up to nine different time points. Existing methods are available for functional data collected at one time point, but not for longitudinal functional data collected at a grid of time points subject to missingness. We therefore first apply a method to extract the same number of functional feactures for each subject. Next we propose a novel nonparametric clustering method for mulitivariate functional data. We applied our proposed clustering method to the kidney transplant dataset both to a subset of the raw data with only two time points and the extacted functional features. It appeared that the proposed method achieves better clustering performance on the extracted functional features than on the subset of raw data. A data simulation study was performed to further evaluate the method. The design mimiced the kidney transplant dataset but with a larger sample size. Scenarios which have different levels of noise were considered. The simulation study showed the accuarcy of our proposed method.

查看原文本刊更多论文

纵向功能数据的非参数聚类及其在肾移植患者H-NMR谱中的应用。纵向功能数据聚类。

纵向功能数据在健康领域越来越常见。本文的动机数据集包括肾移植患者的H-NMR光谱[8]。我们的目的是将患者分为不同的临床结果亚组，以揭示移植的成功。每个患者在每个时间点的NMR光谱都是功能数据，并且在多达九个不同的时间点纵向收集数据。现有方法可用于在一个时间点收集的功能数据，但不适用于在可能丢失的时间点网格收集的纵向功能数据。因此，我们首先应用一种方法为每个受试者提取相同数量的功能特征。接下来，我们提出了一种新的多变量函数数据的非参数聚类方法。我们将我们提出的聚类方法应用于肾移植数据集，既应用于只有两个时间点的原始数据子集，也应用于提取的功能特征。与原始数据子集相比，所提出的方法在提取的功能特征上实现了更好的聚类性能。进行了数据模拟研究以进一步评估该方法。该设计模仿了肾移植数据集，但样本量更大。考虑了具有不同噪声水平的场景。仿真研究表明了该方法的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Theoretical Biology Forum Agricultural and Biological Sciences-General Agricultural and Biological Sciences

CiteScore

1.10

自引率

0.00%

发文量