Lucio José Pantazis, Gustavo Daniel Frechtel, Gloria Edith Cerrone, Rafael García, Andrea Elena Iglesias Molli
Phenotype similarities in automatically grouped T2D patients by variation-based clustering of IL-1β gene expression.
Electronic Journal of the International Federation of Clinical Chemistry and Laboratory Medicine, 34(3), pp. 228-244. Published 2023-10-16.
Open-access PDF: https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/cd/1b/ejifcc-34-228.PMC10588079.pdf
Abstract
Background: Analyzing longitudinal gene expression data is challenging because of limited prior information, high dimensionality, and heterogeneity. Similar difficulties arise in research on multifactorial diseases such as Type 2 Diabetes (T2D). Clustering methods can automatically group similar observations, and clinical values shared within the resulting groups suggest potential associations. However, applying traditional clustering methods directly to gene expression over time fails to capture variations in the response. Shape-based clustering can instead be applied to identify patient groups by gene expression variation during a long-term metabolic compensatory intervention.
Objectives: To search for clinical grouping patterns among subjects who showed a similar structure in the variation of IL-1β gene expression over time.
Methods: A new approach for shape-based clustering by IL-1β expression behavior was applied to a real longitudinal database of Type 2 Diabetes patients. To correctly capture variations in the response, we applied traditional clustering methods to the slopes between measurements rather than to the measurements themselves.
Results: In this setting, K-Medoids with the Manhattan distance yielded the best results for this database. Among the resulting groups, one cluster differed significantly from the rest of the data in many key clinical values related to metabolic syndrome.
Conclusions: The proposed method can be used to group patients according to variation patterns in gene expression (or in other applications) and thus provide clinical insights even when there is no prior knowledge of the subjects' clinical profiles and only a few timepoints are available for each individual.
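
The idea summarized in Methods and Results can be illustrated with a minimal sketch. It is not the authors' implementation: the toy expression values, the timepoints, the function names, and the exhaustive medoid search are all assumptions made for illustration (a practical K-Medoids would use PAM-style swaps). The sketch converts each subject's series to slopes between consecutive measurements, then runs K-Medoids with the Manhattan distance on those slopes and, for contrast, on the raw values.

```python
from itertools import combinations


def slopes(times, values):
    """Slopes between consecutive measurements of one subject."""
    return [
        (values[i + 1] - values[i]) / (times[i + 1] - times[i])
        for i in range(len(values) - 1)
    ]


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(manhattan(p, points[m]) for m in medoids) for p in points)


def k_medoids(points, k):
    """Exhaustive K-Medoids: try every set of k medoids, keep the cheapest.

    Only feasible for tiny toy data; shown here for clarity, not speed.
    """
    best = min(
        combinations(range(len(points)), k),
        key=lambda meds: total_cost(points, meds),
    )
    labels = [
        min(range(k), key=lambda j: manhattan(p, points[best[j]]))
        for p in points
    ]
    return list(best), labels


# Hypothetical data: 4 subjects, 3 timepoints each (arbitrary units).
times = [0, 1, 2]
expression = [
    [1.0, 2.0, 3.0],  # rising, low level
    [5.0, 6.1, 7.0],  # rising, high level
    [3.0, 2.0, 1.0],  # falling, low level
    [7.0, 6.0, 4.9],  # falling, high level
]

# Cluster on slopes (variation) versus on raw values (level).
slope_vectors = [slopes(times, series) for series in expression]
_, slope_labels = k_medoids(slope_vectors, k=2)
_, raw_labels = k_medoids(expression, k=2)

print("labels from slopes:", slope_labels)
print("labels from raw values:", raw_labels)
```

On this toy data, clustering the slopes groups the two rising subjects together and the two falling subjects together, while clustering the raw values groups subjects by expression level, which is the limitation of traditional clustering that the Background describes.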