Semih Fazlı Kayahan , Muhammed Fatih Alaeddinoğlu , Mehmet Şeneş
{"title":"从目标分组到模糊参考区间:甲状腺功能测试的标准化机器学习方法。","authors":"Semih Fazlı Kayahan , Muhammed Fatih Alaeddinoğlu , Mehmet Şeneş","doi":"10.1016/j.cca.2025.120635","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Accurate interpretation of thyroid function tests (TFTs) requires reliable reference intervals (RIs). Indirect methods based on retrospective laboratory data are increasingly used, but current strategies face major limitations, including rigid age cut-offs, inconsistent partitioning, and lack of objective evaluation of subgrouping criteria. To address these challenges, we developed a novel machine learning (ML)-based framework for the objective, data-driven determination of continuous and personalized RIs.</div></div><div><h3>Methods</h3><div>A total of 48,397 records (2019–2021) from individuals aged ≥18 years were retrieved from the hospital laboratory information system. After applying inclusion and exclusion criteria, 9455 individuals constituted the reference sample group. Partitioning was based on age, sex, thyroid-stimulating hormone (TSH), and free thyroxine (fT4). The Elbow method suggested the optimal number of clusters; K-means clustering was used to form subgroups, and the Extra Trees Classifier quantified the relative importance of partitioning criteria. RIs were estimated using the non-parametric percentile method, and Fuzzy C-Means clustering was applied to smooth sharp transitions between groups.</div></div><div><h3>Results</h3><div>Six subgroups were identified, with age as the dominant determinant (feature importance score = 0.96), whereas sex had a negligible effect. Fuzzy clustering generated continuous and personalized RIs. In clinical evaluation, applying fuzzy RIs reclassified 7 % of patients, predominantly shifting diagnoses toward hyperthyroidism.</div></div><div><h3>Conclusions</h3><div>This study offers a universal and adaptable ML-based framework for generating continuous, personalized RIs. This advance toward precision laboratory medicine can enhance diagnostic accuracy, reduce misclassification, and support more patient-centered care.</div></div>","PeriodicalId":10205,"journal":{"name":"Clinica Chimica Acta","volume":"579 ","pages":"Article 120635"},"PeriodicalIF":2.9000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From objective grouping to fuzzy reference intervals: A standardized machine learning approach for thyroid function tests\",\"authors\":\"Semih Fazlı Kayahan , Muhammed Fatih Alaeddinoğlu , Mehmet Şeneş\",\"doi\":\"10.1016/j.cca.2025.120635\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Accurate interpretation of thyroid function tests (TFTs) requires reliable reference intervals (RIs). Indirect methods based on retrospective laboratory data are increasingly used, but current strategies face major limitations, including rigid age cut-offs, inconsistent partitioning, and lack of objective evaluation of subgrouping criteria. To address these challenges, we developed a novel machine learning (ML)-based framework for the objective, data-driven determination of continuous and personalized RIs.</div></div><div><h3>Methods</h3><div>A total of 48,397 records (2019–2021) from individuals aged ≥18 years were retrieved from the hospital laboratory information system. After applying inclusion and exclusion criteria, 9455 individuals constituted the reference sample group. Partitioning was based on age, sex, thyroid-stimulating hormone (TSH), and free thyroxine (fT4). The Elbow method suggested the optimal number of clusters; K-means clustering was used to form subgroups, and the Extra Trees Classifier quantified the relative importance of partitioning criteria. RIs were estimated using the non-parametric percentile method, and Fuzzy C-Means clustering was applied to smooth sharp transitions between groups.</div></div><div><h3>Results</h3><div>Six subgroups were identified, with age as the dominant determinant (feature importance score = 0.96), whereas sex had a negligible effect. Fuzzy clustering generated continuous and personalized RIs. In clinical evaluation, applying fuzzy RIs reclassified 7 % of patients, predominantly shifting diagnoses toward hyperthyroidism.</div></div><div><h3>Conclusions</h3><div>This study offers a universal and adaptable ML-based framework for generating continuous, personalized RIs. This advance toward precision laboratory medicine can enhance diagnostic accuracy, reduce misclassification, and support more patient-centered care.</div></div>\",\"PeriodicalId\":10205,\"journal\":{\"name\":\"Clinica Chimica Acta\",\"volume\":\"579 \",\"pages\":\"Article 120635\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinica Chimica Acta\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009898125005145\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinica Chimica Acta","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009898125005145","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
From objective grouping to fuzzy reference intervals: A standardized machine learning approach for thyroid function tests
Background
Accurate interpretation of thyroid function tests (TFTs) requires reliable reference intervals (RIs). Indirect methods based on retrospective laboratory data are increasingly used, but current strategies face major limitations, including rigid age cut-offs, inconsistent partitioning, and lack of objective evaluation of subgrouping criteria. To address these challenges, we developed a novel machine learning (ML)-based framework for the objective, data-driven determination of continuous and personalized RIs.
Methods
A total of 48,397 records (2019–2021) from individuals aged ≥18 years were retrieved from the hospital laboratory information system. After applying inclusion and exclusion criteria, 9455 individuals constituted the reference sample group. Partitioning was based on age, sex, thyroid-stimulating hormone (TSH), and free thyroxine (fT4). The Elbow method suggested the optimal number of clusters; K-means clustering was used to form subgroups, and the Extra Trees Classifier quantified the relative importance of partitioning criteria. RIs were estimated using the non-parametric percentile method, and Fuzzy C-Means clustering was applied to smooth sharp transitions between groups.
Results
Six subgroups were identified, with age as the dominant determinant (feature importance score = 0.96), whereas sex had a negligible effect. Fuzzy clustering generated continuous and personalized RIs. In clinical evaluation, applying fuzzy RIs reclassified 7 % of patients, predominantly shifting diagnoses toward hyperthyroidism.
Conclusions
This study offers a universal and adaptable ML-based framework for generating continuous, personalized RIs. This advance toward precision laboratory medicine can enhance diagnostic accuracy, reduce misclassification, and support more patient-centered care.
期刊介绍:
The Official Journal of the International Federation of Clinical Chemistry and Laboratory Medicine (IFCC)
Clinica Chimica Acta is a high-quality journal which publishes original Research Communications in the field of clinical chemistry and laboratory medicine, defined as the diagnostic application of chemistry, biochemistry, immunochemistry, biochemical aspects of hematology, toxicology, and molecular biology to the study of human disease in body fluids and cells.
The objective of the journal is to publish novel information leading to a better understanding of biological mechanisms of human diseases, their prevention, diagnosis, and patient management. Reports of an applied clinical character are also welcome. Papers concerned with normal metabolic processes or with constituents of normal cells or body fluids, such as reports of experimental or clinical studies in animals, are only considered when they are clearly and directly relevant to human disease. Evaluation of commercial products have a low priority for publication, unless they are novel or represent a technological breakthrough. Studies dealing with effects of drugs and natural products and studies dealing with the redox status in various diseases are not within the journal''s scope. Development and evaluation of novel analytical methodologies where applicable to diagnostic clinical chemistry and laboratory medicine, including point-of-care testing, and topics on laboratory management and informatics will also be considered. Studies focused on emerging diagnostic technologies and (big) data analysis procedures including digitalization, mobile Health, and artificial Intelligence applied to Laboratory Medicine are also of interest.