{"title":"Use of indirect methods and machine learning algorithms for the estimation of reference intervals, taking cortisol measurements as an example.","authors":"Fatma Demet Arslan, Georg Hoffmann","doi":"10.1515/cclm-2025-0984","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study aims to determine reliable reference intervals (RIs) for total cortisol (TC) in adults considering the effects of both age and blood collection time, using indirect methods and machine learning approaches.</p><p><strong>Methods: </strong>Serum TC results from blood samples collected between 08:00 and 10:00 am at the first outpatient visit were included in the study. Serum TC were measured using a Roche Elecsys Cortisol II kit. Estimated RIs by indirect methods with the support of R packages (refineR and reflimR) for the implementation of machine learning algorithms (mclust and rpart) were compared with the manufacturer's reference interval (RI) (48-195 μg/L).</p><p><strong>Results: </strong>Estimated RIs by refineR and reflimR (57-256 μg/L and 62-271 μg/L, respectively) were wider than the manufacturer's RI. When reflimR was applied to Box-Cox-transformed data with the lambda value of 0.284 suggested by refineR, an RI of 57-251 μg/L was obtained, which was like that obtained with refineR. An even better match with the manufacturer's RI was achieved using Gaussian mixture modelling with the mclust, which suggested one out of four clusters with an RI of 55.8-187 μg/L. Clustering the data with rpart suggested stratification into two age groups (≤35 and >35 years) and three blood collection periods (08:00-08:45, 08:45-09:35, and 09:35-10:00). The TC levels demonstrated the highest concentrations in the early morning (8:00-08:45) and in young adults (18-35 years).</p><p><strong>Conclusions: </strong>This study highlights the necessity of considering both age and blood collection time in clinical interpretation and demonstrates the effectiveness of indirect methods and machine learning approaches in the verification of RIs for hormones with known heterogeneity.</p>","PeriodicalId":10390,"journal":{"name":"Clinical chemistry and laboratory medicine","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry and laboratory medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1515/cclm-2025-0984","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: This study aims to determine reliable reference intervals (RIs) for total cortisol (TC) in adults considering the effects of both age and blood collection time, using indirect methods and machine learning approaches.
Methods: Serum TC results from blood samples collected between 08:00 and 10:00 am at the first outpatient visit were included in the study. Serum TC were measured using a Roche Elecsys Cortisol II kit. Estimated RIs by indirect methods with the support of R packages (refineR and reflimR) for the implementation of machine learning algorithms (mclust and rpart) were compared with the manufacturer's reference interval (RI) (48-195 μg/L).
Results: Estimated RIs by refineR and reflimR (57-256 μg/L and 62-271 μg/L, respectively) were wider than the manufacturer's RI. When reflimR was applied to Box-Cox-transformed data with the lambda value of 0.284 suggested by refineR, an RI of 57-251 μg/L was obtained, which was like that obtained with refineR. An even better match with the manufacturer's RI was achieved using Gaussian mixture modelling with the mclust, which suggested one out of four clusters with an RI of 55.8-187 μg/L. Clustering the data with rpart suggested stratification into two age groups (≤35 and >35 years) and three blood collection periods (08:00-08:45, 08:45-09:35, and 09:35-10:00). The TC levels demonstrated the highest concentrations in the early morning (8:00-08:45) and in young adults (18-35 years).
Conclusions: This study highlights the necessity of considering both age and blood collection time in clinical interpretation and demonstrates the effectiveness of indirect methods and machine learning approaches in the verification of RIs for hormones with known heterogeneity.
期刊介绍:
Clinical Chemistry and Laboratory Medicine (CCLM) publishes articles on novel teaching and training methods applicable to laboratory medicine. CCLM welcomes contributions on the progress in fundamental and applied research and cutting-edge clinical laboratory medicine. It is one of the leading journals in the field, with an impact factor over 3. CCLM is issued monthly, and it is published in print and electronically.
CCLM is the official journal of the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) and publishes regularly EFLM recommendations and news. CCLM is the official journal of the National Societies from Austria (ÖGLMKC); Belgium (RBSLM); Germany (DGKL); Hungary (MLDT); Ireland (ACBI); Italy (SIBioC); Portugal (SPML); and Slovenia (SZKK); and it is affiliated to AACB (Australia) and SFBC (France).
Topics:
- clinical biochemistry
- clinical genomics and molecular biology
- clinical haematology and coagulation
- clinical immunology and autoimmunity
- clinical microbiology
- drug monitoring and analysis
- evaluation of diagnostic biomarkers
- disease-oriented topics (cardiovascular disease, cancer diagnostics, diabetes)
- new reagents, instrumentation and technologies
- new methodologies
- reference materials and methods
- reference values and decision limits
- quality and safety in laboratory medicine
- translational laboratory medicine
- clinical metrology
Follow @cclm_degruyter on Twitter!