Carla S Möller-Levet, Simon N Archer, Derk-Jan Dijk
{"title":"Performance of Blood-Based Biomarkers for Human Circadian Pacemaker Phase: Training Sets Matter As Much As Feature-Selection Methods.","authors":"Carla S Möller-Levet, Simon N Archer, Derk-Jan Dijk","doi":"10.1177/07487304251358950","DOIUrl":null,"url":null,"abstract":"<p><p>Biomarkers are valuable tools in a wide range of human health areas including circadian medicine. Valid, low-burden, multivariate molecular approaches to assess circadian phase at scale in people living and working in the real world hold promise for translating basic circadian knowledge to practical applications. However, standards for the development and evaluation of these circadian biomarkers have not yet been established, even though several publications report such biomarkers and claim that the methods are universal. Here, we present a basic exploration of some of the determinants and confounds of blood-based biomarker development for suprachiasmatic nucleus (SCN) phase by reanalysing publicly available data sets. We compare performance of biomarkers based on three feature-selection methods: Partial Least Squares Regression, ZeitZeiger, and Elastic Net, as well as performance of a standard set of clock genes. We explore the effects of training sample size and the impact of the experimental protocols from which training samples are drawn and on which performance is tested. Approaches based on small sample sizes used for training are prone to poor performance due to overfitting. Performance to some extent depends on the feature-selection method, but at least as much on the experimental conditions from which the biomarker training samples were drawn. Performance of biomarkers developed under baseline conditions does not necessarily translate to protocols that mimic real-world scenarios such as shiftwork in which sleep may be restricted or desynchronized from the endogenous circadian SCN phase. The molecular features selected by the various approaches to develop biomarkers for the SCN phase show very little overlap although the processes associated with these features have common themes with response to steroid hormones, that is, cortisol being the most prominent. Overall, the findings indicate that establishment of circadian biomarkers should be guided by established biomarker-development concepts and foundational principles of human circadian biology.</p>","PeriodicalId":15056,"journal":{"name":"Journal of Biological Rhythms","volume":" ","pages":"7487304251358950"},"PeriodicalIF":2.1000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biological Rhythms","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1177/07487304251358950","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Biomarkers are valuable tools in a wide range of human health areas including circadian medicine. Valid, low-burden, multivariate molecular approaches to assess circadian phase at scale in people living and working in the real world hold promise for translating basic circadian knowledge to practical applications. However, standards for the development and evaluation of these circadian biomarkers have not yet been established, even though several publications report such biomarkers and claim that the methods are universal. Here, we present a basic exploration of some of the determinants and confounds of blood-based biomarker development for suprachiasmatic nucleus (SCN) phase by reanalysing publicly available data sets. We compare performance of biomarkers based on three feature-selection methods: Partial Least Squares Regression, ZeitZeiger, and Elastic Net, as well as performance of a standard set of clock genes. We explore the effects of training sample size and the impact of the experimental protocols from which training samples are drawn and on which performance is tested. Approaches based on small sample sizes used for training are prone to poor performance due to overfitting. Performance to some extent depends on the feature-selection method, but at least as much on the experimental conditions from which the biomarker training samples were drawn. Performance of biomarkers developed under baseline conditions does not necessarily translate to protocols that mimic real-world scenarios such as shiftwork in which sleep may be restricted or desynchronized from the endogenous circadian SCN phase. The molecular features selected by the various approaches to develop biomarkers for the SCN phase show very little overlap although the processes associated with these features have common themes with response to steroid hormones, that is, cortisol being the most prominent. Overall, the findings indicate that establishment of circadian biomarkers should be guided by established biomarker-development concepts and foundational principles of human circadian biology.
期刊介绍:
Journal of Biological Rhythms is the official journal of the Society for Research on Biological Rhythms and offers peer-reviewed original research in all aspects of biological rhythms, using genetic, biochemical, physiological, behavioral, epidemiological & modeling approaches, as well as clinical trials. Emphasis is on circadian and seasonal rhythms, but timely reviews and research on other periodicities are also considered. The journal is a member of the Committee on Publication Ethics (COPE).