Rini S S Veeravalli, Laura J Horsfall, Kenan Direk, Irene Petersen
{"title":"Prevalence and Data-Driven Exploration of Pre-Diagnostic Symptoms and Features of Gilbert's Syndrome in the UK Primary Care Population.","authors":"Rini S S Veeravalli, Laura J Horsfall, Kenan Direk, Irene Petersen","doi":"10.2147/CLEP.S520589","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Gilbert's syndrome (GS) is a common genetic disorder marked by elevated bilirubin levels due to UGT1A1 enzyme deficiency. While jaundice and some adverse drug reactions are the primary recognised clinical features, individuals with GS frequently report non-specific symptoms like fatigue, brain fog, and abdominal pain. This study investigates the symptoms and diagnostic triggers of GS using UK primary care electronic health records.</p><p><strong>Methods: </strong>We analysed data from the IQVIA Medical Research Database, covering over 11 million active UK patients. Individuals with a recorded GS diagnosis were identified and their sociodemographic profiles described. Using a nested case-control design, we applied machine learning-based feature selection to pinpoint key clinical features recorded up to five years before diagnosis. These features were then examined longitudinally by sex to distinguish persistent symptoms from short-term diagnostic triggers.</p><p><strong>Results: </strong>The estimated UK prevalence of GS was 180.4 per 100,000 (95% CI: 174.4-186.6), with diagnoses more common in men, peaking around age 35, and more frequent in areas of least social deprivation. Among 9,240 GS cases and 150,846 controls, machine learning identified key diagnostic themes including jaundice, abnormal liver function tests, abdominal pain, fatigue, bowel changes, and sleep disturbances. While most of these features appeared primarily in the year prior to diagnosis, only abdominal pain and fatigue were consistently more common in GS cases up to five years before diagnosis.</p><p><strong>Conclusion: </strong>Our findings highlight both expected and novel GS diagnostic triggers. While many features likely reflect known symptomology or incidental detection via routine testing, the persistent presence of fatigue and abdominal pain suggests they may be under-recognised symptoms of GS. These findings warrant further investigation, and the data-driven approach used here may help uncover early signs of other underdiagnosed genetic conditions.</p>","PeriodicalId":10362,"journal":{"name":"Clinical Epidemiology","volume":"17 ","pages":"721-733"},"PeriodicalIF":3.2000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12402429/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/CLEP.S520589","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Gilbert's syndrome (GS) is a common genetic disorder marked by elevated bilirubin levels due to UGT1A1 enzyme deficiency. While jaundice and some adverse drug reactions are the primary recognised clinical features, individuals with GS frequently report non-specific symptoms like fatigue, brain fog, and abdominal pain. This study investigates the symptoms and diagnostic triggers of GS using UK primary care electronic health records.
Methods: We analysed data from the IQVIA Medical Research Database, covering over 11 million active UK patients. Individuals with a recorded GS diagnosis were identified and their sociodemographic profiles described. Using a nested case-control design, we applied machine learning-based feature selection to pinpoint key clinical features recorded up to five years before diagnosis. These features were then examined longitudinally by sex to distinguish persistent symptoms from short-term diagnostic triggers.
Results: The estimated UK prevalence of GS was 180.4 per 100,000 (95% CI: 174.4-186.6), with diagnoses more common in men, peaking around age 35, and more frequent in areas of least social deprivation. Among 9,240 GS cases and 150,846 controls, machine learning identified key diagnostic themes including jaundice, abnormal liver function tests, abdominal pain, fatigue, bowel changes, and sleep disturbances. While most of these features appeared primarily in the year prior to diagnosis, only abdominal pain and fatigue were consistently more common in GS cases up to five years before diagnosis.
Conclusion: Our findings highlight both expected and novel GS diagnostic triggers. While many features likely reflect known symptomology or incidental detection via routine testing, the persistent presence of fatigue and abdominal pain suggests they may be under-recognised symptoms of GS. These findings warrant further investigation, and the data-driven approach used here may help uncover early signs of other underdiagnosed genetic conditions.
期刊介绍:
Clinical Epidemiology is an international, peer reviewed, open access journal. Clinical Epidemiology focuses on the application of epidemiological principles and questions relating to patients and clinical care in terms of prevention, diagnosis, prognosis, and treatment.
Clinical Epidemiology welcomes papers covering these topics in form of original research and systematic reviews.
Clinical Epidemiology has a special interest in international electronic medical patient records and other routine health care data, especially as applied to safety of medical interventions, clinical utility of diagnostic procedures, understanding short- and long-term clinical course of diseases, clinical epidemiological and biostatistical methods, and systematic reviews.
When considering submission of a paper utilizing publicly-available data, authors should ensure that such studies add significantly to the body of knowledge and that they use appropriate validated methods for identifying health outcomes.
The journal has launched special series describing existing data sources for clinical epidemiology, international health care systems and validation studies of algorithms based on databases and registries.