The PCOS Phenotypes in Unselected Populations (P-PUP) study: participant clinical features and data harmonization on analysis of individual participant data.
Asmamaw Demis Bizuneh, Sylvia Kiconco, Arul Earnest, Mahnaz Bahri Khomami, Raja Ram Dhungana, Ricardo Azziz, Larisa V Suturina, Xiaomiao Zhao, Alessandra Gambineri, Fahimeh Ramezani Tehrani, Bulent O Yildiz, Jin Ju Kim, Liangzhi Xu, Christian Chigozie Makwe, Helena J Teede, Anju E Joham, Chau Thien Tay
BMC Medicine 23(1):420. Published 2025-07-15. DOI: 10.1186/s12916-025-04221-9. Protocol registration: CRD42021267847.
Abstract
Background: Polycystic ovary syndrome (PCOS) is a multifaceted condition with diagnostic challenges and clinical heterogeneity across populations. Research priorities include defining cut-offs for diagnostic features more accurately. Here, we describe participant clinical features and data harmonization in the international PCOS Phenotypes in Unselected Populations (P-PUP) study.
Methods: We searched EMBASE and Medline (Ovid) from 1990 to October 2, 2020, for studies of population-based, medically unbiased cohorts. Included studies had ≥ 300 participants, directly assessed PCOS-related features, and provided individual participant data (IPD). Risk of bias was assessed using the AXIS tool. Data integrity was ensured by cross-referencing, identifying outliers and implausible values, and harmonizing variables. Reporting follows the PRISMA-IPD guidelines, and findings are summarized as frequencies and proportions.
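The outlier and implausible-value checks mentioned in the Methods can be illustrated with a minimal Python sketch. The abstract does not state which rules were applied, so the field names, the plausibility ranges, and the 1.5 × IQR fence below are assumptions chosen for illustration only (the 0 to 36 range for the mFG score follows from nine body areas scored 0 to 4).

```python
from statistics import quantiles

# Hypothetical plausibility ranges; the abstract does not list the ones used.
PLAUSIBLE_RANGES = {
    "age_years": (15, 50),
    "bmi_kg_m2": (12, 70),
    "mfg_score": (0, 36),  # nine body areas, each scored 0 to 4
}


def flag_implausible(record):
    """Return the names of fields whose values fall outside the assumed range."""
    flags = []
    for field, (low, high) in PLAUSIBLE_RANGES.items():
        value = record.get(field)
        if value is not None and not (low <= value <= high):
            flags.append(field)
    return flags


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the first or third quartile."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    lower_fence, upper_fence = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < lower_fence or v > upper_fence]


# Illustrative values only, not study data.
example = {"age_years": 27, "bmi_kg_m2": 22.4, "mfg_score": 40}
print(flag_implausible(example))
print(iqr_outliers([28, 29, 30, 31, 32, 90]))
```

Flagged values would normally be queried with the contributing study team rather than dropped automatically; the sketch only marks them.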
Results: The study included 9979 reproductive-age women from 12 studies across eight countries (China, Iran, Italy, Nigeria, Russia, South Korea, Turkey, and the USA), representing 11 ethnicities. Ovulatory dysfunction was recorded inconsistently across studies: as mean menstrual cycle length, minimum or maximum cycle length, number of cycles per year, or urinary progesterone measurements. Clinical hyperandrogenism was assessed via modified Ferriman-Gallwey (mFG) scores, and a few studies also recorded acne and alopecia. Biochemical hyperandrogenism thresholds varied (95th, 97.5th, or 98th percentile of healthy controls). Polycystic ovary morphology was assessed via transvaginal, transabdominal, or transrectal approaches. For harmonization, we applied the International PCOS Guidelines definition of ovulatory dysfunction, ethnicity-specific cut-offs for hirsutism (derived via k-means clustering), and the 95th percentile threshold for biochemical hyperandrogenism. PCOS prevalence ranged from 3.3% to 19.8% in the original studies and was 11.0% overall after harmonization.
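To make the harmonized biochemical criterion concrete, the sketch below derives a 95th percentile cut-off from a reference group of healthy controls and flags participants above it. This is a minimal illustration, not the study's code: the function names, the linear-interpolation percentile definition, and the choice of a strict "greater than" comparison are all assumptions, and the numbers are made up.

```python
def percentile(values, p):
    """p-th percentile (0 to 100) using linear interpolation between ranks."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def flag_biochemical_ha(androgen_values, control_values, p=95):
    """Return (cutoff, flags); a flag is True when a value exceeds the cut-off.

    The cut-off is the p-th percentile of the healthy-control values.
    Using a strict '>' comparison is an assumption of this sketch.
    """
    cutoff = percentile(control_values, p)
    return cutoff, [value > cutoff for value in androgen_values]


# Illustrative androgen measurements in arbitrary units, not study data.
controls = list(range(1, 22))
cutoff, flags = flag_biochemical_ha([19, 20, 21], controls)
print(cutoff, flags)
```

Re-deriving one percentile from pooled or per-study controls replaces the mixture of 95th, 97.5th, and 98th percentile thresholds used in the original studies, at the cost of depending on how the control group is defined.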
Conclusions: The P-PUP study offers an unprecedented, ethnically diverse, medically unbiased population-based cohort, an extraordinarily valuable tool to enhance knowledge and research in PCOS. However, variability across studies in data collection methods and in definitions of PCOS diagnostic features limited the ability to fully integrate data for analysis. Despite these limitations, we optimized harmonization in this IPD analysis; the findings provide valuable insights into the challenges of data harmonization and establish a foundation for future collaborative research. Future research should focus on standardizing data collection, establishing normative cut-offs based on true natural groupings, and linking diagnostic clusters to outcomes in diverse populations.
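The idea of deriving cut-offs from natural groupings, as done for the ethnicity-specific hirsutism cut-offs via k-means clustering, can be sketched as follows. The abstract does not give the number of clusters, the initialization, or how a cut-off was read off the clusters, so this sketch assumes two clusters on one-dimensional mFG scores, starts the centres at the minimum and maximum score, and takes the midpoint between the two final centres as the cut-off. The group label and scores are hypothetical.

```python
def kmeans_1d_two_clusters(scores, max_iter=100):
    """Lloyd's algorithm with k=2 on one-dimensional data.

    Centres start at the minimum and maximum score (an assumption made
    so that the result is deterministic). Returns (low_centre, high_centre).
    """
    if len(set(scores)) < 2:
        raise ValueError("need at least two distinct values")
    low, high = float(min(scores)), float(max(scores))
    for _ in range(max_iter):
        # Assignment step: each score goes to its nearest centre.
        low_group = [s for s in scores if abs(s - low) <= abs(s - high)]
        high_group = [s for s in scores if abs(s - low) > abs(s - high)]
        # Update step: each centre moves to the mean of its group.
        new_low = sum(low_group) / len(low_group)
        new_high = sum(high_group) / len(high_group)
        if (new_low, new_high) == (low, high):
            break
        low, high = new_low, new_high
    return low, high


def cutoff_by_group(scores_by_group):
    """Midpoint between the two cluster centres, computed per group."""
    cutoffs = {}
    for group, scores in scores_by_group.items():
        low, high = kmeans_1d_two_clusters(scores)
        cutoffs[group] = (low + high) / 2
    return cutoffs


# Hypothetical group label and mFG scores, not study data.
print(cutoff_by_group({"group_a": [0, 1, 1, 2, 2, 3, 8, 9, 10]}))
```

With two one-dimensional clusters, the midpoint between centres is exactly the boundary at which k-means reassigns a score from one cluster to the other, which is why it is a natural choice of cut-off in this simplified setting.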
About the journal:
BMC Medicine is an open access, transparent peer-reviewed general medical journal. It is the flagship journal of the BMC series and publishes outstanding and influential research in various areas including clinical practice, translational medicine, medical and health advances, public health, global health, policy, and general topics of interest to the biomedical and sociomedical professional communities. In addition to research articles, the journal also publishes stimulating debates, reviews, unique forum articles, and concise tutorials. All articles published in BMC Medicine are included in various databases such as Biological Abstracts, BIOSIS, CAS, Citebase, Current contents, DOAJ, Embase, MEDLINE, PubMed, Science Citation Index Expanded, OAIster, SCImago, Scopus, SOCOLAR, and Zetoc.