David Bogumil, Xin Sheng, Peggy Wan, Lucy Xia, Loreall Pooler, Iona Cheng, Samantha Streicher, Brian Z Huang, Fei Chen, Daniel Stram, Sylvia Shen, Gillian King, Charleston W K Chiang, Chrissie Ongaco, Marcia Adams, Ivy McMullen, Peng Zhang, Hua Ling, Michelle Mawhinney, Kimberly F Doheny, Loïc Le Marchand, Lynne R Wilkens, Christopher A Haiman, David V Conti
{"title":"The Multiethnic Cohort: A Resource for the study of Genetic and non-Genetic Cancer Risk Across Populations.","authors":"David Bogumil, Xin Sheng, Peggy Wan, Lucy Xia, Loreall Pooler, Iona Cheng, Samantha Streicher, Brian Z Huang, Fei Chen, Daniel Stram, Sylvia Shen, Gillian King, Charleston W K Chiang, Chrissie Ongaco, Marcia Adams, Ivy McMullen, Peng Zhang, Hua Ling, Michelle Mawhinney, Kimberly F Doheny, Loïc Le Marchand, Lynne R Wilkens, Christopher A Haiman, David V Conti","doi":"10.1101/2025.06.09.25328993","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The Multiethnic Cohort Study (MEC) is a U.S. prospective cohort of over 215,000 participants, designed to investigate variation in risk factors and disease across diverse racial and ethnic groups. Over 74,000 participants contributed biospecimens for genetic studies. We describe this sub-cohort and demonstrate the types of analyses it enables.</p><p><strong>Methods: </strong>The MEC recruited adults aged 45-75 in California and Hawaii between 1993 and 1996. Cancer diagnoses were identified via state tumor registries. The MEC Genetics Database includes 73,139 participants with germline genotype data. We evaluated genetic similarity, its relationship with self-reported race/ethnicity, and baseline characteristics, including neighborhood socioeconomic status. Using breast, colorectal, and prostate cancer as examples, the database supports multi-ancestry genome-wide association studies (GWAS), evaluation of non-genetic factors, and time-to-event analyses.</p><p><strong>Results: </strong>Participants included 10,962 African Americans, 24,234 Japanese Americans, 17,242 Latinos, 5,488 Native Hawaiians, 14,649 Whites, and 564 other. Principal component analysis revealed substantial diversity in ancestry. Multiethnic GWAS demonstrated effective control of population stratification while replicating many previously discovered variants. Polygenic risk score (PRS) effects varied by racial and ethnic group. Time-to-event analysis showed associations between cancer incidence and neighborhood socioeconomic status, population descriptors, and genetic similarity.</p><p><strong>Discussion: </strong>The MEC Genetics Database enables comprehensive assessment of genetic and non-genetic cancer risk, revealing differences in absolute risk by race and ethnicity. Studying both types of risk factors in diverse and admixed populations is critical for improving risk characterization and reducing disparities. This resource supports future research in polygenic traits, gene-environment interactions, and integrated risk prediction.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204407/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.06.09.25328993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: The Multiethnic Cohort Study (MEC) is a U.S. prospective cohort of over 215,000 participants, designed to investigate variation in risk factors and disease across diverse racial and ethnic groups. Over 74,000 participants contributed biospecimens for genetic studies. We describe this sub-cohort and demonstrate the types of analyses it enables.
Methods: The MEC recruited adults aged 45-75 in California and Hawaii between 1993 and 1996. Cancer diagnoses were identified via state tumor registries. The MEC Genetics Database includes 73,139 participants with germline genotype data. We evaluated genetic similarity, its relationship with self-reported race/ethnicity, and baseline characteristics, including neighborhood socioeconomic status. Using breast, colorectal, and prostate cancer as examples, the database supports multi-ancestry genome-wide association studies (GWAS), evaluation of non-genetic factors, and time-to-event analyses.
Results: Participants included 10,962 African Americans, 24,234 Japanese Americans, 17,242 Latinos, 5,488 Native Hawaiians, 14,649 Whites, and 564 other. Principal component analysis revealed substantial diversity in ancestry. Multiethnic GWAS demonstrated effective control of population stratification while replicating many previously discovered variants. Polygenic risk score (PRS) effects varied by racial and ethnic group. Time-to-event analysis showed associations between cancer incidence and neighborhood socioeconomic status, population descriptors, and genetic similarity.
Discussion: The MEC Genetics Database enables comprehensive assessment of genetic and non-genetic cancer risk, revealing differences in absolute risk by race and ethnicity. Studying both types of risk factors in diverse and admixed populations is critical for improving risk characterization and reducing disparities. This resource supports future research in polygenic traits, gene-environment interactions, and integrated risk prediction.