Sangjun Lee, Kwang-Pil Ko, Jung Eun Lee, Inah Kim, Sun Ha Jee, Aesun Shin, Sun-Seog Kweon, Min-Ho Shin, Sangmin Park, Seungho Ryu, Sun Young Yang, Seung Ho Choi, Jeongseon Kim, Sang-Wook Yi, Daehee Kang, Keun-Young Yoo, Sue K Park
{"title":"The Korea Cohort Consortium: The Future of Pooling Cohort Studies.","authors":"Sangjun Lee, Kwang-Pil Ko, Jung Eun Lee, Inah Kim, Sun Ha Jee, Aesun Shin, Sun-Seog Kweon, Min-Ho Shin, Sangmin Park, Seungho Ryu, Sun Young Yang, Seung Ho Choi, Jeongseon Kim, Sang-Wook Yi, Daehee Kang, Keun-Young Yoo, Sue K Park","doi":"10.3961/jpmph.22.299","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>We introduced the cohort studies included in the Korea Cohort Consortium (KCC), focusing on large-scale cohort studies established in Korea with a prolonged follow-up period. Moreover, we also provided projections of the follow-up and estimates of the sample size that would be necessary for big-data analyses based on pooling established cohort studies, including population-based genomic studies.</p><p><strong>Methods: </strong>We mainly focused on the characteristics of individual cohort studies from the KCC. We developed \"PROFAN\", a Shiny application for projecting the follow-up period to achieve a certain number of cases when pooling established cohort studies. As examples, we projected the follow-up periods for 5000 cases of gastric cancer, 2500 cases of prostate and breast cancer, and 500 cases of non-Hodgkin lymphoma. The sample sizes for sequencing-based analyses based on a 1:1 case-control study were also calculated.</p><p><strong>Results: </strong>The KCC consisted of 8 individual cohort studies, of which 3 were community-based and 5 were health screening-based cohorts. The population-based cohort studies were mainly organized by Korean government agencies and research institutes. The projected follow-up period was at least 10 years to achieve 5000 cases based on a cohort of 0.5 million participants. The mean of the minimum to maximum sample sizes for performing sequencing analyses was 5917-72 102.</p><p><strong>Conclusions: </strong>We propose an approach to establish a large-scale consortium based on the standardization and harmonization of existing cohort studies to obtain adequate statistical power with a sufficient sample size to analyze high-risk groups or rare cancer subtypes.</p>","PeriodicalId":520687,"journal":{"name":"Journal of preventive medicine and public health = Yebang Uihakhoe chi","volume":" ","pages":"464-474"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/2c/99/jpmph-22-299.PMC9561144.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of preventive medicine and public health = Yebang Uihakhoe chi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3961/jpmph.22.299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/9/12 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Objectives: We introduced the cohort studies included in the Korea Cohort Consortium (KCC), focusing on large-scale cohort studies established in Korea with a prolonged follow-up period. Moreover, we also provided projections of the follow-up and estimates of the sample size that would be necessary for big-data analyses based on pooling established cohort studies, including population-based genomic studies.
Methods: We mainly focused on the characteristics of individual cohort studies from the KCC. We developed "PROFAN", a Shiny application for projecting the follow-up period to achieve a certain number of cases when pooling established cohort studies. As examples, we projected the follow-up periods for 5000 cases of gastric cancer, 2500 cases of prostate and breast cancer, and 500 cases of non-Hodgkin lymphoma. The sample sizes for sequencing-based analyses based on a 1:1 case-control study were also calculated.
Results: The KCC consisted of 8 individual cohort studies, of which 3 were community-based and 5 were health screening-based cohorts. The population-based cohort studies were mainly organized by Korean government agencies and research institutes. The projected follow-up period was at least 10 years to achieve 5000 cases based on a cohort of 0.5 million participants. The mean of the minimum to maximum sample sizes for performing sequencing analyses was 5917-72 102.
Conclusions: We propose an approach to establish a large-scale consortium based on the standardization and harmonization of existing cohort studies to obtain adequate statistical power with a sufficient sample size to analyze high-risk groups or rare cancer subtypes.