Eric Giunta, Dawson Stutzman, Sarah S Cohen, Benjamin French, Linda Walsh, Lawrence T Dauer, John D Boice, Steve R Blattnig, Dan Andresen, Amir A Bahadori
{"title":"Colossus: software for radiation epidemiological studies with big data.","authors":"Eric Giunta, Dawson Stutzman, Sarah S Cohen, Benjamin French, Linda Walsh, Lawrence T Dauer, John D Boice, Steve R Blattnig, Dan Andresen, Amir A Bahadori","doi":"10.1088/1361-6498/adcd80","DOIUrl":null,"url":null,"abstract":"<p><p>Colossus is designed to meet a growing need for survival analysis software capable of analyzing tens of millions of rows of radiation epidemiological data. Colossus is an R package devised to offer scalable survival analysis for the Million Person Study. The total and relative rate equations available in Colossus are outlined in this article, which are used in conjunction with Cox proportional hazards, Poisson, and Fine-Grey regression models. Following a comparison with existing software, validation with epidemiological cohort data is described. Exposure data and specific causes of death among workers at Los Alamos National Laboratory and U.S. nuclear power plants were analyzed by Colossus and 32-bit Epicure and compared with published results. Colossus results agreed with the results of existing software and previous publications.</p>","PeriodicalId":50068,"journal":{"name":"Journal of Radiological Protection","volume":"45 2","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiological Protection","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1088/1361-6498/adcd80","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Colossus is designed to meet a growing need for survival analysis software capable of analyzing tens of millions of rows of radiation epidemiological data. Colossus is an R package devised to offer scalable survival analysis for the Million Person Study. The total and relative rate equations available in Colossus are outlined in this article, which are used in conjunction with Cox proportional hazards, Poisson, and Fine-Grey regression models. Following a comparison with existing software, validation with epidemiological cohort data is described. Exposure data and specific causes of death among workers at Los Alamos National Laboratory and U.S. nuclear power plants were analyzed by Colossus and 32-bit Epicure and compared with published results. Colossus results agreed with the results of existing software and previous publications.
期刊介绍:
Journal of Radiological Protection publishes articles on all aspects of radiological protection, including non-ionising as well as ionising radiations. Fields of interest range from research, development and theory to operational matters, education and training. The very wide spectrum of its topics includes: dosimetry, instrument development, specialized measuring techniques, epidemiology, biological effects (in vivo and in vitro) and risk and environmental impact assessments.
The journal encourages publication of data and code as well as results.