探索Lleida地区的癌症发病率、危险因素和死亡率:用于癌症数据分析的交互式、开源R Shiny应用程序。

IF 3.3 Q2 ONCOLOGY

JMIR Cancer Pub Date : 2023-04-20 DOI:10.2196/44695

Didac Florensa, Jordi Mateo-Fornes, Sergi Lopez Sorribes, Anna Torres Tuca, Francesc Solsona, Pere Godoy

{"title":"探索Lleida地区的癌症发病率、危险因素和死亡率:用于癌症数据分析的交互式、开源R Shiny应用程序。","authors":"Didac Florensa, Jordi Mateo-Fornes, Sergi Lopez Sorribes, Anna Torres Tuca, Francesc Solsona, Pere Godoy","doi":"10.2196/44695","DOIUrl":null,"url":null,"abstract":"Background: The cancer incidence rate is essential to public health surveillance. The analysis of this information allows authorities to know the cancer situation in their regions, especially to determine cancer patterns, monitor cancer trends, and help prioritize the allocation of health resource.Objective: This study aimed to present the design and implementation of an R Shiny application to assist cancer registries conduct rapid descriptive and predictive analytics in a user-friendly, intuitive, portable, and scalable way. Moreover, we wanted to describe the design and implementation road map to inspire other population registries to exploit their data sets and develop similar tools and models.Methods: The first step was to consolidate the data into the population registry cancer database. These data were cross validated by ASEDAT software, checked later, and reviewed by experts. Next, we developed an online tool to visualize the data and generate reports to assist decision-making under the R Shiny framework. Currently, the application can generate descriptive analytics using population variables, such as age, sex, and cancer type; cancer incidence in region-level geographical heat maps; line plots to visualize temporal trends; and typical risk factor plots. The application also showed descriptive plots about cancer mortality in the Lleida region. This web platform was built as a microservices cloud platform. The web back end consists of an application programming interface and a database, which NodeJS and MongoDB have implemented. All these parts were encapsulated and deployed by Docker and Docker Compose.Results: The results provide a successful case study in which the tool was applied to the cancer registry of the Lleida region. The study illustrates how researchers and cancer registries can use the application to analyze cancer databases. Furthermore, the results highlight the analytics related to risk factors, second tumors, and cancer mortality. The application shows the incidence and evolution of each cancer during a specific period for gender, age groups, and cancer location, among other functionalities. The risk factors view permitted us to detect that approximately 60% of cancer patients were diagnosed with excess weight at diagnosis. Regarding mortality, the application showed that lung cancer registered the highest number of deaths for both genders. Breast cancer was the lethal cancer in women. Finally, a customization guide was included as a result of this implementation to deploy the architecture presented.Conclusions: This paper aimed to document a successful methodology for exploiting the data in population cancer registries and propose guidelines for other similar records to develop similar tools. We intend to inspire other entities to build an application that can help decision-making and make data more accessible and transparent for the community of users.","PeriodicalId":45538,"journal":{"name":"JMIR Cancer","volume":"9 ","pages":"e44695"},"PeriodicalIF":3.3000,"publicationDate":"2023-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160936/pdf/","citationCount":"0","resultStr":"{\"title\":\"Exploring Cancer Incidence, Risk Factors, and Mortality in the Lleida Region: Interactive, Open-source R Shiny Application for Cancer Data Analysis.\",\"authors\":\"Didac Florensa, Jordi Mateo-Fornes, Sergi Lopez Sorribes, Anna Torres Tuca, Francesc Solsona, Pere Godoy\",\"doi\":\"10.2196/44695\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: The cancer incidence rate is essential to public health surveillance. The analysis of this information allows authorities to know the cancer situation in their regions, especially to determine cancer patterns, monitor cancer trends, and help prioritize the allocation of health resource.Objective: This study aimed to present the design and implementation of an R Shiny application to assist cancer registries conduct rapid descriptive and predictive analytics in a user-friendly, intuitive, portable, and scalable way. Moreover, we wanted to describe the design and implementation road map to inspire other population registries to exploit their data sets and develop similar tools and models.Methods: The first step was to consolidate the data into the population registry cancer database. These data were cross validated by ASEDAT software, checked later, and reviewed by experts. Next, we developed an online tool to visualize the data and generate reports to assist decision-making under the R Shiny framework. Currently, the application can generate descriptive analytics using population variables, such as age, sex, and cancer type; cancer incidence in region-level geographical heat maps; line plots to visualize temporal trends; and typical risk factor plots. The application also showed descriptive plots about cancer mortality in the Lleida region. This web platform was built as a microservices cloud platform. The web back end consists of an application programming interface and a database, which NodeJS and MongoDB have implemented. All these parts were encapsulated and deployed by Docker and Docker Compose.Results: The results provide a successful case study in which the tool was applied to the cancer registry of the Lleida region. The study illustrates how researchers and cancer registries can use the application to analyze cancer databases. Furthermore, the results highlight the analytics related to risk factors, second tumors, and cancer mortality. The application shows the incidence and evolution of each cancer during a specific period for gender, age groups, and cancer location, among other functionalities. The risk factors view permitted us to detect that approximately 60% of cancer patients were diagnosed with excess weight at diagnosis. Regarding mortality, the application showed that lung cancer registered the highest number of deaths for both genders. Breast cancer was the lethal cancer in women. Finally, a customization guide was included as a result of this implementation to deploy the architecture presented.Conclusions: This paper aimed to document a successful methodology for exploiting the data in population cancer registries and propose guidelines for other similar records to develop similar tools. We intend to inspire other entities to build an application that can help decision-making and make data more accessible and transparent for the community of users.\",\"PeriodicalId\":45538,\"journal\":{\"name\":\"JMIR Cancer\",\"volume\":\"9 \",\"pages\":\"e44695\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2023-04-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160936/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Cancer\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/44695\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cancer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/44695","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景:癌症发病率是公共卫生监测的重要指标。对这些信息的分析使当局能够了解其所在区域的癌症情况，特别是确定癌症模式，监测癌症趋势，并帮助确定卫生资源分配的优先次序。目的:本研究旨在展示R Shiny应用程序的设计和实现，以用户友好、直观、便携和可扩展的方式帮助癌症登记处进行快速描述和预测分析。此外，我们希望描述设计和实施路线图，以激励其他人口登记处利用他们的数据集并开发类似的工具和模型。方法:第一步是将数据整合到人口登记癌症数据库中。这些数据由ASEDAT软件进行交叉验证，随后进行检查，并由专家进行评审。接下来，我们开发了一个在线工具来可视化数据，并在R Shiny框架下生成报告，以协助决策。目前，该应用程序可以使用人口变量(如年龄、性别和癌症类型)生成描述性分析;区域一级地理热图中的癌症发病率;线图显示时间趋势;还有典型的危险因素图。该应用程序还显示了莱伊达地区癌症死亡率的描述性图。这个web平台是作为一个微服务云平台构建的。web后端由应用程序编程接口和数据库组成，由NodeJS和MongoDB实现。所有这些部分都是由Docker和Docker Compose封装和部署的。结果:结果提供了一个成功的案例研究，其中该工具被应用于莱伊达地区的癌症登记。这项研究说明了研究人员和癌症登记处如何使用该应用程序来分析癌症数据库。此外，结果突出了与危险因素、第二肿瘤和癌症死亡率相关的分析。该应用程序显示了性别、年龄组、癌症位置等特定时期内每种癌症的发病率和演变情况。从危险因素的角度来看，我们发现大约60%的癌症患者在诊断时被诊断为超重。关于死亡率，该申请表明，肺癌登记的死亡人数在男女中都是最高的。乳腺癌是女性的致命癌症。最后，作为该实现的结果，包含了一个自定义指南，用于部署所呈现的体系结构。结论:本文旨在记录一种成功的方法来利用人口癌症登记处的数据，并为其他类似记录开发类似工具提出指导方针。我们打算激励其他实体构建一个可以帮助决策的应用程序，并使数据对用户社区更容易访问和透明。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Exploring Cancer Incidence, Risk Factors, and Mortality in the Lleida Region: Interactive, Open-source R Shiny Application for Cancer Data Analysis.

查看原文本刊更多论文

Exploring Cancer Incidence, Risk Factors, and Mortality in the Lleida Region: Interactive, Open-source R Shiny Application for Cancer Data Analysis.

Background: The cancer incidence rate is essential to public health surveillance. The analysis of this information allows authorities to know the cancer situation in their regions, especially to determine cancer patterns, monitor cancer trends, and help prioritize the allocation of health resource.

Objective: This study aimed to present the design and implementation of an R Shiny application to assist cancer registries conduct rapid descriptive and predictive analytics in a user-friendly, intuitive, portable, and scalable way. Moreover, we wanted to describe the design and implementation road map to inspire other population registries to exploit their data sets and develop similar tools and models.

Methods: The first step was to consolidate the data into the population registry cancer database. These data were cross validated by ASEDAT software, checked later, and reviewed by experts. Next, we developed an online tool to visualize the data and generate reports to assist decision-making under the R Shiny framework. Currently, the application can generate descriptive analytics using population variables, such as age, sex, and cancer type; cancer incidence in region-level geographical heat maps; line plots to visualize temporal trends; and typical risk factor plots. The application also showed descriptive plots about cancer mortality in the Lleida region. This web platform was built as a microservices cloud platform. The web back end consists of an application programming interface and a database, which NodeJS and MongoDB have implemented. All these parts were encapsulated and deployed by Docker and Docker Compose.

Results: The results provide a successful case study in which the tool was applied to the cancer registry of the Lleida region. The study illustrates how researchers and cancer registries can use the application to analyze cancer databases. Furthermore, the results highlight the analytics related to risk factors, second tumors, and cancer mortality. The application shows the incidence and evolution of each cancer during a specific period for gender, age groups, and cancer location, among other functionalities. The risk factors view permitted us to detect that approximately 60% of cancer patients were diagnosed with excess weight at diagnosis. Regarding mortality, the application showed that lung cancer registered the highest number of deaths for both genders. Breast cancer was the lethal cancer in women. Finally, a customization guide was included as a result of this implementation to deploy the architecture presented.

Conclusions: This paper aimed to document a successful methodology for exploiting the data in population cancer registries and propose guidelines for other similar records to develop similar tools. We intend to inspire other entities to build an application that can help decision-making and make data more accessible and transparent for the community of users.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

JMIR Cancer ONCOLOGY-

CiteScore

4.10

自引率

0.00%

发文量

审稿时长

12 weeks