Predicting the Incidence and Trend of Breast Cancer Using Time Series Analysis for 2007-2016 in Qazvin

F. Hajiabadi, H. Bagheri, Nasrin Tonokaboni, M. Zamanian, Z. Hosseinkhani
{"title":"Predicting the Incidence and Trend of Breast Cancer Using Time Series Analysis for 2007-2016 in Qazvin","authors":"F. Hajiabadi, H. Bagheri, Nasrin Tonokaboni, M. Zamanian, Z. Hosseinkhani","doi":"10.30699/ijbd.14.1.11","DOIUrl":null,"url":null,"abstract":"Introduction: Breast cancer is the most common cancer and the second leading cause of death in women worldwide. The aim of this study was to analyze the trend and predict the incidence of breast cancer using time series analysis. Methods: In this study, data on breast cancer incidence in Qazvin province between 2007 and 2016 were analyzed using time series analysis with autoregressive integrated moving average (ARIMA) modeling to forecast the future pattern. The Box-Jenkins time series model and its diagnosis and evaluation methods were used to show the trend and forecasting the next year new cancers. To describe and fit the appropriate models, R statistical software version 3.6.3 was used. Results: Between 2007 and 2016, a total number of 1229 new patients had been registered (monthly mean [SD]: 10.24 [1.03]). Although the overall trend in the raw number of new breast cancer cases has been increasing over time, the change in observations over time has been increasing and decreasing. According to Bartlett test results, the variances of the data were not constant. Also, according to the results of Kolmogorov-Smirnov test, breast cancer series data were not normal. Among the studied models, ARIMA (1, 1, 1) was selected due to lower AIC criteria than other models, and this model was selected as the final model for predicting breast cancer for the next year. The confidence interval of the predicted values was relatively narrow, which indicates the appropriateness of the final model in the prediction. Conclusion: Time series analysis is an efficient tool to model the past and future data on the raw number of new cancer cases, and the goodness-of-fit indicators of the model showed that the Box-Jenkins model is a reliable model for fitting similar data.","PeriodicalId":405995,"journal":{"name":"Iranian Quarterly Journal of Breast Diseases","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Iranian Quarterly Journal of Breast Diseases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30699/ijbd.14.1.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Introduction: Breast cancer is the most common cancer and the second leading cause of death in women worldwide. The aim of this study was to analyze the trend and predict the incidence of breast cancer using time series analysis. Methods: In this study, data on breast cancer incidence in Qazvin province between 2007 and 2016 were analyzed using time series analysis with autoregressive integrated moving average (ARIMA) modeling to forecast the future pattern. The Box-Jenkins time series model and its diagnosis and evaluation methods were used to show the trend and forecasting the next year new cancers. To describe and fit the appropriate models, R statistical software version 3.6.3 was used. Results: Between 2007 and 2016, a total number of 1229 new patients had been registered (monthly mean [SD]: 10.24 [1.03]). Although the overall trend in the raw number of new breast cancer cases has been increasing over time, the change in observations over time has been increasing and decreasing. According to Bartlett test results, the variances of the data were not constant. Also, according to the results of Kolmogorov-Smirnov test, breast cancer series data were not normal. Among the studied models, ARIMA (1, 1, 1) was selected due to lower AIC criteria than other models, and this model was selected as the final model for predicting breast cancer for the next year. The confidence interval of the predicted values was relatively narrow, which indicates the appropriateness of the final model in the prediction. Conclusion: Time series analysis is an efficient tool to model the past and future data on the raw number of new cancer cases, and the goodness-of-fit indicators of the model showed that the Box-Jenkins model is a reliable model for fitting similar data.
用时间序列分析预测2007-2016年Qazvin地区乳腺癌发病率及趋势
导读:乳腺癌是最常见的癌症,也是全世界妇女死亡的第二大原因。本研究的目的是利用时间序列分析分析乳腺癌的趋势并预测其发病率。方法:采用时间序列分析和自回归综合移动平均(ARIMA)模型对2007 - 2016年加兹温省乳腺癌发病率数据进行分析,预测未来趋势。使用Box-Jenkins时间序列模型及其诊断和评估方法来显示趋势并预测下一年的新发癌症。为了描述和拟合合适的模型,使用R统计软件3.6.3版本。结果:2007 - 2016年,共新登记1229例患者(月平均[SD]: 10.24[1.03])。尽管随着时间的推移,新发乳腺癌病例的总体趋势一直在增加,但观察结果的变化一直在增加和减少。根据Bartlett检验结果,数据的方差不是恒定的。此外,根据Kolmogorov-Smirnov试验结果,乳腺癌系列数据不正常。在研究的模型中,由于AIC标准低于其他模型,我们选择了ARIMA(1,1,1)模型,并将其作为预测下一年乳腺癌的最终模型。预测值的置信区间较窄,表明最终模型在预测中的适用性。结论:时间序列分析是对癌症新发病例原始数量的过去和未来数据进行建模的有效工具,模型的拟合优度指标表明Box-Jenkins模型是拟合类似数据的可靠模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信