{"title":"Comparison of classical, xgboost and neural network methods for parameter estimation in epidemic processes on random graphs","authors":"Ágnes Backhausz , Edit Bognár , Villő Csiszár , Damján Tárkányi , András Zempléni","doi":"10.1016/j.fraope.2025.100271","DOIUrl":null,"url":null,"abstract":"<div><div>The main goal of this paper is to quantitatively compare the performance of classical methods to XGBoost and convolutional neural networks in a parameter estimation problem for SIR epidemic spread. Since we model the underlying social network by flexible two-layer random graphs, we can also study how the structural difference between the graphs in the training set and the test set influences the error of the estimate. We also quantify the improvement of the results when additional information (such as the average degree of infected vertices) is available, compared to the case when only the time series of the number of susceptible and infected individuals is observed. Furthermore, the simulation results show how the accuracy of the methods varies with the time elapsed from the start of the epidemic.</div></div>","PeriodicalId":100554,"journal":{"name":"Franklin Open","volume":"11 ","pages":"Article 100271"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Franklin Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2773186325000611","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The main goal of this paper is to quantitatively compare the performance of classical methods to XGBoost and convolutional neural networks in a parameter estimation problem for SIR epidemic spread. Since we model the underlying social network by flexible two-layer random graphs, we can also study how the structural difference between the graphs in the training set and the test set influences the error of the estimate. We also quantify the improvement of the results when additional information (such as the average degree of infected vertices) is available, compared to the case when only the time series of the number of susceptible and infected individuals is observed. Furthermore, the simulation results show how the accuracy of the methods varies with the time elapsed from the start of the epidemic.