{"title":"利用有监督的机器学习算法确定重力式卫生下水道系统检查的优先次序","authors":"Karthikeyan Loganathan, Mohammad Najafi, Sharareh Kermanshachi, Praveen Kumar Maduri, Apurva Pamidimukkala","doi":"10.1186/s43065-024-00101-3","DOIUrl":null,"url":null,"abstract":"Underground wastewater collection systems degrade with time, necessitating utility owners to engage in ongoing evaluations and enhancements of their asset management frameworks to preserve the performance of their assets. The inspection and condition assessment of sewer pipes are crucial for the effective operation and maintenance of sewer systems. The closed-circuit television (CCTV) is frequently employed to examine sewer pipes in the United States. This procedure is both costly and laborious because of the extensive number of pipes in a metropolis. Prioritisation of inspection for sanitary sewage pipe segments requiring repair or maintenance can be done in advance depending on their past performance. Hence, the aim of this study is to construct a predictive model for the state of sanitary sewer pipes, utilising data collected from a city located in the southcentral region of the United States. The main contribution is that this study used multiclass classification and predicted PACP scores of the pipes. Condition prediction models were developed using extensively utilised supervised machine learning algorithms including logistic regression (LR), k-nearest neighbors (k-NN), and random forest (RF). However, the bulk of the constructed models were assessed using a limited number of assessment measures, such as the receiver operator characteristic (ROC) curve and the area under the curve (AUC) value. This paper asserts that the assessment of the predictive capacity of these models cannot be determined only by relying on ROC and AUC values. Out of the three models evaluated in this study, the LR model had an AUC value of 0.76. However, this model had a higher number of misclassifications or inaccurate predictions compared to the other models. Consequently, these models were assessed using additional assessment measures, including precision, recall, and F-1 scores (which represent the harmonic mean of precision and recall). Curiously, the LR model achieved an F1-score of 0.28 on a scale ranging from 0 to 1. The RF model yielded an F1-score of 0.45 and an AUC value of 0.86. The existing model can be enhanced before it is employed by asset managers during the inspection phase to assess the state of their sanitary sewers and identify essential sewers that require immediate care.","PeriodicalId":73793,"journal":{"name":"Journal of infrastructure preservation and resilience","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inspection prioritization of gravity sanitary sewer systems using supervised machine learning algorithms\",\"authors\":\"Karthikeyan Loganathan, Mohammad Najafi, Sharareh Kermanshachi, Praveen Kumar Maduri, Apurva Pamidimukkala\",\"doi\":\"10.1186/s43065-024-00101-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Underground wastewater collection systems degrade with time, necessitating utility owners to engage in ongoing evaluations and enhancements of their asset management frameworks to preserve the performance of their assets. The inspection and condition assessment of sewer pipes are crucial for the effective operation and maintenance of sewer systems. The closed-circuit television (CCTV) is frequently employed to examine sewer pipes in the United States. This procedure is both costly and laborious because of the extensive number of pipes in a metropolis. Prioritisation of inspection for sanitary sewage pipe segments requiring repair or maintenance can be done in advance depending on their past performance. Hence, the aim of this study is to construct a predictive model for the state of sanitary sewer pipes, utilising data collected from a city located in the southcentral region of the United States. The main contribution is that this study used multiclass classification and predicted PACP scores of the pipes. Condition prediction models were developed using extensively utilised supervised machine learning algorithms including logistic regression (LR), k-nearest neighbors (k-NN), and random forest (RF). However, the bulk of the constructed models were assessed using a limited number of assessment measures, such as the receiver operator characteristic (ROC) curve and the area under the curve (AUC) value. This paper asserts that the assessment of the predictive capacity of these models cannot be determined only by relying on ROC and AUC values. Out of the three models evaluated in this study, the LR model had an AUC value of 0.76. However, this model had a higher number of misclassifications or inaccurate predictions compared to the other models. Consequently, these models were assessed using additional assessment measures, including precision, recall, and F-1 scores (which represent the harmonic mean of precision and recall). Curiously, the LR model achieved an F1-score of 0.28 on a scale ranging from 0 to 1. The RF model yielded an F1-score of 0.45 and an AUC value of 0.86. The existing model can be enhanced before it is employed by asset managers during the inspection phase to assess the state of their sanitary sewers and identify essential sewers that require immediate care.\",\"PeriodicalId\":73793,\"journal\":{\"name\":\"Journal of infrastructure preservation and resilience\",\"volume\":\"68 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of infrastructure preservation and resilience\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s43065-024-00101-3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of infrastructure preservation and resilience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s43065-024-00101-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Inspection prioritization of gravity sanitary sewer systems using supervised machine learning algorithms
Underground wastewater collection systems degrade with time, necessitating utility owners to engage in ongoing evaluations and enhancements of their asset management frameworks to preserve the performance of their assets. The inspection and condition assessment of sewer pipes are crucial for the effective operation and maintenance of sewer systems. The closed-circuit television (CCTV) is frequently employed to examine sewer pipes in the United States. This procedure is both costly and laborious because of the extensive number of pipes in a metropolis. Prioritisation of inspection for sanitary sewage pipe segments requiring repair or maintenance can be done in advance depending on their past performance. Hence, the aim of this study is to construct a predictive model for the state of sanitary sewer pipes, utilising data collected from a city located in the southcentral region of the United States. The main contribution is that this study used multiclass classification and predicted PACP scores of the pipes. Condition prediction models were developed using extensively utilised supervised machine learning algorithms including logistic regression (LR), k-nearest neighbors (k-NN), and random forest (RF). However, the bulk of the constructed models were assessed using a limited number of assessment measures, such as the receiver operator characteristic (ROC) curve and the area under the curve (AUC) value. This paper asserts that the assessment of the predictive capacity of these models cannot be determined only by relying on ROC and AUC values. Out of the three models evaluated in this study, the LR model had an AUC value of 0.76. However, this model had a higher number of misclassifications or inaccurate predictions compared to the other models. Consequently, these models were assessed using additional assessment measures, including precision, recall, and F-1 scores (which represent the harmonic mean of precision and recall). Curiously, the LR model achieved an F1-score of 0.28 on a scale ranging from 0 to 1. The RF model yielded an F1-score of 0.45 and an AUC value of 0.86. The existing model can be enhanced before it is employed by asset managers during the inspection phase to assess the state of their sanitary sewers and identify essential sewers that require immediate care.