Prognostic Evaluation of Cervical Cancer Using Various Classifiers

2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP) Pub Date : 2022-12-23 DOI:10.1109/CCIP57447.2022.10058662

T. Thotakura, Sumitra Kopparapu, Reeja S R

{"title":"Prognostic Evaluation of Cervical Cancer Using Various Classifiers","authors":"T. Thotakura, Sumitra Kopparapu, Reeja S R","doi":"10.1109/CCIP57447.2022.10058662","DOIUrl":null,"url":null,"abstract":"Cervical HPV (human papillomavirus) infection is almost invariably the prominent reason for cervical cancer. The most prevalent malignancy in female is cervical cancer and it substantially affects the risk factor for mortality. Cervical cancer is bound by a wide range of risk factors. Analyzing risk factors from each patient's medical history and screening outcomes, we created a prediction model in this study to forecast the course of cervical cancer patients. Early detection and diagnosis are the most effective methods of lowering the incidence of cervical cancer globally since it is a condition that is largely preventable. Early detection of the signs of this gynecologic disease can be challenging, particularly in nations without screening systems. Machine learning techniques can be utilized to identify malignant cancerous cells when cervical cancer was first discovered. We created a machine-learning technique that can handle massive volumes of data concurrently with greater accuracy. Based upon many parameters, such as age and lifestyle, can forecast a person's likelihood of developing cervical cancer. A Kaggle data repository Cervical Cancer dataset was retrieved, it had eight hundred and fifty-eight unique data sets from distinct cases involving thirty-six variants comprised of diverse risk and protective factors. And it had four types of attributes: Cytology, Hinselmann, Biopsy, and Schiller. Using these class attributes as a basis, this dataset was divided into four groups. To detect cancer and facilitate prompt identification, ML classification algorithms like Decision Trees (DT), Logistic Regression (LR), Random Forest (RT), Support vector machines (SVM), AdaBoost, and artificial neural networks have been implemented. After gathering the data, in order to investigate the prevalence of characteristics and interconnections between numerous unrelated parameters and the propensity to acquire cervical malignancy, we first carried out a descriptive statistical study. According to the study's inferences, appropriate network architecture, categorization, and ML algorithms are capable of properly and effectively detecting Cervical Cancer in its early stages utilizing diagnostic information. Eventually, while comparing the acquired outcomes, we scrutinized that SVM, AdaBoost, DT, and LR algorithms had exhibited hundred percentage of performance, while RF and ANN algorithms seemed to have ninety-nine percentage of performance. Since the goal of our research was an early cancer forecast. Than such, we suggest that future research should take into account expanding patient histories, for example, by integrating primary health information before hospital referral.","PeriodicalId":309964,"journal":{"name":"2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIP57447.2022.10058662","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Cervical HPV (human papillomavirus) infection is almost invariably the prominent reason for cervical cancer. The most prevalent malignancy in female is cervical cancer and it substantially affects the risk factor for mortality. Cervical cancer is bound by a wide range of risk factors. Analyzing risk factors from each patient's medical history and screening outcomes, we created a prediction model in this study to forecast the course of cervical cancer patients. Early detection and diagnosis are the most effective methods of lowering the incidence of cervical cancer globally since it is a condition that is largely preventable. Early detection of the signs of this gynecologic disease can be challenging, particularly in nations without screening systems. Machine learning techniques can be utilized to identify malignant cancerous cells when cervical cancer was first discovered. We created a machine-learning technique that can handle massive volumes of data concurrently with greater accuracy. Based upon many parameters, such as age and lifestyle, can forecast a person's likelihood of developing cervical cancer. A Kaggle data repository Cervical Cancer dataset was retrieved, it had eight hundred and fifty-eight unique data sets from distinct cases involving thirty-six variants comprised of diverse risk and protective factors. And it had four types of attributes: Cytology, Hinselmann, Biopsy, and Schiller. Using these class attributes as a basis, this dataset was divided into four groups. To detect cancer and facilitate prompt identification, ML classification algorithms like Decision Trees (DT), Logistic Regression (LR), Random Forest (RT), Support vector machines (SVM), AdaBoost, and artificial neural networks have been implemented. After gathering the data, in order to investigate the prevalence of characteristics and interconnections between numerous unrelated parameters and the propensity to acquire cervical malignancy, we first carried out a descriptive statistical study. According to the study's inferences, appropriate network architecture, categorization, and ML algorithms are capable of properly and effectively detecting Cervical Cancer in its early stages utilizing diagnostic information. Eventually, while comparing the acquired outcomes, we scrutinized that SVM, AdaBoost, DT, and LR algorithms had exhibited hundred percentage of performance, while RF and ANN algorithms seemed to have ninety-nine percentage of performance. Since the goal of our research was an early cancer forecast. Than such, we suggest that future research should take into account expanding patient histories, for example, by integrating primary health information before hospital referral.

查看原文本刊更多论文

不同分类对宫颈癌预后的评价

宫颈HPV(人乳头瘤病毒)感染几乎总是宫颈癌的主要原因。女性中最常见的恶性肿瘤是子宫颈癌，它对死亡的危险因素有很大影响。子宫颈癌与多种危险因素有关。本研究从每位患者的病史和筛查结果分析危险因素，建立预测模型，预测宫颈癌患者的病程。早期发现和诊断是全球降低宫颈癌发病率的最有效方法，因为它在很大程度上是可以预防的。早期发现这种妇科疾病的迹象可能具有挑战性，特别是在没有筛查系统的国家。当宫颈癌首次被发现时，机器学习技术可以用来识别恶性癌细胞。我们创造了一种机器学习技术，可以以更高的精度同时处理大量数据。根据许多参数，如年龄和生活方式，可以预测一个人患宫颈癌的可能性。检索了Kaggle数据存储库宫颈癌数据集，它有来自不同病例的858个独特数据集，涉及36个由不同风险和保护因素组成的变体。它有四种类型的属性:细胞学，Hinselmann，活检和席勒。以这些类属性为基础，将该数据集分为四组。为了检测癌症并促进及时识别，ML分类算法如决策树(DT)、逻辑回归(LR)、随机森林(RT)、支持向量机(SVM)、AdaBoost和人工神经网络已经实现。在收集数据后，为了调查许多不相关参数与获得宫颈恶性肿瘤倾向之间的特征和相互关系的患病率，我们首先进行了描述性统计研究。根据该研究的推论，适当的网络架构、分类和ML算法能够利用诊断信息在早期正确有效地检测出宫颈癌。最后，在比较获得的结果时，我们仔细检查了SVM、AdaBoost、DT和LR算法表现出百分之百的性能，而RF和ANN算法似乎有百分之九十九的性能。因为我们的研究目标是早期癌症预测。因此，我们建议未来的研究应考虑扩大患者的病史，例如，通过整合医院转诊前的初级健康信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP)

自引率

0.00%

发文量