{"title":"基于梯度直方图和k近邻算法的宫颈癌前细胞图像分类系统","authors":"Y. Jusman, Maryza Intan Rahmawati, S. N. Sulaiman","doi":"10.1109/ICCSCE54767.2022.9935660","DOIUrl":null,"url":null,"abstract":"Cervical cancer is a dangerous disease, with more than 99% of which contain Human Papillomavirus (HPV), threatening women worldwide. The Global Burden of Cancer Study (Globocan) has recorded 36,633 cervical cancer cases, ranking second in Indonesia. Analysis of Pap smear results manually as an early detection effort possesses many weaknesses. Therefore, as an early detection step in diagnosing cervical pre-cancerous cell images, an artificial intelligence system is highly required to assist medical personnel in providing fast and accurate diagnostic evaluations. This study utilized 972 training image data and 108 testing image data, with a cervical pre-cancerous cells image classification system using Histogram of Oriented Gradients (HOG) algorithms for feature extraction and KNN machine learning for the classification system. The gray level in the contrasting images between the texture of the nucleus, cytoplasm, and background had different pixel and bit depth intensity values. Hence, HOG obtained bin orientation for each pixel in the cell. The cosine KNN model demonstrated the best matrix performance, acquiring classification results of 0.8 for accuracy, 0.8 for precision, 0.889 for recall, 0.846 for specificity, and 0.771 for f-score. Moreover, the training data generated an accuracy of 69.3% and the fastest training time of 4.2359s.","PeriodicalId":346014,"journal":{"name":"2022 IEEE 12th International Conference on Control System, Computing and Engineering (ICCSCE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cervical Pre-cancerous Cell Image Classification System Using Histogram of Oriented Gradients and K-Nearest Neighbor Algorithms\",\"authors\":\"Y. Jusman, Maryza Intan Rahmawati, S. N. Sulaiman\",\"doi\":\"10.1109/ICCSCE54767.2022.9935660\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cervical cancer is a dangerous disease, with more than 99% of which contain Human Papillomavirus (HPV), threatening women worldwide. The Global Burden of Cancer Study (Globocan) has recorded 36,633 cervical cancer cases, ranking second in Indonesia. Analysis of Pap smear results manually as an early detection effort possesses many weaknesses. Therefore, as an early detection step in diagnosing cervical pre-cancerous cell images, an artificial intelligence system is highly required to assist medical personnel in providing fast and accurate diagnostic evaluations. This study utilized 972 training image data and 108 testing image data, with a cervical pre-cancerous cells image classification system using Histogram of Oriented Gradients (HOG) algorithms for feature extraction and KNN machine learning for the classification system. The gray level in the contrasting images between the texture of the nucleus, cytoplasm, and background had different pixel and bit depth intensity values. Hence, HOG obtained bin orientation for each pixel in the cell. The cosine KNN model demonstrated the best matrix performance, acquiring classification results of 0.8 for accuracy, 0.8 for precision, 0.889 for recall, 0.846 for specificity, and 0.771 for f-score. Moreover, the training data generated an accuracy of 69.3% and the fastest training time of 4.2359s.\",\"PeriodicalId\":346014,\"journal\":{\"name\":\"2022 IEEE 12th International Conference on Control System, Computing and Engineering (ICCSCE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 12th International Conference on Control System, Computing and Engineering (ICCSCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSCE54767.2022.9935660\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 12th International Conference on Control System, Computing and Engineering (ICCSCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSCE54767.2022.9935660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
宫颈癌是一种危险的疾病,其中99%以上含有人类乳头瘤病毒(HPV),威胁着全世界的妇女。全球癌症负担研究(Globocan)记录了36,633例宫颈癌病例,在印度尼西亚排名第二。人工分析子宫颈抹片检查结果作为早期检测工作具有许多弱点。因此,作为诊断宫颈癌前细胞图像的早期检测步骤,高度需要人工智能系统协助医护人员提供快速准确的诊断评估。本研究利用972张训练图像数据和108张测试图像数据,采用直方图定向梯度(Histogram of Oriented Gradients, HOG)算法进行特征提取,采用KNN机器学习进行分类,建立了宫颈癌前细胞图像分类系统。对比图像中细胞核、细胞质和背景纹理的灰度值具有不同的像素和比特深度强度值。因此,HOG获得了单元格中每个像素的bin方向。余弦KNN模型表现出最佳的矩阵性能,分类结果准确率为0.8,精密度为0.8,召回率为0.889,特异性为0.846,f分数为0.771。训练数据的准确率为69.3%,最快训练时间为4.2359s。
Cervical Pre-cancerous Cell Image Classification System Using Histogram of Oriented Gradients and K-Nearest Neighbor Algorithms
Cervical cancer is a dangerous disease, with more than 99% of which contain Human Papillomavirus (HPV), threatening women worldwide. The Global Burden of Cancer Study (Globocan) has recorded 36,633 cervical cancer cases, ranking second in Indonesia. Analysis of Pap smear results manually as an early detection effort possesses many weaknesses. Therefore, as an early detection step in diagnosing cervical pre-cancerous cell images, an artificial intelligence system is highly required to assist medical personnel in providing fast and accurate diagnostic evaluations. This study utilized 972 training image data and 108 testing image data, with a cervical pre-cancerous cells image classification system using Histogram of Oriented Gradients (HOG) algorithms for feature extraction and KNN machine learning for the classification system. The gray level in the contrasting images between the texture of the nucleus, cytoplasm, and background had different pixel and bit depth intensity values. Hence, HOG obtained bin orientation for each pixel in the cell. The cosine KNN model demonstrated the best matrix performance, acquiring classification results of 0.8 for accuracy, 0.8 for precision, 0.889 for recall, 0.846 for specificity, and 0.771 for f-score. Moreover, the training data generated an accuracy of 69.3% and the fastest training time of 4.2359s.