Oktavian Lantang, G. Terdik, A. Hajdu, Attila Tiba
{"title":"基于单个和集成的卷积神经网络在癌症图像分类中的比较","authors":"Oktavian Lantang, G. Terdik, A. Hajdu, Attila Tiba","doi":"10.33039/AMI.2021.03.013","DOIUrl":null,"url":null,"abstract":"In this work, we investigated the ability of several Convolutional Neural Network (CNN) models for predicting the spread of cancer using medical images. We used a dataset released by the Kaggle, namely PatchCamelyon. The dataset consists of 220,025 pathology images digitized by a tissue scanner. A clinical expert labeled each image as cancerous or non-cancerous. We used 70% of the images as a training set and 30% of them as a validation set. We design three models based on three commonly used modules: VGG, Inception, and Residual Network (ResNet), to develop an ensemble model and implement a voting system to determine the final decision. Then, we compared the performance of this ensemble model to the performance of each single model. Additionally, we used a weighted majority voting system, where the final prediction is equal to the weighted average of the prediction produced by each network. Our results show that the classification of the two ensemble models reaches 96%. Thus these results prove that the ensemble model outperforms single network architectures.","PeriodicalId":43454,"journal":{"name":"Annales Mathematicae et Informaticae","volume":"5 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison of single and ensemble-based convolutional neural networks for cancerous image classification\",\"authors\":\"Oktavian Lantang, G. Terdik, A. Hajdu, Attila Tiba\",\"doi\":\"10.33039/AMI.2021.03.013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we investigated the ability of several Convolutional Neural Network (CNN) models for predicting the spread of cancer using medical images. We used a dataset released by the Kaggle, namely PatchCamelyon. The dataset consists of 220,025 pathology images digitized by a tissue scanner. A clinical expert labeled each image as cancerous or non-cancerous. We used 70% of the images as a training set and 30% of them as a validation set. We design three models based on three commonly used modules: VGG, Inception, and Residual Network (ResNet), to develop an ensemble model and implement a voting system to determine the final decision. Then, we compared the performance of this ensemble model to the performance of each single model. Additionally, we used a weighted majority voting system, where the final prediction is equal to the weighted average of the prediction produced by each network. Our results show that the classification of the two ensemble models reaches 96%. Thus these results prove that the ensemble model outperforms single network architectures.\",\"PeriodicalId\":43454,\"journal\":{\"name\":\"Annales Mathematicae et Informaticae\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":0.3000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annales Mathematicae et Informaticae\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33039/AMI.2021.03.013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MATHEMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annales Mathematicae et Informaticae","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33039/AMI.2021.03.013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS","Score":null,"Total":0}
Comparison of single and ensemble-based convolutional neural networks for cancerous image classification
In this work, we investigated the ability of several Convolutional Neural Network (CNN) models for predicting the spread of cancer using medical images. We used a dataset released by the Kaggle, namely PatchCamelyon. The dataset consists of 220,025 pathology images digitized by a tissue scanner. A clinical expert labeled each image as cancerous or non-cancerous. We used 70% of the images as a training set and 30% of them as a validation set. We design three models based on three commonly used modules: VGG, Inception, and Residual Network (ResNet), to develop an ensemble model and implement a voting system to determine the final decision. Then, we compared the performance of this ensemble model to the performance of each single model. Additionally, we used a weighted majority voting system, where the final prediction is equal to the weighted average of the prediction produced by each network. Our results show that the classification of the two ensemble models reaches 96%. Thus these results prove that the ensemble model outperforms single network architectures.