Asif Mahbub Uddin, A. A. Al Miraj, Moumita Sen Sarma, Avishek Das, Md. Manjurul Gani
{"title":"基于卷积神经网络的传统孟加拉食物分类","authors":"Asif Mahbub Uddin, A. A. Al Miraj, Moumita Sen Sarma, Avishek Das, Md. Manjurul Gani","doi":"10.1109/TENSYMP52854.2021.9550874","DOIUrl":null,"url":null,"abstract":"Image classification is turning into a significant and promising perspective in the fields of object recognition using computer vision. However, researchers have barely scratched the superficials of food image classification till now. To evaluate the dietary aptitudes of people from various ethnicities, the classification of their traditional foods makes a huge impact. That’s what steered us into the classification of seven traditional foods in Bangladesh. In this regard, our key contribution to this aspect is the development of a dataset of Traditional Bengali Food Image (TBFI) including images of seven different classes of traditional Bengali foods: Biriyani, Panta Ilish, Khichuri, Fuchka, Roshogolla, Dim Vuna & Kala Vuna. For this, a scratch model incorporating Convolutional Neural Network (CNN) has been developed, rectifying to another vital contribution. As conventional Neural Network doesn’t perform well in case of image datasets, the CNN approach has been followed in view of its high accuracy, computational power with efficiency and automatic recognition of important features without any human oversight. Moreover, transfer learning approach with fine tuned VGG16 has also been used for TBFI classification. The proposed model in this paper has generated a culminated outcome upon our TBFI dataset with an average accuracy of 98% in classifying the traditional Bengali food images.","PeriodicalId":137485,"journal":{"name":"2021 IEEE Region 10 Symposium (TENSYMP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Traditional Bengali Food Classification Using Convolutional Neural Network\",\"authors\":\"Asif Mahbub Uddin, A. A. Al Miraj, Moumita Sen Sarma, Avishek Das, Md. Manjurul Gani\",\"doi\":\"10.1109/TENSYMP52854.2021.9550874\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image classification is turning into a significant and promising perspective in the fields of object recognition using computer vision. However, researchers have barely scratched the superficials of food image classification till now. To evaluate the dietary aptitudes of people from various ethnicities, the classification of their traditional foods makes a huge impact. That’s what steered us into the classification of seven traditional foods in Bangladesh. In this regard, our key contribution to this aspect is the development of a dataset of Traditional Bengali Food Image (TBFI) including images of seven different classes of traditional Bengali foods: Biriyani, Panta Ilish, Khichuri, Fuchka, Roshogolla, Dim Vuna & Kala Vuna. For this, a scratch model incorporating Convolutional Neural Network (CNN) has been developed, rectifying to another vital contribution. As conventional Neural Network doesn’t perform well in case of image datasets, the CNN approach has been followed in view of its high accuracy, computational power with efficiency and automatic recognition of important features without any human oversight. Moreover, transfer learning approach with fine tuned VGG16 has also been used for TBFI classification. The proposed model in this paper has generated a culminated outcome upon our TBFI dataset with an average accuracy of 98% in classifying the traditional Bengali food images.\",\"PeriodicalId\":137485,\"journal\":{\"name\":\"2021 IEEE Region 10 Symposium (TENSYMP)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Region 10 Symposium (TENSYMP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENSYMP52854.2021.9550874\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Region 10 Symposium (TENSYMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENSYMP52854.2021.9550874","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
图像分类正在成为计算机视觉对象识别领域一个重要而有前途的研究方向。然而,迄今为止,研究人员对食品图像分类的研究还只是停留在表面。为了评估不同种族的人的饮食习惯,他们传统食物的分类产生了巨大的影响。这就是我们对孟加拉国七种传统食物进行分类的原因。在这方面,我们在这方面的主要贡献是开发了传统孟加拉食品图像(TBFI)数据集,其中包括七种不同类别的传统孟加拉食品的图像:Biriyani, Panta Ilish, Khichuri, Fuchka, Roshogolla, Dim Vuna和Kala Vuna。为此,一个包含卷积神经网络(CNN)的scratch模型已经开发出来,这是另一个重要的贡献。由于传统的神经网络在图像数据集上表现不佳,因此采用了CNN方法,因为它具有精度高,计算能力强,效率高,并且可以在不需要人为监督的情况下自动识别重要特征。此外,采用微调VGG16的迁移学习方法也被用于TBFI分类。本文中提出的模型在我们的TBFI数据集上产生了一个最终结果,对传统孟加拉食物图像进行分类的平均准确率为98%。
Traditional Bengali Food Classification Using Convolutional Neural Network
Image classification is turning into a significant and promising perspective in the fields of object recognition using computer vision. However, researchers have barely scratched the superficials of food image classification till now. To evaluate the dietary aptitudes of people from various ethnicities, the classification of their traditional foods makes a huge impact. That’s what steered us into the classification of seven traditional foods in Bangladesh. In this regard, our key contribution to this aspect is the development of a dataset of Traditional Bengali Food Image (TBFI) including images of seven different classes of traditional Bengali foods: Biriyani, Panta Ilish, Khichuri, Fuchka, Roshogolla, Dim Vuna & Kala Vuna. For this, a scratch model incorporating Convolutional Neural Network (CNN) has been developed, rectifying to another vital contribution. As conventional Neural Network doesn’t perform well in case of image datasets, the CNN approach has been followed in view of its high accuracy, computational power with efficiency and automatic recognition of important features without any human oversight. Moreover, transfer learning approach with fine tuned VGG16 has also been used for TBFI classification. The proposed model in this paper has generated a culminated outcome upon our TBFI dataset with an average accuracy of 98% in classifying the traditional Bengali food images.