{"title":"An Instance Segmentation approach to Food Calorie Estimation using Mask R-CNN","authors":"Parth Poply, A. J.","doi":"10.1145/3432291.3432295","DOIUrl":null,"url":null,"abstract":"The aim of this paper is to build a Deep Learning and Computer vision-based model for estimating the calorie contents of any food item (to an extent) using its picture. Deep Learning-based Convolutional Neural Network (CNN) called Mask R-CNN is used to perform the task of instance segmentation. The Mask R-CNN recognizes distinct instances of distinct food objects and outputs a mask for the food objects. The surface area of the detected food item(s) is then computed using the mask. The surface area along with the calorie per square inch value of the food item is used to estimate the calories present in the food. The developed model achieves a mean average precision (mAP) of about 93.7% on food item detection and an accuracy of about 95.5% on calorie estimation.","PeriodicalId":126684,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3432291.3432295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The aim of this paper is to build a Deep Learning and Computer vision-based model for estimating the calorie contents of any food item (to an extent) using its picture. Deep Learning-based Convolutional Neural Network (CNN) called Mask R-CNN is used to perform the task of instance segmentation. The Mask R-CNN recognizes distinct instances of distinct food objects and outputs a mask for the food objects. The surface area of the detected food item(s) is then computed using the mask. The surface area along with the calorie per square inch value of the food item is used to estimate the calories present in the food. The developed model achieves a mean average precision (mAP) of about 93.7% on food item detection and an accuracy of about 95.5% on calorie estimation.