{"title":"基于深度神经网络的食品材料脆度分类。","authors":"Rafael Z. Lopes, Gustavo C. Dacanal","doi":"10.1111/jtxs.12792","DOIUrl":null,"url":null,"abstract":"<p>Crispness is a textural characteristic that influences consumer choices, requiring a comprehensive understanding for product customization. Previous studies employing neural networks focused on acquiring audio through mechanical crushing of crispy samples. This research investigates the representation of crispy sound in time intervals and frequency domains, identifying key parameters to distinguish different foods. Two machine learning architectures, multi-layer perceptron (MLP) and residual neural network (ResNet), were used to analyze mel frequency cepstral coefficients (MFCC) and discrete Fourier transform (DFT) data, respectively. The models achieved over 95% accuracy “in-sample” successfully classifying fried chicken, potato chips, and toast using randomly extracted audio from ASMR videos. The MLP (MFCC) model demonstrated superior robustness compared to ResNet and predicted external inputs, such as freshly toasted bread acquired by a microphone or ASMR audio of toast in milk. In contrast, the ResNet model proved to be more responsive to variations in DFT spectrum and unable to predict the similarity of external audio sources, making it useful for classifying pretrained “in-samples”. These findings are useful for classifying crispness among individual food sources. Additionally, the study explores the promising utilization of ASMR audio from Internet platforms to pretrain artificial neural network models, expanding the dataset for investigating the texture of crispy foods.</p>","PeriodicalId":17175,"journal":{"name":"Journal of texture studies","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of crispness of food materials by deep neural networks\",\"authors\":\"Rafael Z. Lopes, Gustavo C. Dacanal\",\"doi\":\"10.1111/jtxs.12792\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Crispness is a textural characteristic that influences consumer choices, requiring a comprehensive understanding for product customization. Previous studies employing neural networks focused on acquiring audio through mechanical crushing of crispy samples. This research investigates the representation of crispy sound in time intervals and frequency domains, identifying key parameters to distinguish different foods. Two machine learning architectures, multi-layer perceptron (MLP) and residual neural network (ResNet), were used to analyze mel frequency cepstral coefficients (MFCC) and discrete Fourier transform (DFT) data, respectively. The models achieved over 95% accuracy “in-sample” successfully classifying fried chicken, potato chips, and toast using randomly extracted audio from ASMR videos. The MLP (MFCC) model demonstrated superior robustness compared to ResNet and predicted external inputs, such as freshly toasted bread acquired by a microphone or ASMR audio of toast in milk. In contrast, the ResNet model proved to be more responsive to variations in DFT spectrum and unable to predict the similarity of external audio sources, making it useful for classifying pretrained “in-samples”. These findings are useful for classifying crispness among individual food sources. Additionally, the study explores the promising utilization of ASMR audio from Internet platforms to pretrain artificial neural network models, expanding the dataset for investigating the texture of crispy foods.</p>\",\"PeriodicalId\":17175,\"journal\":{\"name\":\"Journal of texture studies\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of texture studies\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jtxs.12792\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of texture studies","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jtxs.12792","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
Classification of crispness of food materials by deep neural networks
Crispness is a textural characteristic that influences consumer choices, requiring a comprehensive understanding for product customization. Previous studies employing neural networks focused on acquiring audio through mechanical crushing of crispy samples. This research investigates the representation of crispy sound in time intervals and frequency domains, identifying key parameters to distinguish different foods. Two machine learning architectures, multi-layer perceptron (MLP) and residual neural network (ResNet), were used to analyze mel frequency cepstral coefficients (MFCC) and discrete Fourier transform (DFT) data, respectively. The models achieved over 95% accuracy “in-sample” successfully classifying fried chicken, potato chips, and toast using randomly extracted audio from ASMR videos. The MLP (MFCC) model demonstrated superior robustness compared to ResNet and predicted external inputs, such as freshly toasted bread acquired by a microphone or ASMR audio of toast in milk. In contrast, the ResNet model proved to be more responsive to variations in DFT spectrum and unable to predict the similarity of external audio sources, making it useful for classifying pretrained “in-samples”. These findings are useful for classifying crispness among individual food sources. Additionally, the study explores the promising utilization of ASMR audio from Internet platforms to pretrain artificial neural network models, expanding the dataset for investigating the texture of crispy foods.
期刊介绍:
The Journal of Texture Studies is a fully peer-reviewed international journal specialized in the physics, physiology, and psychology of food oral processing, with an emphasis on the food texture and structure, sensory perception and mouth-feel, food oral behaviour, food liking and preference. The journal was first published in 1969 and has been the primary source for disseminating advances in knowledge on all of the sciences that relate to food texture. In recent years, Journal of Texture Studies has expanded its coverage to a much broader range of texture research and continues to publish high quality original and innovative experimental-based (including numerical analysis and simulation) research concerned with all aspects of eating and food preference.
Journal of Texture Studies welcomes research articles, research notes, reviews, discussion papers, and communications from contributors of all relevant disciplines. Some key coverage areas/topics include (but not limited to):
• Physical, mechanical, and micro-structural principles of food texture
• Oral physiology
• Psychology and brain responses of eating and food sensory
• Food texture design and modification for specific consumers
• In vitro and in vivo studies of eating and swallowing
• Novel technologies and methodologies for the assessment of sensory properties
• Simulation and numerical analysis of eating and swallowing