基于“steam”方法的巧克力产品类型聚类缺失值插值优化神经网络算法

Mason Chen, Chen Chen
{"title":"基于“steam”方法的巧克力产品类型聚类缺失值插值优化神经网络算法","authors":"Mason Chen, Chen Chen","doi":"10.29007/4jgz","DOIUrl":null,"url":null,"abstract":"A “STEAMS” (Science, Technology, Engineering, Artificial Intelligence, Math, Statistics) approach was conducted to handle the missing value imputation of clustering Chocolate Science patterns. Hierarchical clustering and dendrogram analysis were utilized to cluster the commercial chocolate products into different product groups which can indicate the nutrition compositions and product health. To further handle the missing value imputation, a neural network algorithm was utilized to predict the missed Cocoa percentage (Cocoa%), based on other available nutritional components. The Hyperbolic Tangent activation function was used to create the hidden layer with three nodes. Neural networks are very flexible models and tend to over-fit data. A Definitive Screening Design (DSD) was conducted to optimize the neural setting in order to minimize the over-fit concern. Both the Goodness Fit of Training set and Validation set can reach 99% R-Square. The Profiler Sensitivity analysis has shown that the Chocolate Type and Vitamin C are the most sensitive factors to predict the missed Cocoa%. The results also indicated that the “Fruit” Chocolate can be added as the 4 th Chocolate Type. The Neural Black-Box algorithm revealed the hidden Chocolate Science and Product. This paper demonstrates the power of using the Engineering Design of Experiment (DOE) and Neural Network algorithm through STEAMS for the particular application of modeling chocolate products.","PeriodicalId":264035,"journal":{"name":"International Conference on Computers and Their Applications","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Optimize Neural Network Algorithm of Missing Value Imputation for Clustering Chocolate Product Types Following \\\"STEAMS\\\" Methodology\",\"authors\":\"Mason Chen, Chen Chen\",\"doi\":\"10.29007/4jgz\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A “STEAMS” (Science, Technology, Engineering, Artificial Intelligence, Math, Statistics) approach was conducted to handle the missing value imputation of clustering Chocolate Science patterns. Hierarchical clustering and dendrogram analysis were utilized to cluster the commercial chocolate products into different product groups which can indicate the nutrition compositions and product health. To further handle the missing value imputation, a neural network algorithm was utilized to predict the missed Cocoa percentage (Cocoa%), based on other available nutritional components. The Hyperbolic Tangent activation function was used to create the hidden layer with three nodes. Neural networks are very flexible models and tend to over-fit data. A Definitive Screening Design (DSD) was conducted to optimize the neural setting in order to minimize the over-fit concern. Both the Goodness Fit of Training set and Validation set can reach 99% R-Square. The Profiler Sensitivity analysis has shown that the Chocolate Type and Vitamin C are the most sensitive factors to predict the missed Cocoa%. The results also indicated that the “Fruit” Chocolate can be added as the 4 th Chocolate Type. The Neural Black-Box algorithm revealed the hidden Chocolate Science and Product. This paper demonstrates the power of using the Engineering Design of Experiment (DOE) and Neural Network algorithm through STEAMS for the particular application of modeling chocolate products.\",\"PeriodicalId\":264035,\"journal\":{\"name\":\"International Conference on Computers and Their Applications\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Computers and Their Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29007/4jgz\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Computers and Their Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29007/4jgz","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

采用“steam”(科学,技术,工程,人工智能,数学,统计)方法处理聚类巧克力科学模式的缺失值插补。采用分层聚类和树形图分析方法,将商品巧克力产品聚类成不同的产品组,以反映产品的营养成分和健康状况。为了进一步处理缺失值的输入,基于其他可用的营养成分,利用神经网络算法预测缺失可可百分比(Cocoa%)。使用双曲正切激活函数创建包含三个节点的隐藏层。神经网络是非常灵活的模型,倾向于过度拟合数据。最终筛选设计(DSD)进行了优化神经设置,以尽量减少过度拟合的担忧。训练集和验证集的拟合优度均可达到99% r方。Profiler敏感性分析表明,巧克力类型和维生素C是预测可可遗漏率的最敏感因素。结果还表明,“水果”巧克力可以添加为第4种巧克力类型。神经黑箱算法揭示了隐藏的巧克力科学和产品。本文展示了通过steam使用工程实验设计(DOE)和神经网络算法对巧克力产品建模的特殊应用的强大功能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimize Neural Network Algorithm of Missing Value Imputation for Clustering Chocolate Product Types Following "STEAMS" Methodology
A “STEAMS” (Science, Technology, Engineering, Artificial Intelligence, Math, Statistics) approach was conducted to handle the missing value imputation of clustering Chocolate Science patterns. Hierarchical clustering and dendrogram analysis were utilized to cluster the commercial chocolate products into different product groups which can indicate the nutrition compositions and product health. To further handle the missing value imputation, a neural network algorithm was utilized to predict the missed Cocoa percentage (Cocoa%), based on other available nutritional components. The Hyperbolic Tangent activation function was used to create the hidden layer with three nodes. Neural networks are very flexible models and tend to over-fit data. A Definitive Screening Design (DSD) was conducted to optimize the neural setting in order to minimize the over-fit concern. Both the Goodness Fit of Training set and Validation set can reach 99% R-Square. The Profiler Sensitivity analysis has shown that the Chocolate Type and Vitamin C are the most sensitive factors to predict the missed Cocoa%. The results also indicated that the “Fruit” Chocolate can be added as the 4 th Chocolate Type. The Neural Black-Box algorithm revealed the hidden Chocolate Science and Product. This paper demonstrates the power of using the Engineering Design of Experiment (DOE) and Neural Network algorithm through STEAMS for the particular application of modeling chocolate products.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信