在输入敏感性约束下的基于知识的学习架构训练,以提高可解释性

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Hasan Sildir , Emrullah Erturk , Deniz Tuna Edizer , Ozgun Deliismail , Yusuf Muhammed Durna , Bahtiyar Hamit
{"title":"在输入敏感性约束下的基于知识的学习架构训练,以提高可解释性","authors":"Hasan Sildir ,&nbsp;Emrullah Erturk ,&nbsp;Deniz Tuna Edizer ,&nbsp;Ozgun Deliismail ,&nbsp;Yusuf Muhammed Durna ,&nbsp;Bahtiyar Hamit","doi":"10.1016/j.compchemeng.2025.109382","DOIUrl":null,"url":null,"abstract":"<div><div>The traditional machine learning (ML) training problem is unconstrained and lacks an explicit formulation of the underlying driving phenomena. Such a formulation, based solely on experimental data, does not ensure the delivery of qualitative knowledge among variables due to many theoretical issues in the optimization task. This study further tightens Artificial Neural Networks (ANNs) training by including input sensitivities as additional constraints and applies to regression and classification tasks based on literature data. In theory, such sensitivity represents the change direction of the target variable per change in measurements from indicators. The resulting nonlinear optimization problem is solved th rough a rigorous solver and includes the sensitivity expressions through algorithmic differentiation. Compared to traditional methods, with an acceptable decrease in the prediction capability, the proposed model delivers more intuitive, explainable, and experimentally verifiable predictions under input variable variations, under robustness to overfitting, while serving robust identification tasks. A classification case study includes a patient-oriented clinical decision support system development based on the impact of cancer-indicating variables. A competitive test prediction accuracy is obtained compared to commonly used algorithms despite 10 % decrease in the training. The regression case is built upon the energy load estimation to account for prominent considerations to obtain desired sensitivity patterns and proposed methodology delivers significant accuracy drop compared to some formulations to address knowledge patterns. The approach delivers a compatible pattern with practitioner expertise and is compared to widely used machine learning algorithms, whose performances are evaluated through common statistics in addition to multi-variable response graphs.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"204 ","pages":"Article 109382"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge-based training of learning architectures under input sensitivity constraints for improved explainability\",\"authors\":\"Hasan Sildir ,&nbsp;Emrullah Erturk ,&nbsp;Deniz Tuna Edizer ,&nbsp;Ozgun Deliismail ,&nbsp;Yusuf Muhammed Durna ,&nbsp;Bahtiyar Hamit\",\"doi\":\"10.1016/j.compchemeng.2025.109382\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The traditional machine learning (ML) training problem is unconstrained and lacks an explicit formulation of the underlying driving phenomena. Such a formulation, based solely on experimental data, does not ensure the delivery of qualitative knowledge among variables due to many theoretical issues in the optimization task. This study further tightens Artificial Neural Networks (ANNs) training by including input sensitivities as additional constraints and applies to regression and classification tasks based on literature data. In theory, such sensitivity represents the change direction of the target variable per change in measurements from indicators. The resulting nonlinear optimization problem is solved th rough a rigorous solver and includes the sensitivity expressions through algorithmic differentiation. Compared to traditional methods, with an acceptable decrease in the prediction capability, the proposed model delivers more intuitive, explainable, and experimentally verifiable predictions under input variable variations, under robustness to overfitting, while serving robust identification tasks. A classification case study includes a patient-oriented clinical decision support system development based on the impact of cancer-indicating variables. A competitive test prediction accuracy is obtained compared to commonly used algorithms despite 10 % decrease in the training. The regression case is built upon the energy load estimation to account for prominent considerations to obtain desired sensitivity patterns and proposed methodology delivers significant accuracy drop compared to some formulations to address knowledge patterns. The approach delivers a compatible pattern with practitioner expertise and is compared to widely used machine learning algorithms, whose performances are evaluated through common statistics in addition to multi-variable response graphs.</div></div>\",\"PeriodicalId\":286,\"journal\":{\"name\":\"Computers & Chemical Engineering\",\"volume\":\"204 \",\"pages\":\"Article 109382\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Chemical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0098135425003850\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425003850","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

传统的机器学习(ML)训练问题是不受约束的,缺乏对潜在驱动现象的明确表述。由于优化任务中存在许多理论问题,这种仅基于实验数据的公式不能保证变量间的定性知识传递。本研究通过将输入灵敏度作为附加约束进一步收紧人工神经网络(ann)的训练,并将其应用于基于文献数据的回归和分类任务。理论上,这种灵敏度代表了指标测量值每变化时目标变量的变化方向。所得到的非线性优化问题通过严格求解器求解,并通过算法微分包含灵敏度表达式。与传统方法相比,在可接受的预测能力下降的情况下,该模型在输入变量变化、对过拟合的鲁棒性下提供了更直观、可解释和实验可验证的预测,同时服务于鲁棒识别任务。分类案例研究包括基于癌症指示变量影响的以患者为导向的临床决策支持系统开发。在训练量降低10%的情况下,与常用算法相比,获得了具有竞争力的测试预测精度。回归案例建立在能量负荷估计的基础上,考虑了获得所需灵敏度模式的重要考虑因素,与解决知识模式的一些公式相比,所提出的方法提供了显着的准确性下降。该方法提供了与从业者专业知识兼容的模式,并与广泛使用的机器学习算法进行了比较,机器学习算法的性能通过常见统计数据和多变量响应图进行评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Knowledge-based training of learning architectures under input sensitivity constraints for improved explainability

Knowledge-based training of learning architectures under input sensitivity constraints for improved explainability
The traditional machine learning (ML) training problem is unconstrained and lacks an explicit formulation of the underlying driving phenomena. Such a formulation, based solely on experimental data, does not ensure the delivery of qualitative knowledge among variables due to many theoretical issues in the optimization task. This study further tightens Artificial Neural Networks (ANNs) training by including input sensitivities as additional constraints and applies to regression and classification tasks based on literature data. In theory, such sensitivity represents the change direction of the target variable per change in measurements from indicators. The resulting nonlinear optimization problem is solved th rough a rigorous solver and includes the sensitivity expressions through algorithmic differentiation. Compared to traditional methods, with an acceptable decrease in the prediction capability, the proposed model delivers more intuitive, explainable, and experimentally verifiable predictions under input variable variations, under robustness to overfitting, while serving robust identification tasks. A classification case study includes a patient-oriented clinical decision support system development based on the impact of cancer-indicating variables. A competitive test prediction accuracy is obtained compared to commonly used algorithms despite 10 % decrease in the training. The regression case is built upon the energy load estimation to account for prominent considerations to obtain desired sensitivity patterns and proposed methodology delivers significant accuracy drop compared to some formulations to address knowledge patterns. The approach delivers a compatible pattern with practitioner expertise and is compared to widely used machine learning algorithms, whose performances are evaluated through common statistics in addition to multi-variable response graphs.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信