Hasan Sildir, Emrullah Erturk, Deniz Tuna Edizer, Ozgun Deliismail, Yusuf Muhammed Durna, Bahtiyar Hamit
Knowledge-based training of learning architectures under input sensitivity constraints for improved explainability
Computers & Chemical Engineering, Volume 204, Article 109382. Published 2025-09-01. Journal Article.
DOI: 10.1016/j.compchemeng.2025.109382
https://www.sciencedirect.com/science/article/pii/S0098135425003850
Abstract
The traditional machine learning (ML) training problem is unconstrained and lacks an explicit formulation of the underlying driving phenomena. Such a formulation, based solely on experimental data, does not guarantee that the trained model captures the qualitative relationships among variables, owing to many theoretical issues in the optimization task. This study further tightens Artificial Neural Network (ANN) training by including input sensitivities as additional constraints, and applies the approach to regression and classification tasks based on literature data. In theory, such a sensitivity represents the direction of change of the target variable per change in the indicator measurements. The resulting nonlinear optimization problem is solved through a rigorous solver and includes the sensitivity expressions through algorithmic differentiation. Compared to traditional methods, and at the cost of an acceptable decrease in prediction capability, the proposed model delivers more intuitive, explainable, and experimentally verifiable predictions under input variable variations, is more robust to overfitting, and serves robust identification tasks. A classification case study covers the development of a patient-oriented clinical decision support system based on the impact of cancer-indicating variables. A competitive test prediction accuracy is obtained compared to commonly used algorithms, despite a 10% decrease on the training set. The regression case study is built upon energy load estimation and accounts for prominent considerations in obtaining the desired sensitivity patterns; there, the proposed methodology shows a significant accuracy drop compared to some formulations in order to honor the knowledge patterns. The approach delivers a pattern compatible with practitioner expertise and is compared to widely used machine learning algorithms, whose performance is evaluated through common statistics in addition to multi-variable response graphs.
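To make the notion of an input sensitivity constraint concrete, the following is a minimal sketch and not the authors' formulation. It assumes a one-hidden-layer tanh network, derives the sensitivities dy/dx_i by hand with the chain rule (the abstract refers to algorithmic differentiation instead), and checks a sign requirement on each sensitivity at a few sample points. All function names, weights, and sample points are hypothetical and chosen only for illustration.

```python
import math

def forward(W, b, v, c, x):
    """One-hidden-layer tanh network: y = c + sum_j v[j] * tanh(W[j] . x + b[j])."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(Wj, x)) + bj)
              for Wj, bj in zip(W, b)]
    y = c + sum(vj * hj for vj, hj in zip(v, hidden))
    return y, hidden

def input_sensitivities(W, b, v, c, x):
    """Chain rule through tanh: dy/dx_i = sum_j v[j] * (1 - h_j**2) * W[j][i]."""
    _, hidden = forward(W, b, v, c, x)
    return [sum(vj * (1.0 - hj * hj) * Wj[i] for vj, hj, Wj in zip(v, hidden, W))
            for i in range(len(x))]

def max_sign_violation(W, b, v, c, samples, required_signs):
    """Largest violation of required_signs[i] * dy/dx_i >= 0 over all samples.

    A return value of 0.0 means every sign requirement holds at every sample.
    """
    worst = 0.0
    for x in samples:
        grads = input_sensitivities(W, b, v, c, x)
        for g, s in zip(grads, required_signs):
            worst = max(worst, -s * g)
    return worst

# Hypothetical parameters: the output should rise with input 0 and fall with input 1.
W = [[0.8, -0.5], [0.3, -1.2]]   # hidden-layer weights, one row per hidden unit
b = [0.1, -0.2]                  # hidden-layer biases
v = [1.0, 0.7]                   # output weights
c = 0.0                          # output bias
samples = [[0.0, 0.0], [1.0, -1.0], [-0.5, 2.0]]

satisfied = max_sign_violation(W, b, v, c, samples, [+1, -1])
violated = max_sign_violation(W, b, v, c, samples, [-1, +1])
```

In a constrained training problem of the kind the abstract describes, inequalities such as `required_signs[i] * dy/dx_i >= 0` would be handed to a nonlinear programming solver alongside the prediction loss; this sketch only evaluates them for fixed weights and does not perform any training.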
About the journal:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.