{"title":"MonoKAN: Certified Monotonic Kolmogorov-Arnold Network","authors":"Alejandro Polo-Molina, David Alfaya, Jose Portela","doi":"arxiv-2409.11078","DOIUrl":null,"url":null,"abstract":"Artificial Neural Networks (ANNs) have significantly advanced various fields\nby effectively recognizing patterns and solving complex problems. Despite these\nadvancements, their interpretability remains a critical challenge, especially\nin applications where transparency and accountability are essential. To address\nthis, explainable AI (XAI) has made progress in demystifying ANNs, yet\ninterpretability alone is often insufficient. In certain applications, model\npredictions must align with expert-imposed requirements, sometimes exemplified\nby partial monotonicity constraints. While monotonic approaches are found in\nthe literature for traditional Multi-layer Perceptrons (MLPs), they still face\ndifficulties in achieving both interpretability and certified partial\nmonotonicity. Recently, the Kolmogorov-Arnold Network (KAN) architecture, based\non learnable activation functions parametrized as splines, has been proposed as\na more interpretable alternative to MLPs. Building on this, we introduce a\nnovel ANN architecture called MonoKAN, which is based on the KAN architecture\nand achieves certified partial monotonicity while enhancing interpretability.\nTo achieve this, we employ cubic Hermite splines, which guarantee monotonicity\nthrough a set of straightforward conditions. Additionally, by using positive\nweights in the linear combinations of these splines, we ensure that the network\npreserves the monotonic relationships between input and output. Our experiments\ndemonstrate that MonoKAN not only enhances interpretability but also improves\npredictive performance across the majority of benchmarks, outperforming\nstate-of-the-art monotonic MLP approaches.","PeriodicalId":501347,"journal":{"name":"arXiv - CS - Neural and Evolutionary Computing","volume":"46 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Neural and Evolutionary Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial Neural Networks (ANNs) have significantly advanced various fields
by effectively recognizing patterns and solving complex problems. Despite these
advancements, their interpretability remains a critical challenge, especially
in applications where transparency and accountability are essential. To address
this, explainable AI (XAI) has made progress in demystifying ANNs, yet
interpretability alone is often insufficient. In certain applications, model
predictions must align with expert-imposed requirements, sometimes exemplified
by partial monotonicity constraints. While monotonic approaches are found in
the literature for traditional Multi-layer Perceptrons (MLPs), they still face
difficulties in achieving both interpretability and certified partial
monotonicity. Recently, the Kolmogorov-Arnold Network (KAN) architecture, based
on learnable activation functions parametrized as splines, has been proposed as
a more interpretable alternative to MLPs. Building on this, we introduce a
novel ANN architecture called MonoKAN, which is based on the KAN architecture
and achieves certified partial monotonicity while enhancing interpretability.
To achieve this, we employ cubic Hermite splines, which guarantee monotonicity
through a set of straightforward conditions. Additionally, by using positive
weights in the linear combinations of these splines, we ensure that the network
preserves the monotonic relationships between input and output. Our experiments
demonstrate that MonoKAN not only enhances interpretability but also improves
predictive performance across the majority of benchmarks, outperforming
state-of-the-art monotonic MLP approaches.