重新定义ADHD诊断中的参数效率：一个轻量级的注意力驱动的kolmogorov-arnold网络，降低了参数复杂性和一种新的激活函数

IF 2.1 4区医学 Q3 CLINICAL NEUROLOGY

Psychiatry Research: Neuroimaging Pub Date : 2025-06-13 DOI:10.1016/j.pscychresns.2025.112016

Deepika, Meghna Sharma, Shaveta Arora

{"title":"重新定义ADHD诊断中的参数效率：一个轻量级的注意力驱动的kolmogorov-arnold网络，降低了参数复杂性和一种新的激活函数","authors":"Deepika, Meghna Sharma, Shaveta Arora","doi":"10.1016/j.pscychresns.2025.112016","DOIUrl":null,"url":null,"abstract":"<div><div>As deep learning continues to advance in medical analysis, the increasing complexity of models, particularly Convolutional Neural Networks (CNNs), presents significant challenges related to interpretability, computational costs, and real-world applicability. These issues are critical in the medical domain, e.g., Attention Deficit Hyperactivity Disorder (ADHD) diagnosis, where model efficiency and interpretability are paramount. This paper proposes a novel parameter-efficient framework based on the Kolmogorov-Arnold Network (KAN) to overcome these challenges. Unlike CNNs, KAN restructures feature transformations, significantly reducing parameter overhead while preserving high classification accuracy. An attention-driven feature selection mechanism dynamically prioritizes the most significant features, minimizing irrelevant features and unnecessary computational load. Recognizing the complex and diverse nature of ADHD- related brain connectivity features, a novel activation function with learnable coefficients is introduced, enabling adaptive transformation based on specific data patterns. To further enhance model generalization, an advanced sliding window-based data augmentation technique is incorporated to meet substantial data requirements for training. Extensive experimentation on the benchmark ADHD-200 dataset demonstrates the model's superiority, achieving an accuracy of 79.25 %, an F1-score of 78. 75 % and a precision of 78.23 %, surpassing many state-of-the-art ADHD studies. Remarkably, these results are achieved using only a few thousand parameters compared to the millions required by many existing approaches, making it valuable for various resource-constrained researchers and organizations. The proposed framework, seamlessly fusing KAN, attention-driven feature selection, adaptive activation, and robust data augmentation, achieves substantial parameter reduction with enhanced performance. This lightweight architecture, combined with superior performance and interpretability, makes the proposed model highly promising for ADHD diagnosis and other complex medical applications.</div></div>","PeriodicalId":20776,"journal":{"name":"Psychiatry Research: Neuroimaging","volume":"351 ","pages":"Article 112016"},"PeriodicalIF":2.1000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Redefining parameter-efficiency in ADHD diagnosis: A lightweight attention-driven kolmogorov-arnold network with reduced parameter complexity and a novel activation function\",\"authors\":\"Deepika, Meghna Sharma, Shaveta Arora\",\"doi\":\"10.1016/j.pscychresns.2025.112016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As deep learning continues to advance in medical analysis, the increasing complexity of models, particularly Convolutional Neural Networks (CNNs), presents significant challenges related to interpretability, computational costs, and real-world applicability. These issues are critical in the medical domain, e.g., Attention Deficit Hyperactivity Disorder (ADHD) diagnosis, where model efficiency and interpretability are paramount. This paper proposes a novel parameter-efficient framework based on the Kolmogorov-Arnold Network (KAN) to overcome these challenges. Unlike CNNs, KAN restructures feature transformations, significantly reducing parameter overhead while preserving high classification accuracy. An attention-driven feature selection mechanism dynamically prioritizes the most significant features, minimizing irrelevant features and unnecessary computational load. Recognizing the complex and diverse nature of ADHD- related brain connectivity features, a novel activation function with learnable coefficients is introduced, enabling adaptive transformation based on specific data patterns. To further enhance model generalization, an advanced sliding window-based data augmentation technique is incorporated to meet substantial data requirements for training. Extensive experimentation on the benchmark ADHD-200 dataset demonstrates the model's superiority, achieving an accuracy of 79.25 %, an F1-score of 78. 75 % and a precision of 78.23 %, surpassing many state-of-the-art ADHD studies. Remarkably, these results are achieved using only a few thousand parameters compared to the millions required by many existing approaches, making it valuable for various resource-constrained researchers and organizations. The proposed framework, seamlessly fusing KAN, attention-driven feature selection, adaptive activation, and robust data augmentation, achieves substantial parameter reduction with enhanced performance. This lightweight architecture, combined with superior performance and interpretability, makes the proposed model highly promising for ADHD diagnosis and other complex medical applications.</div></div>\",\"PeriodicalId\":20776,\"journal\":{\"name\":\"Psychiatry Research: Neuroimaging\",\"volume\":\"351 \",\"pages\":\"Article 112016\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychiatry Research: Neuroimaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092549272500071X\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychiatry Research: Neuroimaging","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092549272500071X","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

摘要

随着深度学习在医学分析中的不断发展，模型，特别是卷积神经网络（cnn）的复杂性日益增加，在可解释性、计算成本和现实世界的适用性方面提出了重大挑战。这些问题在医学领域至关重要，例如，注意缺陷多动障碍（ADHD）的诊断，其中模型的效率和可解释性是至关重要的。本文提出了一种基于Kolmogorov-Arnold网络（KAN）的参数高效框架来克服这些挑战。与cnn不同，KAN对特征变换进行重构，在保持较高分类精度的同时显著降低了参数开销。注意驱动的特征选择机制动态地优先考虑最重要的特征，最小化不相关的特征和不必要的计算负荷。认识到ADHD相关大脑连接特征的复杂性和多样性，引入了一种具有可学习系数的新颖激活函数，实现了基于特定数据模式的自适应转换。为了进一步提高模型的泛化能力，采用了一种先进的基于滑动窗口的数据增强技术来满足训练的大量数据需求。在基准ADHD-200数据集上进行的大量实验证明了该模型的优越性，准确率达到79.25%，f1得分为78分。精确度为78.23%，超过了许多最先进的ADHD研究。值得注意的是，与许多现有方法所需的数百万个参数相比，这些结果仅使用了几千个参数，使其对各种资源受限的研究人员和组织具有价值。所提出的框架无缝融合了KAN、注意力驱动特征选择、自适应激活和鲁棒数据增强，在提高性能的同时实现了大量的参数减少。这种轻量级的架构，加上优越的性能和可解释性，使得所提出的模型在ADHD诊断和其他复杂的医学应用中非常有前途。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Redefining parameter-efficiency in ADHD diagnosis: A lightweight attention-driven kolmogorov-arnold network with reduced parameter complexity and a novel activation function

As deep learning continues to advance in medical analysis, the increasing complexity of models, particularly Convolutional Neural Networks (CNNs), presents significant challenges related to interpretability, computational costs, and real-world applicability. These issues are critical in the medical domain, e.g., Attention Deficit Hyperactivity Disorder (ADHD) diagnosis, where model efficiency and interpretability are paramount. This paper proposes a novel parameter-efficient framework based on the Kolmogorov-Arnold Network (KAN) to overcome these challenges. Unlike CNNs, KAN restructures feature transformations, significantly reducing parameter overhead while preserving high classification accuracy. An attention-driven feature selection mechanism dynamically prioritizes the most significant features, minimizing irrelevant features and unnecessary computational load. Recognizing the complex and diverse nature of ADHD- related brain connectivity features, a novel activation function with learnable coefficients is introduced, enabling adaptive transformation based on specific data patterns. To further enhance model generalization, an advanced sliding window-based data augmentation technique is incorporated to meet substantial data requirements for training. Extensive experimentation on the benchmark ADHD-200 dataset demonstrates the model's superiority, achieving an accuracy of 79.25 %, an F1-score of 78. 75 % and a precision of 78.23 %, surpassing many state-of-the-art ADHD studies. Remarkably, these results are achieved using only a few thousand parameters compared to the millions required by many existing approaches, making it valuable for various resource-constrained researchers and organizations. The proposed framework, seamlessly fusing KAN, attention-driven feature selection, adaptive activation, and robust data augmentation, achieves substantial parameter reduction with enhanced performance. This lightweight architecture, combined with superior performance and interpretability, makes the proposed model highly promising for ADHD diagnosis and other complex medical applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Psychiatry Research: Neuroimaging 医学-精神病学

CiteScore

3.80

自引率

0.00%

发文量

审稿时长

22.5 weeks

期刊介绍： The Neuroimaging section of Psychiatry Research publishes manuscripts on positron emission tomography, magnetic resonance imaging, computerized electroencephalographic topography, regional cerebral blood flow, computed tomography, magnetoencephalography, autoradiography, post-mortem regional analyses, and other imaging techniques. Reports concerning results in psychiatric disorders, dementias, and the effects of behaviorial tasks and pharmacological treatments are featured. We also invite manuscripts on the methods of obtaining images and computer processing of the images themselves. Selected case reports are also published.