{"title":"A framework for Interpretable deep learning in cross-subject detection of event-related potentials","authors":"Shayan Jalilpour , Gernot Müller-Putz","doi":"10.1016/j.engappai.2024.109642","DOIUrl":null,"url":null,"abstract":"<div><div>Event-related potential-based Brain-Computer Interfaces are becoming widely popular due to their ability to send commands quickly with high accuracy. However, the stationary characteristics of electroencephalographic signals, coupled with their low signal-to-noise ratio, lead to variations in amplitude, time period, and latency in the patterns of event-related potentials across different trials, sessions, days and subjects. Conventional feature extraction and machine learning algorithms are not designed to handle these differences, requiring the development of methods that can address these variations. Here, we propose a novel lightweight deep neural network for event-related potential classification, consisting of three modules. In this model, we have a spatio-temporal module that learns local features simultaneously across channels and time points. Following this, there's a component extractor module comprising depthwise convolutions, inspired by mixed depthwise convolutions, to capture the event-related potential characteristics with different temporal durations. Lastly, an advanced temporal layer addresses event-related potential shape and scale variations using deformable convolutions. We conducted experiments on event-related potential detection in a subject-independent scenario using one error-related negativity potential dataset and three perturbation-evoked potential datasets. Comparisons were made with established methods including two conventional machine learning algorithms and three well-known deep learning architectures, demonstrating that our model outperformed them in terms of classification accuracy and parameter efficiency. In our analysis, we aimed to understand the model's performance using gradient-weighted class activation mapping and t-distributed stochastic neighbor embedding. These methods facilitated the visualization and interpretation of our model's effectiveness, providing insights into its relationship with the neuroscientific characteristics of event-related potentials.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109642"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624018001","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Event-related potential-based Brain-Computer Interfaces are becoming widely popular due to their ability to send commands quickly with high accuracy. However, the stationary characteristics of electroencephalographic signals, coupled with their low signal-to-noise ratio, lead to variations in amplitude, time period, and latency in the patterns of event-related potentials across different trials, sessions, days and subjects. Conventional feature extraction and machine learning algorithms are not designed to handle these differences, requiring the development of methods that can address these variations. Here, we propose a novel lightweight deep neural network for event-related potential classification, consisting of three modules. In this model, we have a spatio-temporal module that learns local features simultaneously across channels and time points. Following this, there's a component extractor module comprising depthwise convolutions, inspired by mixed depthwise convolutions, to capture the event-related potential characteristics with different temporal durations. Lastly, an advanced temporal layer addresses event-related potential shape and scale variations using deformable convolutions. We conducted experiments on event-related potential detection in a subject-independent scenario using one error-related negativity potential dataset and three perturbation-evoked potential datasets. Comparisons were made with established methods including two conventional machine learning algorithms and three well-known deep learning architectures, demonstrating that our model outperformed them in terms of classification accuracy and parameter efficiency. In our analysis, we aimed to understand the model's performance using gradient-weighted class activation mapping and t-distributed stochastic neighbor embedding. These methods facilitated the visualization and interpretation of our model's effectiveness, providing insights into its relationship with the neuroscientific characteristics of event-related potentials.
基于事件相关电位的脑机接口能够快速、准确地发送指令,因此广受欢迎。然而,脑电信号的静态特性加上其信噪比低,导致不同试验、会话、日期和受试者的事件相关电位模式在振幅、时间段和延迟方面存在差异。传统的特征提取和机器学习算法无法处理这些差异,因此需要开发能处理这些差异的方法。在此,我们提出了一种用于事件相关电位分类的新型轻量级深度神经网络,由三个模块组成。在这个模型中,我们有一个时空模块,可以跨信道和时间点同时学习局部特征。随后是一个由深度卷积组成的成分提取模块,其灵感来自于混合深度卷积,用于捕捉不同时间长度的事件相关电位特征。最后,高级时间层利用可变形卷积处理事件相关电位的形状和尺度变化。我们使用一个错误相关负性电位数据集和三个扰动诱发电位数据集,在不依赖受试者的情况下进行了事件相关电位检测实验。实验结果表明,我们的模型在分类准确性和参数效率方面都优于它们。在分析中,我们使用梯度加权类激活映射和 t 分布随机邻域嵌入来了解模型的性能。这些方法促进了模型效果的可视化和解释,使我们能够深入了解模型与事件相关电位的神经科学特征之间的关系。
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.