Deep learning for ECoG brain-computer interface: end-to-end vs. hand-crafted features

International Conference of the Italian Association for Artificial Intelligence Pub Date : 2022-10-05 DOI:10.48550/arXiv.2210.02544

Maciej Śliwowski, Matthieu Martin, A. Souloumiac, P. Blanchart, T. Aksenova

{"title":"Deep learning for ECoG brain-computer interface: end-to-end vs. hand-crafted features","authors":"Maciej Śliwowski, Matthieu Martin, A. Souloumiac, P. Blanchart, T. Aksenova","doi":"10.48550/arXiv.2210.02544","DOIUrl":null,"url":null,"abstract":"In brain signal processing, deep learning (DL) models have become commonly used. However, the performance gain from using end-to-end DL models compared to conventional ML approaches is usually significant but moderate, typically at the cost of increased computational load and deteriorated explainability. The core idea behind deep learning approaches is scaling the performance with bigger datasets. However, brain signals are temporal data with a low signal-to-noise ratio, uncertain labels, and nonstationary data in time. Those factors may influence the training process and slow down the models' performance improvement. These factors' influence may differ for end-to-end DL model and one using hand-crafted features. As not studied before, this paper compares models that use raw ECoG signal and time-frequency features for BCI motor imagery decoding. We investigate whether the current dataset size is a stronger limitation for any models. Finally, obtained filters were compared to identify differences between hand-crafted features and optimized with backpropagation. To compare the effectiveness of both strategies, we used a multilayer perceptron and a mix of convolutional and LSTM layers that were already proved effective in this task. The analysis was performed on the long-term clinical trial database (almost 600 minutes of recordings) of a tetraplegic patient executing motor imagery tasks for 3D hand translation. For a given dataset, the results showed that end-to-end training might not be significantly better than the hand-crafted features-based model. The performance gap is reduced with bigger datasets, but considering the increased computational load, end-to-end training may not be profitable for this application.","PeriodicalId":293643,"journal":{"name":"International Conference of the Italian Association for Artificial Intelligence","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference of the Italian Association for Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.02544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In brain signal processing, deep learning (DL) models have become commonly used. However, the performance gain from using end-to-end DL models compared to conventional ML approaches is usually significant but moderate, typically at the cost of increased computational load and deteriorated explainability. The core idea behind deep learning approaches is scaling the performance with bigger datasets. However, brain signals are temporal data with a low signal-to-noise ratio, uncertain labels, and nonstationary data in time. Those factors may influence the training process and slow down the models' performance improvement. These factors' influence may differ for end-to-end DL model and one using hand-crafted features. As not studied before, this paper compares models that use raw ECoG signal and time-frequency features for BCI motor imagery decoding. We investigate whether the current dataset size is a stronger limitation for any models. Finally, obtained filters were compared to identify differences between hand-crafted features and optimized with backpropagation. To compare the effectiveness of both strategies, we used a multilayer perceptron and a mix of convolutional and LSTM layers that were already proved effective in this task. The analysis was performed on the long-term clinical trial database (almost 600 minutes of recordings) of a tetraplegic patient executing motor imagery tasks for 3D hand translation. For a given dataset, the results showed that end-to-end training might not be significantly better than the hand-crafted features-based model. The performance gap is reduced with bigger datasets, but considering the increased computational load, end-to-end training may not be profitable for this application.

查看原文本刊更多论文

ECoG脑机接口的深度学习:端到端vs.手工功能

在脑信号处理中，深度学习(DL)模型已经被广泛使用。然而，与传统的机器学习方法相比，使用端到端深度学习模型的性能提升通常是显著的，但幅度不大，通常是以增加计算负载和降低可解释性为代价的。深度学习方法背后的核心思想是在更大的数据集上扩展性能。然而，大脑信号是时间数据，具有低信噪比、标签不确定、时间上的非平稳等特点。这些因素可能会影响训练过程，减缓模型性能的提高。对于端到端深度学习模型和使用手工特征的模型，这些因素的影响可能有所不同。本文比较了使用原始ECoG信号和时频特征进行脑机接口运动图像解码的模型。我们研究当前的数据集大小是否对任何模型都有更强的限制。最后，将得到的滤波器进行比较，以识别手工制作的特征与反向传播优化的特征之间的差异。为了比较两种策略的有效性，我们使用了多层感知器以及卷积层和LSTM层的混合，这些层已经在该任务中被证明是有效的。分析是在长期临床试验数据库(近600分钟的记录)上进行的，该数据库记录了一名四肢瘫痪患者执行3D手部翻译的运动图像任务。对于给定的数据集，结果表明端到端训练可能不会比手工制作的基于特征的模型好得多。使用更大的数据集可以减少性能差距，但是考虑到增加的计算负载，端到端训练可能对这个应用程序没有好处。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference of the Italian Association for Artificial Intelligence

自引率

0.00%

发文量