探索深度学习和脑机接口可解释模型之间的权衡。

Advances in neural information processing systems Pub Date : 2024-01-01

Luis H Cubillos, Guy Revach, Matthew J Mender, Joseph T Costello, Hisham Temmar, Aren Hite, Diksha Zutshi, Dylan M Wallace, Xiaoyong Ni, Madison M Kelberman, Matthew S Willsey, Ruud J G van Sloun, Nir Shlezinger, Parag Patil, Anne Draelos, Cynthia A Chestek

{"title":"探索深度学习和脑机接口可解释模型之间的权衡。","authors":"Luis H Cubillos, Guy Revach, Matthew J Mender, Joseph T Costello, Hisham Temmar, Aren Hite, Diksha Zutshi, Dylan M Wallace, Xiaoyong Ni, Madison M Kelberman, Matthew S Willsey, Ruud J G van Sloun, Nir Shlezinger, Parag Patil, Anne Draelos, Cynthia A Chestek","doi":"","DOIUrl":null,"url":null,"abstract":"People with brain or spinal cord-related paralysis often need to rely on others for basic tasks, limiting their independence. A potential solution is brain-machine interfaces (BMIs), which could allow them to voluntarily control external devices (e.g., robotic arm) by decoding brain activity to movement commands. In the past decade, deep-learning decoders have achieved state-of-the-art results in most BMI applications, ranging from speech production to finger control. However, the 'black-box' nature of deep-learning decoders could lead to unexpected behaviors, resulting in major safety concerns in real-world physical control scenarios. In these applications, explainable but lower-performing decoders, such as the Kalman filter (KF), remain the norm. In this study, we designed a BMI decoder based on KalmanNet, an extension of the KF that augments its operation with recurrent neural networks to compute the Kalman gain. This results in a varying \"trust\" that shifts between inputs and dynamics. We used this algorithm to predict finger movements from the brain activity of two monkeys. We compared KalmanNet results offline (pre-recorded data, <math><mi>n</mi> <mspace></mspace> <mo>=</mo> <mspace></mspace> <mn>13</mn></math> days) and online (real-time predictions, <math><mi>n</mi> <mspace></mspace> <mo>=</mo> <mspace></mspace> <mn>5</mn></math> days) with a simple KF and two recent deep-learning algorithms: tcFNN (non-ReFIT version) and LSTM. KalmanNet achieved comparable or better results than other deep learning models in offline and online modes, relying on the dynamical model for stopping while depending more on neural inputs for initiating movements. We further validated this mechanism by implementing a heteroscedastic KF that used the same strategy, and it also approached state-of-the-art performance while remaining in the explainable domain of standard KFs. However, we also see two downsides to KalmanNet. KalmanNet shares the limited generalization ability of existing deep-learning decoders, and its usage of the KF as an inductive bias limits its performance in the presence of unseen noise distributions. Despite this trade-off, our analysis successfully integrates traditional controls and modern deep-learning approaches to motivate high-performing yet still explainable BMI designs.","PeriodicalId":72099,"journal":{"name":"Advances in neural information processing systems","volume":"37 ","pages":"133975-133998"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11996206/pdf/","citationCount":"0","resultStr":"{\"title\":\"Exploring the trade-off between deep-learning and explainable models for brain-machine interfaces.\",\"authors\":\"Luis H Cubillos, Guy Revach, Matthew J Mender, Joseph T Costello, Hisham Temmar, Aren Hite, Diksha Zutshi, Dylan M Wallace, Xiaoyong Ni, Madison M Kelberman, Matthew S Willsey, Ruud J G van Sloun, Nir Shlezinger, Parag Patil, Anne Draelos, Cynthia A Chestek\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"People with brain or spinal cord-related paralysis often need to rely on others for basic tasks, limiting their independence. A potential solution is brain-machine interfaces (BMIs), which could allow them to voluntarily control external devices (e.g., robotic arm) by decoding brain activity to movement commands. In the past decade, deep-learning decoders have achieved state-of-the-art results in most BMI applications, ranging from speech production to finger control. However, the 'black-box' nature of deep-learning decoders could lead to unexpected behaviors, resulting in major safety concerns in real-world physical control scenarios. In these applications, explainable but lower-performing decoders, such as the Kalman filter (KF), remain the norm. In this study, we designed a BMI decoder based on KalmanNet, an extension of the KF that augments its operation with recurrent neural networks to compute the Kalman gain. This results in a varying \\\"trust\\\" that shifts between inputs and dynamics. We used this algorithm to predict finger movements from the brain activity of two monkeys. We compared KalmanNet results offline (pre-recorded data, <math><mi>n</mi> <mspace></mspace> <mo>=</mo> <mspace></mspace> <mn>13</mn></math> days) and online (real-time predictions, <math><mi>n</mi> <mspace></mspace> <mo>=</mo> <mspace></mspace> <mn>5</mn></math> days) with a simple KF and two recent deep-learning algorithms: tcFNN (non-ReFIT version) and LSTM. KalmanNet achieved comparable or better results than other deep learning models in offline and online modes, relying on the dynamical model for stopping while depending more on neural inputs for initiating movements. We further validated this mechanism by implementing a heteroscedastic KF that used the same strategy, and it also approached state-of-the-art performance while remaining in the explainable domain of standard KFs. However, we also see two downsides to KalmanNet. KalmanNet shares the limited generalization ability of existing deep-learning decoders, and its usage of the KF as an inductive bias limits its performance in the presence of unseen noise distributions. Despite this trade-off, our analysis successfully integrates traditional controls and modern deep-learning approaches to motivate high-performing yet still explainable BMI designs.\",\"PeriodicalId\":72099,\"journal\":{\"name\":\"Advances in neural information processing systems\",\"volume\":\"37 \",\"pages\":\"133975-133998\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11996206/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in neural information processing systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in neural information processing systems","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

患有脑或脊髓相关瘫痪的人通常需要依靠他人完成基本任务，这限制了他们的独立性。一个潜在的解决方案是脑机接口（bmi），通过将大脑活动解码为运动命令，可以让他们自愿控制外部设备（如机械手臂）。在过去的十年中，深度学习解码器在大多数BMI应用中取得了最先进的结果，从语音生成到手指控制。然而，深度学习解码器的“黑箱”特性可能会导致意外行为，从而在现实世界的物理控制场景中引发重大安全问题。在这些应用中，可解释但性能较低的解码器，如卡尔曼滤波器（KF），仍然是标准。在本研究中，我们设计了一个基于KalmanNet的BMI解码器，KalmanNet是KF的扩展，通过循环神经网络增强其运算来计算Kalman增益。这导致在输入和动态之间产生不同的“信任”。我们用这个算法从两只猴子的大脑活动中预测手指的运动。我们将KalmanNet结果离线（预记录数据，n = 13天）和在线（实时预测，n = 5天）与简单的KF和两种最新的深度学习算法：tcFNN（非refit版本）和LSTM进行了比较。KalmanNet在离线和在线模式下取得了与其他深度学习模型相当或更好的结果，依赖于动态模型来停止，而更多地依赖于神经输入来启动运动。我们通过实现使用相同策略的异方差KF进一步验证了这一机制，它也接近了最先进的性能，同时保持在标准KF的可解释范围内。然而，我们也看到了KalmanNet的两个缺点。KalmanNet具有现有深度学习解码器有限的泛化能力，并且它将KF作为归纳偏压的使用限制了它在存在看不见的噪声分布时的性能。尽管存在这种权衡，但我们的分析成功地整合了传统控制和现代深度学习方法，以激发高性能但仍可解释的BMI设计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

本刊更多论文

Exploring the trade-off between deep-learning and explainable models for brain-machine interfaces.

People with brain or spinal cord-related paralysis often need to rely on others for basic tasks, limiting their independence. A potential solution is brain-machine interfaces (BMIs), which could allow them to voluntarily control external devices (e.g., robotic arm) by decoding brain activity to movement commands. In the past decade, deep-learning decoders have achieved state-of-the-art results in most BMI applications, ranging from speech production to finger control. However, the 'black-box' nature of deep-learning decoders could lead to unexpected behaviors, resulting in major safety concerns in real-world physical control scenarios. In these applications, explainable but lower-performing decoders, such as the Kalman filter (KF), remain the norm. In this study, we designed a BMI decoder based on KalmanNet, an extension of the KF that augments its operation with recurrent neural networks to compute the Kalman gain. This results in a varying "trust" that shifts between inputs and dynamics. We used this algorithm to predict finger movements from the brain activity of two monkeys. We compared KalmanNet results offline (pre-recorded data, $n = 13$ days) and online (real-time predictions, $n = 5$ days) with a simple KF and two recent deep-learning algorithms: tcFNN (non-ReFIT version) and LSTM. KalmanNet achieved comparable or better results than other deep learning models in offline and online modes, relying on the dynamical model for stopping while depending more on neural inputs for initiating movements. We further validated this mechanism by implementing a heteroscedastic KF that used the same strategy, and it also approached state-of-the-art performance while remaining in the explainable domain of standard KFs. However, we also see two downsides to KalmanNet. KalmanNet shares the limited generalization ability of existing deep-learning decoders, and its usage of the KF as an inductive bias limits its performance in the presence of unseen noise distributions. Despite this trade-off, our analysis successfully integrates traditional controls and modern deep-learning approaches to motivate high-performing yet still explainable BMI designs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advances in neural information processing systems

自引率

0.00%

发文量