Lightweight Transformer exhibits comparable performance to LLMs for Seizure Prediction: A case for light-weight models for EEG data.

Paras Parani, Umair Mohammad, Fahad Saeed
{"title":"Lightweight Transformer exhibits comparable performance to LLMs for Seizure Prediction: A case for light-weight models for EEG data.","authors":"Paras Parani, Umair Mohammad, Fahad Saeed","doi":"10.1109/bigdata62323.2024.10825319","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting seizures ahead of time will have a significant positive clinical impact for people with epilepsy. Advances in machine learning/artificial intelligence (ML/AI) has provided us the tools needed to perform such predictive tasks. To date, advanced deep learning (DL) architectures such as the convolutional neural network (CNN) and long short-term memory (LSTM) have been used with mixed results. However, highly connected activity exhibited by epileptic seizures necessitates the design of more complex ML techniques which can better capture the complex interconnected neurological processes. Other challenges include the variability of EEG sensor data quality, different epilepsy and seizure profiles, lack of annotated datasets and absence of ML-ready benchmarks. In addition, successful models will need to perform inference in almost real-time using limited hardware compute-capacity. To address these challenges, we propose a lightweight architecture, called <i>ESPFormer</i>, whose novelty lies in the simple and smaller model-size and a lower computational load footprint needed to infer in real-time compared to other works in the literature. To quantify the performance of this lightweight model, we compared its performance with a custom-designed residual neural network (ResNet), a pre-trained vision transformer (ViT) and a pre-trained large-language model (LLM). We tested ESPFormer on MLSPred-Bench which is the largest patient-independent seizure prediction dataset comprising 12 benchmarks. 
Our results demonstrate that ESPFormer provides the best performance in terms of prediction accuracy for 4/12 benchmarks with an average improvement of 2.65% compared to the LLM, 3.35% compared to the ViT and 17.65% compared to the ResNet - and comparable results for other benchmarks. Our results indicate that lightweight transformer architecture may outperform resource-intensive LLM based models for real-time EEG-based seizure predictions.</p>","PeriodicalId":520404,"journal":{"name":"Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data","volume":"2024 ","pages":"4941-4945"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11877310/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/bigdata62323.2024.10825319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Predicting seizures ahead of time would have a significant positive clinical impact for people with epilepsy. Advances in machine learning/artificial intelligence (ML/AI) have provided us with the tools needed to perform such predictive tasks. To date, advanced deep learning (DL) architectures such as the convolutional neural network (CNN) and long short-term memory (LSTM) have been used with mixed results. However, the highly interconnected activity exhibited by epileptic seizures necessitates the design of more complex ML techniques that can better capture the underlying interconnected neurological processes. Other challenges include variability in EEG sensor data quality, differing epilepsy and seizure profiles, a lack of annotated datasets, and the absence of ML-ready benchmarks. In addition, successful models will need to perform inference in near real-time on limited compute hardware. To address these challenges, we propose a lightweight architecture, called ESPFormer, whose novelty lies in its simpler, smaller model size and the lower computational footprint it needs for real-time inference compared to other works in the literature. To quantify the performance of this lightweight model, we compared it against a custom-designed residual neural network (ResNet), a pre-trained vision transformer (ViT), and a pre-trained large language model (LLM). We tested ESPFormer on MLSPred-Bench, the largest patient-independent seizure prediction dataset, comprising 12 benchmarks. Our results demonstrate that ESPFormer provides the best prediction accuracy on 4 of the 12 benchmarks, with an average improvement of 2.65% over the LLM, 3.35% over the ViT, and 17.65% over the ResNet, and comparable results on the remaining benchmarks. Our results indicate that a lightweight transformer architecture may outperform resource-intensive LLM-based models for real-time EEG-based seizure prediction.
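The core design trade-off the abstract describes is a small transformer encoder operating on windowed EEG data instead of a large pre-trained model. The sketch below is a hypothetical illustration of that idea, not the authors' ESPFormer: a single-head encoder block in plain NumPy, with the sequence length, model width, and MLP width chosen arbitrarily for the example.

```python
import numpy as np

# Hypothetical sketch of one lightweight transformer encoder block for
# windowed EEG features. Dimensions (16 time patches, 32-dim embeddings,
# 64-dim MLP) are illustrative assumptions, not values from the paper.

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def encoder_block(x, Wq, Wk, Wv, W1, W2):
    """Self-attention + ReLU MLP, each with a residual connection.
    x has shape (seq_len, d_model); output shape is unchanged."""
    d = x.shape[-1]
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    attn = softmax(q @ k.T / np.sqrt(d)) @ v   # scaled dot-product attention
    x = x + attn                               # residual around attention
    x = x + np.maximum(x @ W1, 0.0) @ W2       # residual around ReLU MLP
    return x

rng = np.random.default_rng(0)
seq_len, d_model, d_ff = 16, 32, 64            # assumed sizes
x = rng.standard_normal((seq_len, d_model))    # stand-in for an EEG window
Wq, Wk, Wv = (rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(3))
W1 = rng.standard_normal((d_model, d_ff)) * 0.1
W2 = rng.standard_normal((d_ff, d_model)) * 0.1

out = encoder_block(x, Wq, Wk, Wv, W1, W2)
params = 3 * d_model**2 + 2 * d_model * d_ff   # per-block parameter count
print(out.shape, params)
```

At these sizes one block holds only a few thousand parameters, which is the kind of footprint that makes near real-time inference on limited hardware plausible, in contrast to an LLM backbone with billions of parameters.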
