From data to decision: Scaling artificial intelligence with informatics for epilepsy management

Impact factor 7.9 · CAS Tier 1 (Medicine) · JCR Q1, Medicine, Research & Experimental
Nishant Sinha, Alfredo Lucas, Kathryn Adamiak Davis
Clinical and Translational Medicine, volume 14, issue 12. Published 13 December 2024. DOI: 10.1002/ctm2.70108

Abstract

The integration of artificial intelligence (AI) into epilepsy research presents a critical opportunity to revolutionize the management of this complex neurological disorder.1 Despite significant advancements in developing AI algorithms to diagnose and manage epilepsy, their translation into clinical practice remains limited. This gap underscores the urgent need for scalable AI and neuroinformatics approaches that can bridge the divide between research and real-world application.2 The ability to generalize AI models from controlled research environments to diverse clinical settings is crucial. Current efforts have made substantial progress, but they also reveal common pitfalls, such as overestimation of model performance due to data leakage and the challenges of small sample sizes, which hinder the generalization of these models.

To address these challenges and fully realize the potential of AI in epilepsy care, a robust framework for data sharing and collaboration across research centres is essential. Cloud-based informatics platforms offer a promising solution by enabling the aggregation and harmonization of large, multisite datasets. These platforms can facilitate the development of AI models that are not only powerful but also scalable and generalizable across different patient populations and clinical scenarios. In this commentary, we will explore the common methodological errors that lead to overly optimistic AI models in epilepsy research and propose strategies to overcome these issues. We will also discuss the importance of collaborative data sharing in building robust, clinically relevant AI tools and highlight the role of advanced neuroinformatics infrastructures in supporting the translational pathway from research to clinical practice (Figure 1).

The promise of AI in epilepsy research is often hampered by methodological errors that lead to overly optimistic performance metrics. One of the most significant issues is data leakage, which occurs when information from outside the training dataset influences the model, resulting in an overestimation of its predictive power. This can happen when features are derived from the entire dataset rather than just the training subset.3 To mitigate this, strict separation between training and test datasets is essential, and feature selection must be performed independently within each fold of the cross-validation process. Nested cross-validation, where model selection and performance estimation are conducted separately, further reduces the risk of data leakage.
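The contrast between leaky and leak-free feature selection can be sketched with scikit-learn; synthetic tabular data stands in for real epilepsy features, and the dataset sizes and parameters are illustrative only. The key point is that a `Pipeline` refits the feature selector inside each training fold, so held-out samples never influence which features are chosen.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline

# Synthetic stand-in for a small, high-dimensional epilepsy dataset.
X, y = make_classification(n_samples=200, n_features=500,
                           n_informative=10, random_state=0)

# WRONG: selecting features on the full dataset leaks test-fold
# information into training and inflates cross-validated scores.
leaky_features = SelectKBest(f_classif, k=20).fit_transform(X, y)
leaky_score = cross_val_score(LogisticRegression(max_iter=1000),
                              leaky_features, y, cv=5).mean()

# RIGHT: the Pipeline refits the selector within each training fold,
# so the held-out fold never influences feature selection.
pipe = Pipeline([("select", SelectKBest(f_classif, k=20)),
                 ("clf", LogisticRegression(max_iter=1000))])
clean_score = cross_val_score(pipe, X, y, cv=5).mean()

print(f"leaky CV accuracy:     {leaky_score:.2f}")
print(f"leak-free CV accuracy: {clean_score:.2f}")
```

With mostly uninformative features, the leaky estimate is typically noticeably higher, illustrating exactly the overestimation the text describes.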

Another common error is the improper application of cross-validation techniques. Often, researchers perform feature selection or hyperparameter tuning on the entire dataset before cross-validation, leading to inflated performance metrics. The correct approach is to embed these steps within each fold of the cross-validation process to ensure that the test data remain completely unseen until the final evaluation. This practice helps prevent overfitting and provides a more accurate estimate of how the model will perform on new data.
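Nesting the tuning loop inside the evaluation loop is straightforward in scikit-learn: a `GridSearchCV` (inner loop, model selection) is passed as the estimator to `cross_val_score` (outer loop, performance estimation). The model, grid, and data below are illustrative placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.svm import SVC

# Synthetic stand-in for a modest clinical dataset.
X, y = make_classification(n_samples=150, n_features=50,
                           n_informative=8, random_state=0)

# Inner loop: hyperparameter tuning, refit independently per outer fold.
inner = GridSearchCV(SVC(), param_grid={"C": [0.1, 1, 10]}, cv=3)

# Outer loop: performance estimated on data the tuning never saw.
nested_scores = cross_val_score(inner, X, y, cv=5)
print(f"nested CV accuracy: {nested_scores.mean():.2f} "
      f"(+/- {nested_scores.std():.2f})")
```

Because each outer test fold is untouched by the inner grid search, the resulting estimate reflects how the full modelling procedure, tuning included, would perform on new data.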

Small sample size presents a third challenge, particularly in epilepsy research, where datasets are often of modest size and heterogeneous. Small datasets can lead to overfitting, where the model learns patterns specific to the training data but fails to generalize to new data. Addressing this requires both methodological rigour and collaborative efforts to pool data across multiple sites, thereby creating larger, more diverse datasets. Data augmentation techniques, such as generating synthetic data, can also help increase the effective size of the training set.
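As a minimal sketch of augmentation for small tabular datasets, the helper below enlarges a training set with Gaussian-jittered copies of each sample. This is a deliberately simple stand-in for the more sophisticated synthetic-data methods the text alludes to (e.g. SMOTE or generative models); the function name, noise scale, and data shapes are all assumptions for illustration.

```python
import numpy as np

def augment_with_jitter(X, y, n_copies=2, noise_scale=0.05, seed=0):
    """Enlarge a small training set with Gaussian-jittered copies.

    noise_scale is relative to each feature's standard deviation, so
    features on different scales receive proportionate perturbations.
    """
    rng = np.random.default_rng(seed)
    sigma = X.std(axis=0, keepdims=True) * noise_scale
    copies = [X + rng.normal(0.0, sigma, size=X.shape)
              for _ in range(n_copies)]
    X_aug = np.vstack([X, *copies])
    y_aug = np.concatenate([y] * (n_copies + 1))
    return X_aug, y_aug

# Toy example: 40 subjects, 12 features, binary labels.
X = np.random.default_rng(1).normal(size=(40, 12))
y = np.random.default_rng(2).integers(0, 2, size=40)
X_aug, y_aug = augment_with_jitter(X, y)
print(X_aug.shape, y_aug.shape)  # (120, 12) (120,)
```

Augmentation of this kind should be applied only to training folds, never to held-out test data, or it reintroduces the leakage problem described above.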

The development of robust AI models in epilepsy is further strengthened by collaborative data sharing, which allows researchers to pool datasets from multiple sources, increasing both the size and diversity of the data available for training. Epilepsy is a highly heterogeneous disorder, and individual research centres often have access to only modest-size cohorts. By aggregating data across different sites, researchers can develop AI tools that are more representative of broad clinical reality, improving generalizability and reliability across diverse clinical settings.
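Once data are pooled across centres, cross-site generalization can be measured directly with leave-one-site-out validation: train on all centres but one, test on the held-out centre. The sketch below assumes three hypothetical sites with synthetic data; site labels and sizes are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneGroupOut, cross_val_score

# Pooled data from three hypothetical centres; `site` labels each sample.
X, y = make_classification(n_samples=300, n_features=30, random_state=0)
site = np.repeat([0, 1, 2], 100)

# Leave-one-site-out: train on two centres, test on the held-out one,
# which directly probes generalization across clinical settings.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y,
                         groups=site, cv=LeaveOneGroupOut())
print(dict(zip(["site0", "site1", "site2"], scores.round(2))))
```

A large gap between within-site and cross-site scores is a warning sign that a model has learned site-specific artifacts (scanner, protocol, population) rather than disease-relevant patterns.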

Collaborative data sharing also enables the replication of studies, which is critical for validating AI models across different cohorts to ensure that the models are both accurate and reproducible. Such collaboration fosters the sharing of expertise and resources, allowing researchers to tackle complex challenges, such as integrating multimodal data—neuroimaging, electrophysiology and clinical records—into more sophisticated AI models.

To support effective data sharing and utilization across multiple sites, advanced neuroinformatics infrastructures are indispensable. Platforms like EBRAINS, Pennsieve (https://app.pennsieve.io/) and OpenNeuro, among others, provide the technological foundation needed to securely aggregate, manage and analyze large-scale epilepsy datasets.4, 5 These platforms enable researchers to apply standardized methods and tools across different datasets to ensure the rigour, robustness and reproducibility of AI models.

Neuroinformatics platforms also adhere to the FAIR principles of making data findable, accessible, interoperable and reusable, which is crucial for effective data sharing.6 By facilitating data harmonization and integration, these platforms ensure that data from multiple sources can be combined and analyzed consistently.7 Furthermore, neuroinformatics infrastructures support collaborative analysis by allowing researchers to share not just data, but also the algorithms and models developed from those data. For example, researchers could share their electrode localization outputs generated from a standardized pipeline,8 together with their intracranial electroencephalography recordings, and the deep learning model trained for seizure detection. Alternatively, researchers might share only their data,9 and the preprocessing and model building could all happen within these infrastructures.10 This fosters an open science environment where AI models can be tested and refined across different datasets to accelerate the development of clinically applicable tools.

In summary, the advancement of AI in epilepsy research depends on both methodological rigour and collaborative efforts. By addressing common errors in AI model development and leveraging the power of collaborative data sharing, we can build robust, clinically relevant tools. Neuroinformatics infrastructures provide the necessary support for these endeavours to ensure that AI models are not only powerful but also applicable in real-world clinical settings. These combined strategies are essential to translate AI research into tangible improvements in epilepsy care, ultimately leading to better patient outcomes.

Conceptualization: Nishant Sinha, Alfredo Lucas, Kathryn Adamiak Davis. Writing—original draft preparation and revision for intellectual content: Nishant Sinha, Alfredo Lucas, Kathryn Adamiak Davis.

The authors declare no conflict of interest.


Source journal metrics: CiteScore 15.90 · Self-citation rate 1.90% · Articles per year: 450 · Review time: 4 weeks
About the journal: Clinical and Translational Medicine (CTM) is an international, peer-reviewed, open-access journal dedicated to accelerating the translation of preclinical research into clinical applications and fostering communication between basic and clinical scientists. It highlights the clinical potential and application of various fields including biotechnologies, biomaterials, bioengineering, biomarkers, molecular medicine, omics science, bioinformatics, immunology, molecular imaging, drug discovery, regulation, and health policy. With a focus on the bench-to-bedside approach, CTM prioritizes studies and clinical observations that generate hypotheses relevant to patients and diseases, guiding investigations in cellular and molecular medicine. The journal encourages submissions from clinicians, researchers, policymakers, and industry professionals.