A Study of Non-Linear Manifold Feature Extraction in Spike Sorting.

IF 3.1 4区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Neuroinformatics Pub Date : 2025-10-02 DOI:10.1007/s12021-025-09744-3

Eugen-Richard Ardelean, Raluca Portase

{"title":"A Study of Non-Linear Manifold Feature Extraction in Spike Sorting.","authors":"Eugen-Richard Ardelean, Raluca Portase","doi":"10.1007/s12021-025-09744-3","DOIUrl":null,"url":null,"abstract":"<p><p>With recent developments in recording hardware, the processing of neuronal data must keep up with the increasing volumes and complexity by capturing the intrinsic relationships between instances of neuronal activity while remaining invariant to noise. Here, we explore a suite of non-linear manifold feature extraction methods - including PHATE, t-SNE, UMAP, TriMap - in an attempt to identify the most adequate method for automated spike sorting. Spike sorting is the process of clustering instances of neuronal activity, called spikes, based on similarity. By embedding high-dimensional spike shapes into low-dimensional manifolds that preserve local and global structure, we demonstrate more separable and robust clusters than those obtained via traditional feature extraction methods, such as PCA. We evaluated all feature extraction methods analyzed on 95 single-channel synthetic datasets and 2 single-channel real datasets spanning a range of cluster counts. Quantitative evaluation using clustering performance metrics (such as Adjusted Rand Index, Silhouette Score, etc.) indicates that several manifold feature extractions outperform other feature extraction methods. Our results suggest that the embeddings obtained by non-linear manifold approaches can offer a powerful, high-precision option in the spike sorting of the next-generation of electrophysiological recordings. While this study focuses on single-channel data and a subset of manifold learning techniques, a baseline has been established, and future avenues of research have been opened through this work. Future work may extend these insights to multi-channel settings, such as high-density probes and incorporate emerging manifold methods, such as hierarchical and multi-view extensions, which could further improve the robustness and accuracy of spike sorting.</p>","PeriodicalId":49761,"journal":{"name":"Neuroinformatics","volume":"23 4","pages":"48"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491110/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroinformatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12021-025-09744-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

With recent developments in recording hardware, the processing of neuronal data must keep up with the increasing volumes and complexity by capturing the intrinsic relationships between instances of neuronal activity while remaining invariant to noise. Here, we explore a suite of non-linear manifold feature extraction methods - including PHATE, t-SNE, UMAP, TriMap - in an attempt to identify the most adequate method for automated spike sorting. Spike sorting is the process of clustering instances of neuronal activity, called spikes, based on similarity. By embedding high-dimensional spike shapes into low-dimensional manifolds that preserve local and global structure, we demonstrate more separable and robust clusters than those obtained via traditional feature extraction methods, such as PCA. We evaluated all feature extraction methods analyzed on 95 single-channel synthetic datasets and 2 single-channel real datasets spanning a range of cluster counts. Quantitative evaluation using clustering performance metrics (such as Adjusted Rand Index, Silhouette Score, etc.) indicates that several manifold feature extractions outperform other feature extraction methods. Our results suggest that the embeddings obtained by non-linear manifold approaches can offer a powerful, high-precision option in the spike sorting of the next-generation of electrophysiological recordings. While this study focuses on single-channel data and a subset of manifold learning techniques, a baseline has been established, and future avenues of research have been opened through this work. Future work may extend these insights to multi-channel settings, such as high-density probes and incorporate emerging manifold methods, such as hierarchical and multi-view extensions, which could further improve the robustness and accuracy of spike sorting.

查看原文本刊更多论文

尖峰分类中非线性流形特征提取的研究。

随着近年来记录硬件的发展，神经元数据的处理必须通过捕捉神经元活动实例之间的内在关系来跟上不断增长的体积和复杂性，同时保持对噪声的不变性。在这里，我们探索了一套非线性流形特征提取方法-包括PHATE， t-SNE， UMAP， TriMap -试图确定最适合自动尖峰分类的方法。尖峰排序是基于相似性对神经元活动实例（称为尖峰）进行聚类的过程。通过将高维尖峰形状嵌入到低维流形中，保持局部和全局结构，我们展示了比传统特征提取方法（如PCA）获得的聚类更具可分离性和鲁棒性。我们在95个单通道合成数据集和2个单通道真实数据集上评估了所有特征提取方法。使用聚类性能指标（如Adjusted Rand Index， Silhouette Score等）进行定量评估表明，几种流形特征提取优于其他特征提取方法。我们的研究结果表明，通过非线性流形方法获得的嵌入可以为下一代电生理记录的尖峰排序提供强大的、高精度的选择。虽然本研究侧重于单通道数据和多种学习技术的子集，但已经建立了基线，并通过这项工作开辟了未来的研究途径。未来的工作可能会将这些见解扩展到多通道设置，如高密度探针，并结合新兴的多种方法，如分层和多视图扩展，这可以进一步提高尖峰排序的鲁棒性和准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neuroinformatics 医学-计算机：跨学科应用

CiteScore

6.00

自引率

6.70%

发文量

审稿时长

3 months

期刊介绍： Neuroinformatics publishes original articles and reviews with an emphasis on data structure and software tools related to analysis, modeling, integration, and sharing in all areas of neuroscience research. The editors particularly invite contributions on: (1) Theory and methodology, including discussions on ontologies, modeling approaches, database design, and meta-analyses; (2) Descriptions of developed databases and software tools, and of the methods for their distribution; (3) Relevant experimental results, such as reports accompanie by the release of massive data sets; (4) Computational simulations of models integrating and organizing complex data; and (5) Neuroengineering approaches, including hardware, robotics, and information theory studies.