{"title":"A Study of Non-Linear Manifold Feature Extraction in Spike Sorting.","authors":"Eugen-Richard Ardelean, Raluca Portase","doi":"10.1007/s12021-025-09744-3","DOIUrl":null,"url":null,"abstract":"<p><p>With recent developments in recording hardware, the processing of neuronal data must keep up with the increasing volumes and complexity by capturing the intrinsic relationships between instances of neuronal activity while remaining invariant to noise. Here, we explore a suite of non-linear manifold feature extraction methods - including PHATE, t-SNE, UMAP, TriMap - in an attempt to identify the most adequate method for automated spike sorting. Spike sorting is the process of clustering instances of neuronal activity, called spikes, based on similarity. By embedding high-dimensional spike shapes into low-dimensional manifolds that preserve local and global structure, we demonstrate more separable and robust clusters than those obtained via traditional feature extraction methods, such as PCA. We evaluated all feature extraction methods analyzed on 95 single-channel synthetic datasets and 2 single-channel real datasets spanning a range of cluster counts. Quantitative evaluation using clustering performance metrics (such as Adjusted Rand Index, Silhouette Score, etc.) indicates that several manifold feature extractions outperform other feature extraction methods. Our results suggest that the embeddings obtained by non-linear manifold approaches can offer a powerful, high-precision option in the spike sorting of the next-generation of electrophysiological recordings. While this study focuses on single-channel data and a subset of manifold learning techniques, a baseline has been established, and future avenues of research have been opened through this work. Future work may extend these insights to multi-channel settings, such as high-density probes and incorporate emerging manifold methods, such as hierarchical and multi-view extensions, which could further improve the robustness and accuracy of spike sorting.</p>","PeriodicalId":49761,"journal":{"name":"Neuroinformatics","volume":"23 4","pages":"48"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491110/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroinformatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12021-025-09744-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
With recent developments in recording hardware, the processing of neuronal data must keep up with the increasing volumes and complexity by capturing the intrinsic relationships between instances of neuronal activity while remaining invariant to noise. Here, we explore a suite of non-linear manifold feature extraction methods - including PHATE, t-SNE, UMAP, TriMap - in an attempt to identify the most adequate method for automated spike sorting. Spike sorting is the process of clustering instances of neuronal activity, called spikes, based on similarity. By embedding high-dimensional spike shapes into low-dimensional manifolds that preserve local and global structure, we demonstrate more separable and robust clusters than those obtained via traditional feature extraction methods, such as PCA. We evaluated all feature extraction methods analyzed on 95 single-channel synthetic datasets and 2 single-channel real datasets spanning a range of cluster counts. Quantitative evaluation using clustering performance metrics (such as Adjusted Rand Index, Silhouette Score, etc.) indicates that several manifold feature extractions outperform other feature extraction methods. Our results suggest that the embeddings obtained by non-linear manifold approaches can offer a powerful, high-precision option in the spike sorting of the next-generation of electrophysiological recordings. While this study focuses on single-channel data and a subset of manifold learning techniques, a baseline has been established, and future avenues of research have been opened through this work. Future work may extend these insights to multi-channel settings, such as high-density probes and incorporate emerging manifold methods, such as hierarchical and multi-view extensions, which could further improve the robustness and accuracy of spike sorting.
期刊介绍:
Neuroinformatics publishes original articles and reviews with an emphasis on data structure and software tools related to analysis, modeling, integration, and sharing in all areas of neuroscience research. The editors particularly invite contributions on: (1) Theory and methodology, including discussions on ontologies, modeling approaches, database design, and meta-analyses; (2) Descriptions of developed databases and software tools, and of the methods for their distribution; (3) Relevant experimental results, such as reports accompanie by the release of massive data sets; (4) Computational simulations of models integrating and organizing complex data; and (5) Neuroengineering approaches, including hardware, robotics, and information theory studies.