Improving Visual Neuroscience Cell Type Classification with Supervised Machine Learning*

2022 Systems and Information Engineering Design Symposium (SIEDS) Pub Date : 2022-04-28 DOI:10.1109/sieds55548.2022.9799330

Jordan Hiatt, D. Howe, Lauren Neal

{"title":"Improving Visual Neuroscience Cell Type Classification with Supervised Machine Learning*","authors":"Jordan Hiatt, D. Howe, Lauren Neal","doi":"10.1109/sieds55548.2022.9799330","DOIUrl":null,"url":null,"abstract":"At present, visual neuroscientists must employ an inefficient, time-intensive process to study the ways in which various types of neurons react to characteristics of a visual stimulus; the standard procedure requires specifying and monitoring a single cell type per individual microscopy recording. This research paper proposes an alternative method: utilize a supervised classification algorithm to distinguish between several cell types – based on the cells’ behavior and response to stimuli – in the context of a single recording. This allows researchers to record multiple cell types at once and, subsequently, classify them by type for further analysis. For this classifier, the neuronal spatial footprints and neuronal temporal activity are extracted from raw microscopy recordings using constrained nonnegative matrix factorization. From these data, neuronal features are engineered for the classifier, which-along with features engineered from the visual stimulus corresponding to the neuronal activity-are used by various models to predict the cell type of the recorded neurons. Several algorithms are tested to compare their classification performance, including random forest classifiers, neural networks, and K-nearest neighbors classifiers. This research concludes that the relationship between stimulus and fluorescent response is a moderate predictor of cell type. We develop a cell type classification model that leverages one-hot encoding and engineering of visual stimulus and fluorescent response features, sliding time/frame windows, and dimensionality reduction to generate inputs in a model to classify multiple neuronal cell types in a single microscopy recording. We originally hypothesized that the K-nearest neighbors and/or neural network implementations would produce the strongest classification performance due to the algorithms’ ability to flexibly fit nonlinear feature spaces. Due to the imbalanced nature of the dataset, with five classes total and one class making up nearly 50% of the data, balanced accuracy is a better indicator of model performance than accuracy. Classifying cells via random chance would yield a balanced accuracy of 20%. Our best cell type classifier, a convolutional neural network optimized for time series classification, gives us an accuracy score of 70.6% and balanced accuracy of 53.7%.","PeriodicalId":286724,"journal":{"name":"2022 Systems and Information Engineering Design Symposium (SIEDS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/sieds55548.2022.9799330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

At present, visual neuroscientists must employ an inefficient, time-intensive process to study the ways in which various types of neurons react to characteristics of a visual stimulus; the standard procedure requires specifying and monitoring a single cell type per individual microscopy recording. This research paper proposes an alternative method: utilize a supervised classification algorithm to distinguish between several cell types – based on the cells’ behavior and response to stimuli – in the context of a single recording. This allows researchers to record multiple cell types at once and, subsequently, classify them by type for further analysis. For this classifier, the neuronal spatial footprints and neuronal temporal activity are extracted from raw microscopy recordings using constrained nonnegative matrix factorization. From these data, neuronal features are engineered for the classifier, which-along with features engineered from the visual stimulus corresponding to the neuronal activity-are used by various models to predict the cell type of the recorded neurons. Several algorithms are tested to compare their classification performance, including random forest classifiers, neural networks, and K-nearest neighbors classifiers. This research concludes that the relationship between stimulus and fluorescent response is a moderate predictor of cell type. We develop a cell type classification model that leverages one-hot encoding and engineering of visual stimulus and fluorescent response features, sliding time/frame windows, and dimensionality reduction to generate inputs in a model to classify multiple neuronal cell types in a single microscopy recording. We originally hypothesized that the K-nearest neighbors and/or neural network implementations would produce the strongest classification performance due to the algorithms’ ability to flexibly fit nonlinear feature spaces. Due to the imbalanced nature of the dataset, with five classes total and one class making up nearly 50% of the data, balanced accuracy is a better indicator of model performance than accuracy. Classifying cells via random chance would yield a balanced accuracy of 20%. Our best cell type classifier, a convolutional neural network optimized for time series classification, gives us an accuracy score of 70.6% and balanced accuracy of 53.7%.

查看原文本刊更多论文

用监督机器学习改进视觉神经科学细胞类型分类*

目前，视觉神经科学家必须采用一种低效且耗时的方法来研究不同类型的神经元对视觉刺激特征的反应方式;标准程序要求指定和监测每个显微镜记录的单个细胞类型。这篇研究论文提出了一种替代方法:在单个记录的背景下，基于细胞的行为和对刺激的反应，利用监督分类算法来区分几种细胞类型。这使得研究人员可以一次记录多种细胞类型，然后根据类型对它们进行分类，以便进一步分析。对于该分类器，使用约束非负矩阵分解从原始显微镜记录中提取神经元的空间足迹和神经元的时间活动。从这些数据中，为分类器设计神经元特征，这些特征与与神经元活动相对应的视觉刺激的特征一起被各种模型用来预测记录的神经元的细胞类型。我们测试了几种算法来比较它们的分类性能，包括随机森林分类器、神经网络和k近邻分类器。本研究认为，刺激和荧光反应之间的关系是细胞类型的适度预测因子。我们开发了一种细胞类型分类模型，该模型利用视觉刺激和荧光响应特征的单热编码和工程、滑动时间/帧窗口和降维来生成模型中的输入，以便在单个显微镜记录中对多种神经元细胞类型进行分类。我们最初假设，由于算法灵活拟合非线性特征空间的能力，k近邻和/或神经网络实现将产生最强的分类性能。由于数据集的不平衡性，总共有5个类，其中一个类占数据的近50%，平衡精度是比精度更好的模型性能指标。通过随机机会对细胞进行分类将产生20%的平衡准确率。我们最好的细胞类型分类器是一种针对时间序列分类进行优化的卷积神经网络，它的准确率为70.6%，平衡准确率为53.7%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 Systems and Information Engineering Design Symposium (SIEDS)

自引率

0.00%

发文量