基于矩阵化和特征选择的动物行为张量分析。

IF 6.3 2区 医学 Q1 BIOLOGY
Beichen Wang , Jiazhang Cai , Luyang Fang , Motokazu Tsujikawa , Ping Ma , Yuk Fai Leung
{"title":"基于矩阵化和特征选择的动物行为张量分析。","authors":"Beichen Wang ,&nbsp;Jiazhang Cai ,&nbsp;Luyang Fang ,&nbsp;Motokazu Tsujikawa ,&nbsp;Ping Ma ,&nbsp;Yuk Fai Leung","doi":"10.1016/j.compbiomed.2025.110959","DOIUrl":null,"url":null,"abstract":"<div><div>Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"197 ","pages":"Article 110959"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tensor analysis of animal behavior by matricization and feature selection\",\"authors\":\"Beichen Wang ,&nbsp;Jiazhang Cai ,&nbsp;Luyang Fang ,&nbsp;Motokazu Tsujikawa ,&nbsp;Ping Ma ,&nbsp;Yuk Fai Leung\",\"doi\":\"10.1016/j.compbiomed.2025.110959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"197 \",\"pages\":\"Article 110959\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525013113\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525013113","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

当代神经行为研究经常收集多维张量(MDT)数据,其中包含多个动物受到各种扰动的多个特征的时间序列测量。对MDT数据的适当分析可以揭示驱动这种行为的神经回路。然而,许多MDT分析,如张量分解,可能无法产生易于解释的结果,也无法直接兼容为二维张量(2DT)结构设计的标准多变量分析。为了解决这个问题,通过降维技术将MDT数据转换为2DT,包括索引构建和特征拼接等矩阵化方法。然而,矩阵化可能会排除关键信息或引入虚假噪声到多变量分析中,因此它对多变量分析的影响仍然难以捉摸。在这里,我们展示了不同的矩阵化方法和特征选择方法。我们使用从野生型和视障突变体中收集的斑马鱼视觉-运动反应MDT数据集来评估它们对多变量分析性能的影响。我们使用各种索引构建和特征连接方法对MDT数据集进行矩阵化,然后使用过滤器和嵌入方法识别信息丰富的2DT特征。为了评估这些特征选择方法,我们应用了几种分类器来区分不同基因型的斑马鱼,并通过交叉验证和保留验证来评估它们的性能。我们发现,大多数分类器使用由特征串联矩阵化并由嵌入方法或联合操作选择的2DT特征时表现最好。结果还揭示了野生型和突变型之间独特的行为差异,但没有通过多变量分析或MDT分析确定。我们的结果证明了通过矩阵化和特征选择来分析MDT行为数据的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Tensor analysis of animal behavior by matricization and feature selection
Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信