Beichen Wang , Jiazhang Cai , Luyang Fang , Motokazu Tsujikawa , Ping Ma , Yuk Fai Leung
{"title":"基于矩阵化和特征选择的动物行为张量分析。","authors":"Beichen Wang , Jiazhang Cai , Luyang Fang , Motokazu Tsujikawa , Ping Ma , Yuk Fai Leung","doi":"10.1016/j.compbiomed.2025.110959","DOIUrl":null,"url":null,"abstract":"<div><div>Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"197 ","pages":"Article 110959"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tensor analysis of animal behavior by matricization and feature selection\",\"authors\":\"Beichen Wang , Jiazhang Cai , Luyang Fang , Motokazu Tsujikawa , Ping Ma , Yuk Fai Leung\",\"doi\":\"10.1016/j.compbiomed.2025.110959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"197 \",\"pages\":\"Article 110959\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525013113\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525013113","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
Tensor analysis of animal behavior by matricization and feature selection
Contemporary neurobehavior research often collects multi-dimensional tensor (MDT) data containing time-series measurements for multiple features from multiple animals subjected to various perturbations. Proper analysis of the MDT data can reveal neural circuitries driving the behavior. However, many MDT analyses, such as tensor decomposition, may not yield results that are easy to interpret or directly compatible with standard multivariate analysis designed for 2-dimensional tensor (2DT) structures. To address this issue, the MDT data are transformed into 2DT by dimensionality reduction techniques, including matricization methods such as Index Construction and Feature Concatenation. Nonetheless, matricization may exclude key information or introduce spurious noise to multivariate analysis, so its impact on multivariate analysis remains elusive. Here, we demonstrated different matricization approaches and feature selection methods. We evaluated their impacts on multivariate analysis performance using an MDT dataset of zebrafish visual-motor response collected from wildtypes and visually-impaired mutants. We matricized the MDT dataset using various Index Construction and Feature Concatenation methods, then identified informative 2DT features using the filter and embedded methods. To evaluate these feature-selection methods, we applied several classifiers to distinguish zebrafish of different genotypes and assessed their performances with cross-validation and holdout validation. We found that most classifiers performed the best using 2DT features matricized by Feature Concatenation and selected by the embedded method or union operation. The results also revealed unique behavioral differences between the wildtypes and mutants, but not identified by multivariate analysis or MDT analysis. Our results demonstrate the utility of analyzing MDT behavioral data by matricization and feature selection.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.