基于混合DTW的时间序列数据集成分析方法

V. Boeva, E. Kostadinova
{"title":"基于混合DTW的时间序列数据集成分析方法","authors":"V. Boeva, E. Kostadinova","doi":"10.1109/ICAIS.2009.18","DOIUrl":null,"url":null,"abstract":"Gene expression microarrays are the most commonly available source of high-throughput biological data. Each microarray experiment is supposed to measure the gene expression levels of a set of genes in a number of different experimental conditions or time points. Integration of results from different microarray experiments to the specific analysis is an important and yet challenging problem. Direct integration of microarrays is often ineffective because of the diverse types of experiment specific variations. In this paper, we propose a new hybrid method, which is specially suited for integration analysis of time series expression data across different experiments. The proposed algorithm utilizes Dynamic Time Warping (DTW) distance in order to measure the similarity between time expression profiles. First for each considered time series dataset a quadratic distance matrix that contains the DTW distances calculated between the expression profiles of each gene pair is built. Then using a hybrid aggregation algorithm the obtained DTW distance matrices are transformed into a single matrix, consisting of one overall DTW distance per each gene pair. The values of the resulting matrix can be interpreted as the consensus DTW distances supported by all the experiments. These may be further analyzed and help find the relationship among the genes. The proposed method is validated on gene expression time series data coming from two independent studies examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.","PeriodicalId":161840,"journal":{"name":"2009 International Conference on Adaptive and Intelligent Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A Hybrid DTW Based Method for Integration Analysis of Time Series Data\",\"authors\":\"V. Boeva, E. Kostadinova\",\"doi\":\"10.1109/ICAIS.2009.18\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene expression microarrays are the most commonly available source of high-throughput biological data. Each microarray experiment is supposed to measure the gene expression levels of a set of genes in a number of different experimental conditions or time points. Integration of results from different microarray experiments to the specific analysis is an important and yet challenging problem. Direct integration of microarrays is often ineffective because of the diverse types of experiment specific variations. In this paper, we propose a new hybrid method, which is specially suited for integration analysis of time series expression data across different experiments. The proposed algorithm utilizes Dynamic Time Warping (DTW) distance in order to measure the similarity between time expression profiles. First for each considered time series dataset a quadratic distance matrix that contains the DTW distances calculated between the expression profiles of each gene pair is built. Then using a hybrid aggregation algorithm the obtained DTW distance matrices are transformed into a single matrix, consisting of one overall DTW distance per each gene pair. The values of the resulting matrix can be interpreted as the consensus DTW distances supported by all the experiments. These may be further analyzed and help find the relationship among the genes. The proposed method is validated on gene expression time series data coming from two independent studies examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.\",\"PeriodicalId\":161840,\"journal\":{\"name\":\"2009 International Conference on Adaptive and Intelligent Systems\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Adaptive and Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIS.2009.18\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Adaptive and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIS.2009.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

基因表达微阵列是高通量生物数据最常见的来源。每个微阵列实验都应该测量一组基因在多个不同实验条件或时间点的基因表达水平。将不同微阵列实验的结果整合到具体分析中是一个重要而又具有挑战性的问题。微阵列的直接集成往往是无效的,因为不同类型的实验特定的变化。本文提出了一种新的混合方法,该方法特别适用于不同实验时间序列表达式数据的积分分析。该算法利用动态时间翘曲(Dynamic Time Warping, DTW)距离来度量时间表达轮廓之间的相似性。首先,为每个考虑的时间序列数据集建立一个二次距离矩阵,该矩阵包含每个基因对表达谱之间计算的DTW距离。然后使用混合聚合算法将得到的DTW距离矩阵转化为单个矩阵,每个基因对包含一个总DTW距离。结果矩阵的值可以解释为所有实验支持的一致DTW距离。这些可能会被进一步分析,并有助于发现基因之间的关系。该方法在两项独立研究的基因表达时间序列数据上得到了验证,这些研究检测了分裂酵母Schizosaccharomyces pombe基因表达的整体细胞周期控制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Hybrid DTW Based Method for Integration Analysis of Time Series Data
Gene expression microarrays are the most commonly available source of high-throughput biological data. Each microarray experiment is supposed to measure the gene expression levels of a set of genes in a number of different experimental conditions or time points. Integration of results from different microarray experiments to the specific analysis is an important and yet challenging problem. Direct integration of microarrays is often ineffective because of the diverse types of experiment specific variations. In this paper, we propose a new hybrid method, which is specially suited for integration analysis of time series expression data across different experiments. The proposed algorithm utilizes Dynamic Time Warping (DTW) distance in order to measure the similarity between time expression profiles. First for each considered time series dataset a quadratic distance matrix that contains the DTW distances calculated between the expression profiles of each gene pair is built. Then using a hybrid aggregation algorithm the obtained DTW distance matrices are transformed into a single matrix, consisting of one overall DTW distance per each gene pair. The values of the resulting matrix can be interpreted as the consensus DTW distances supported by all the experiments. These may be further analyzed and help find the relationship among the genes. The proposed method is validated on gene expression time series data coming from two independent studies examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信