{"title":"推断时间序列微阵列之间的相似性:基于内容的方法","authors":"Duygu Dede Sener, H. Oğul","doi":"10.1109/CYBConf.2015.7175932","DOIUrl":null,"url":null,"abstract":"Public repositories for gene expression studies have been growing rapidly in the last decade. Retrieval of gene expression experiments based on textual descriptions does not provide sufficient data for biologists and clinicians. Content-based search has recently become more desirable in retrieving similar experiments. Current methods for content-based retrieval cannot address the problem of profiling the gene behaviors in multiple measurement points, i.e. in time course. This study, to the best of our knowledge, is the first attempt to build a fingerprint for each gene by considering all time points to infer its time-course profile to represent the experiment content in an information retrieval framework. An empirical study is performed on a large dataset of Arabidopsis microarrays from Gene Expression Omnibus (GEO). Experimental results show that relevant experiments are retrieved based on content similarity.","PeriodicalId":177233,"journal":{"name":"2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inferring similarity between time-series microarrays: A content-based approach\",\"authors\":\"Duygu Dede Sener, H. Oğul\",\"doi\":\"10.1109/CYBConf.2015.7175932\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Public repositories for gene expression studies have been growing rapidly in the last decade. Retrieval of gene expression experiments based on textual descriptions does not provide sufficient data for biologists and clinicians. Content-based search has recently become more desirable in retrieving similar experiments. Current methods for content-based retrieval cannot address the problem of profiling the gene behaviors in multiple measurement points, i.e. in time course. This study, to the best of our knowledge, is the first attempt to build a fingerprint for each gene by considering all time points to infer its time-course profile to represent the experiment content in an information retrieval framework. An empirical study is performed on a large dataset of Arabidopsis microarrays from Gene Expression Omnibus (GEO). Experimental results show that relevant experiments are retrieved based on content similarity.\",\"PeriodicalId\":177233,\"journal\":{\"name\":\"2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CYBConf.2015.7175932\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBConf.2015.7175932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
基因表达研究的公共资源库在过去十年中发展迅速。基于文本描述的基因表达实验检索不能为生物学家和临床医生提供足够的数据。最近,基于内容的搜索在检索类似实验时变得更加可取。现有的基于内容的检索方法无法解决在多个测量点(即在时间过程中)分析基因行为的问题。据我们所知,这项研究是第一次尝试通过考虑所有时间点来推断每个基因的时间轨迹,从而在信息检索框架中表示实验内容,从而构建每个基因的指纹。本文对来自Gene Expression Omnibus (GEO)的大型拟南芥微阵列数据集进行了实证研究。实验结果表明,基于内容相似度的方法可以检索到相关实验。
Inferring similarity between time-series microarrays: A content-based approach
Public repositories for gene expression studies have been growing rapidly in the last decade. Retrieval of gene expression experiments based on textual descriptions does not provide sufficient data for biologists and clinicians. Content-based search has recently become more desirable in retrieving similar experiments. Current methods for content-based retrieval cannot address the problem of profiling the gene behaviors in multiple measurement points, i.e. in time course. This study, to the best of our knowledge, is the first attempt to build a fingerprint for each gene by considering all time points to infer its time-course profile to represent the experiment content in an information retrieval framework. An empirical study is performed on a large dataset of Arabidopsis microarrays from Gene Expression Omnibus (GEO). Experimental results show that relevant experiments are retrieved based on content similarity.