{"title":"Extraction of latent concepts from an integrated human gene database: Non-negative matrix factorization for identification of hidden data structure","authors":"K. Murakami","doi":"10.1109/SOCPAR.2015.7492771","DOIUrl":null,"url":null,"abstract":"Information in genetic databases often describes complex concepts, such as diseases and gene functions having implicit relationships. However, such information is presented as independent concepts (for example, “genes” and “function”), making it difficult for the user, even specialists, to understand their meaning in relation to one another. This facilitates the need for extraction of hidden relationships among biological concepts, and for the addition of this information to databases. Therefore, we factorized a gene data matrix and extracted hidden relationships among both genes and their functional terms. We successfully identified composite concepts explained by plural genes and plural terms. This re-organization provides new insights for researchers and is helpful for interpretation of information.","PeriodicalId":409493,"journal":{"name":"2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR)","volume":"192 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SOCPAR.2015.7492771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Information in genetic databases often describes complex concepts, such as diseases and gene functions having implicit relationships. However, such information is presented as independent concepts (for example, “genes” and “function”), making it difficult for the user, even specialists, to understand their meaning in relation to one another. This facilitates the need for extraction of hidden relationships among biological concepts, and for the addition of this information to databases. Therefore, we factorized a gene data matrix and extracted hidden relationships among both genes and their functional terms. We successfully identified composite concepts explained by plural genes and plural terms. This re-organization provides new insights for researchers and is helpful for interpretation of information.