A symbolic computing approach to evidence code mapping for biological data integration and subjective analysis for reference associations for metabolic pathways
S. Kher, Jianling Peng, E. SyrkinWurtele, J. Dickerson
{"title":"A symbolic computing approach to evidence code mapping for biological data integration and subjective analysis for reference associations for metabolic pathways","authors":"S. Kher, Jianling Peng, E. SyrkinWurtele, J. Dickerson","doi":"10.1109/NAFIPS.2008.4531321","DOIUrl":null,"url":null,"abstract":"Biological data are scattered across thousands of biological databases and hundreds of scientific journals. Integration among these databases faces numerous challenges including various levels of heterogeneity, limited accessibility, redundancy, and conflicts in the data. The integration process needs both quantitative and qualitative mechanisms to accommodate input metrics such as evidence, context, references, and experimental conditions, which are not uniform across the databases. Evidence codes reflect source reliability and data quality. However, different databases define their own evidence codes. This paper presents a mechanism to qualitatively integrate the evidence codes and the references specified by each database. The methodology is tested using a sample pathway from the BioCyc Tierl, KEGG, and MetNetDB pathway databases. The results are promising and form a concrete basis for data integration.","PeriodicalId":430770,"journal":{"name":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAFIPS.2008.4531321","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Biological data are scattered across thousands of biological databases and hundreds of scientific journals. Integration among these databases faces numerous challenges including various levels of heterogeneity, limited accessibility, redundancy, and conflicts in the data. The integration process needs both quantitative and qualitative mechanisms to accommodate input metrics such as evidence, context, references, and experimental conditions, which are not uniform across the databases. Evidence codes reflect source reliability and data quality. However, different databases define their own evidence codes. This paper presents a mechanism to qualitatively integrate the evidence codes and the references specified by each database. The methodology is tested using a sample pathway from the BioCyc Tierl, KEGG, and MetNetDB pathway databases. The results are promising and form a concrete basis for data integration.