生物通路中情境依赖性冲突信息的系统识别

Seyeol Yoon, J. Jung, Hasun Yu, Mijin Kwon, Sungji Choo, Kyunghyun Park, Dongjin Jang, Sangwoo Kim, Doheon Lee
{"title":"生物通路中情境依赖性冲突信息的系统识别","authors":"Seyeol Yoon, J. Jung, Hasun Yu, Mijin Kwon, Sungji Choo, Kyunghyun Park, Dongjin Jang, Sangwoo Kim, Doheon Lee","doi":"10.1145/2665970.2665973","DOIUrl":null,"url":null,"abstract":"Interactions between biological entities such as genes, proteins and metabolites, so called pathways, are key features to understand molecular mechanisms of life. As pathway information is being accumulated rapidly through various knowledge resources, there are growing interests in maintaining integrity of the heterogeneous databases. Here, we defined conflict as a status where two contradictory evidences (i.e. 'A increases B' and 'A decreases B') coexist in a same pathway. This conflict damages unity so that inference of simulation on the integrated pathway network might be unreliable. We defined rule and rule group. A rule consists of interaction of two entities, meta-relation (increase or decrease), and contexts terms about tissue specificity or environmental conditions. The rules, which have the same interaction, are grouped into a rule group. If the rules don't have unanimous meta-relation, the rule group and the rules are judged as being conflicting. This analysis revealed that almost 20% of known interactions suffer from conflicting information and conflicting information occurred much more frequently in the literatures than the public database. With consideration for dual functions depending on context, we thought it might resolve conflict to consider context. We grouped rules, which have the same context terms as well as interaction. It's revealed that up to 86% of the conflicts could be resolved by considering context. Subsequent analysis also showed that those contradictory records generally compete each other closely, but some information might be suspicious when their evidence levels are seriously imbalanced. By identifying and resolving the conflicts, we expect that pathway databases can be cleaned and used for better secondary analyses such as gene/protein annotation, network dynamics and qualitative/quantitative simulation.","PeriodicalId":143937,"journal":{"name":"Data and Text Mining in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Systematic Identification of Context-dependent Conflicting Information in Biological Pathways\",\"authors\":\"Seyeol Yoon, J. Jung, Hasun Yu, Mijin Kwon, Sungji Choo, Kyunghyun Park, Dongjin Jang, Sangwoo Kim, Doheon Lee\",\"doi\":\"10.1145/2665970.2665973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Interactions between biological entities such as genes, proteins and metabolites, so called pathways, are key features to understand molecular mechanisms of life. As pathway information is being accumulated rapidly through various knowledge resources, there are growing interests in maintaining integrity of the heterogeneous databases. Here, we defined conflict as a status where two contradictory evidences (i.e. 'A increases B' and 'A decreases B') coexist in a same pathway. This conflict damages unity so that inference of simulation on the integrated pathway network might be unreliable. We defined rule and rule group. A rule consists of interaction of two entities, meta-relation (increase or decrease), and contexts terms about tissue specificity or environmental conditions. The rules, which have the same interaction, are grouped into a rule group. If the rules don't have unanimous meta-relation, the rule group and the rules are judged as being conflicting. This analysis revealed that almost 20% of known interactions suffer from conflicting information and conflicting information occurred much more frequently in the literatures than the public database. With consideration for dual functions depending on context, we thought it might resolve conflict to consider context. We grouped rules, which have the same context terms as well as interaction. It's revealed that up to 86% of the conflicts could be resolved by considering context. Subsequent analysis also showed that those contradictory records generally compete each other closely, but some information might be suspicious when their evidence levels are seriously imbalanced. By identifying and resolving the conflicts, we expect that pathway databases can be cleaned and used for better secondary analyses such as gene/protein annotation, network dynamics and qualitative/quantitative simulation.\",\"PeriodicalId\":143937,\"journal\":{\"name\":\"Data and Text Mining in Bioinformatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data and Text Mining in Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2665970.2665973\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and Text Mining in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2665970.2665973","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

生物实体(如基因、蛋白质和代谢物)之间的相互作用,即所谓的途径,是理解生命分子机制的关键特征。随着途径信息通过各种知识资源快速积累,维护异构数据库的完整性日益受到关注。在这里,我们将冲突定义为两个相互矛盾的证据(即:“A增加B”和“A减少B”)在同一路径上共存。这种冲突破坏了统一性,使得综合路径网络仿真的推理不可靠。我们定义了规则和规则组。规则由两个实体的相互作用、元关系(增加或减少)以及关于组织特异性或环境条件的上下文术语组成。具有相同交互的规则被分组到规则组中。如果规则没有一致的元关系,则判断规则组和规则是冲突的。这一分析表明,近20%的已知相互作用存在信息冲突,而文献中的信息冲突比公共数据库中的信息冲突要频繁得多。考虑到上下文的双重功能,我们认为考虑上下文可能会解决冲突。我们对具有相同上下文术语和交互的规则进行分组。据透露,高达86%的冲突可以通过考虑上下文来解决。随后的分析还表明,这些相互矛盾的记录通常相互密切竞争,但当证据水平严重不平衡时,一些信息可能是可疑的。通过识别和解决冲突,我们期望通路数据库可以被清理,并用于更好的二次分析,如基因/蛋白质注释,网络动力学和定性/定量模拟。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Systematic Identification of Context-dependent Conflicting Information in Biological Pathways
Interactions between biological entities such as genes, proteins and metabolites, so called pathways, are key features to understand molecular mechanisms of life. As pathway information is being accumulated rapidly through various knowledge resources, there are growing interests in maintaining integrity of the heterogeneous databases. Here, we defined conflict as a status where two contradictory evidences (i.e. 'A increases B' and 'A decreases B') coexist in a same pathway. This conflict damages unity so that inference of simulation on the integrated pathway network might be unreliable. We defined rule and rule group. A rule consists of interaction of two entities, meta-relation (increase or decrease), and contexts terms about tissue specificity or environmental conditions. The rules, which have the same interaction, are grouped into a rule group. If the rules don't have unanimous meta-relation, the rule group and the rules are judged as being conflicting. This analysis revealed that almost 20% of known interactions suffer from conflicting information and conflicting information occurred much more frequently in the literatures than the public database. With consideration for dual functions depending on context, we thought it might resolve conflict to consider context. We grouped rules, which have the same context terms as well as interaction. It's revealed that up to 86% of the conflicts could be resolved by considering context. Subsequent analysis also showed that those contradictory records generally compete each other closely, but some information might be suspicious when their evidence levels are seriously imbalanced. By identifying and resolving the conflicts, we expect that pathway databases can be cleaned and used for better secondary analyses such as gene/protein annotation, network dynamics and qualitative/quantitative simulation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信