RS3药物子结构搜索系统的改进

H. Lee, Hyunsang Kang, Yoonsook Lee, Byungchol Chang, Jaehyuk Cha
{"title":"RS3药物子结构搜索系统的改进","authors":"H. Lee, Hyunsang Kang, Yoonsook Lee, Byungchol Chang, Jaehyuk Cha","doi":"10.1109/ICNIDC.2010.5657973","DOIUrl":null,"url":null,"abstract":"For the development of a new drug, medicinal chemists perform the structure-based drug design, i.e., search the existing drugs with the chemical structure similar to the chemical structure of the target drug. In brief, researcher and developer want to know drugs to have a specific chemical structure. Therefore, from large drug databases, it is necessary to give the information of selected drugs fast, which should have the members of a set of chemical moieties matching a user-defined query moiety. Substructure searching is the process of identifying the members of a set of chemical moieties that match a specific query moiety. Testing for substructure searching was developed in the late 1950s. In graph theoretical terms, this problem corresponds to determining which graphs in a set are subgraph isomorphic to a specified query graph. Testing for subgraph isomorphism has been proved, in the general case, to be an NP-complete problem. For the purpose of overcoming this difficulty, there were computational approaches. On the 1990s, a US patent has been granted on an atom-centered indexing scheme, used by the RS3 system; this has the virtue that the indexes generated can be searched by direct text comparison. This system is commercially usedihttp://www.acelrys.com/rs3). We define the RS3 system's drawback and present a new indexing scheme. This system treats substructure searching with substring matching by means of expressing chemical structure as predefined strings. However, it has insufficient ‘recall’ and ‘precision’ because of a fatal shortage of the indexing scheme. Expressing 2D chemical structure into ID a string has limit. Therefore, we break 2D chemical structure into ID structure fragments. We present in this paper a new index technique to improve recall and precision surprisingly.","PeriodicalId":348778,"journal":{"name":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The improvement of the RS3 system for drug substructure searching\",\"authors\":\"H. Lee, Hyunsang Kang, Yoonsook Lee, Byungchol Chang, Jaehyuk Cha\",\"doi\":\"10.1109/ICNIDC.2010.5657973\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For the development of a new drug, medicinal chemists perform the structure-based drug design, i.e., search the existing drugs with the chemical structure similar to the chemical structure of the target drug. In brief, researcher and developer want to know drugs to have a specific chemical structure. Therefore, from large drug databases, it is necessary to give the information of selected drugs fast, which should have the members of a set of chemical moieties matching a user-defined query moiety. Substructure searching is the process of identifying the members of a set of chemical moieties that match a specific query moiety. Testing for substructure searching was developed in the late 1950s. In graph theoretical terms, this problem corresponds to determining which graphs in a set are subgraph isomorphic to a specified query graph. Testing for subgraph isomorphism has been proved, in the general case, to be an NP-complete problem. For the purpose of overcoming this difficulty, there were computational approaches. On the 1990s, a US patent has been granted on an atom-centered indexing scheme, used by the RS3 system; this has the virtue that the indexes generated can be searched by direct text comparison. This system is commercially usedihttp://www.acelrys.com/rs3). We define the RS3 system's drawback and present a new indexing scheme. This system treats substructure searching with substring matching by means of expressing chemical structure as predefined strings. However, it has insufficient ‘recall’ and ‘precision’ because of a fatal shortage of the indexing scheme. Expressing 2D chemical structure into ID a string has limit. Therefore, we break 2D chemical structure into ID structure fragments. We present in this paper a new index technique to improve recall and precision surprisingly.\",\"PeriodicalId\":348778,\"journal\":{\"name\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNIDC.2010.5657973\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNIDC.2010.5657973","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

为了开发一种新药,药物化学家进行基于结构的药物设计,即寻找与目标药物化学结构相似的现有药物。简而言之,研究人员和开发人员想知道药物具有特定的化学结构。因此,从大型药物数据库中,有必要快速给出所选药物的信息,该信息应具有与用户自定义查询片段匹配的一组化学片段的成员。子结构搜索是识别与特定查询片段匹配的一组化学片段的成员的过程。子结构搜索测试是在20世纪50年代后期发展起来的。在图论术语中,这个问题对应于确定集合中哪些图与指定的查询图同构。在一般情况下,证明了子图同构的检验是一个np完全问题。为了克服这个困难,有了计算方法。上世纪90年代,RS3系统使用的以原子为中心的索引方案获得了一项美国专利;这样做的好处是,生成的索引可以通过直接文本比较进行搜索。这个系统是商业用途的(http://:/www.acelrys.com/rs3)。定义了RS3系统的缺点,提出了一种新的索引方案。该系统通过将化学结构表示为预定义字符串的方式,将子结构搜索与子字符串匹配进行处理。然而,由于索引方案的致命不足,它的“召回”和“精确”不足。将二维化学结构表示为ID字符串是有限制的。因此,我们将二维化学结构分解为ID结构片段。本文提出了一种新的索引技术,可以显著提高查全率和查准率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The improvement of the RS3 system for drug substructure searching
For the development of a new drug, medicinal chemists perform the structure-based drug design, i.e., search the existing drugs with the chemical structure similar to the chemical structure of the target drug. In brief, researcher and developer want to know drugs to have a specific chemical structure. Therefore, from large drug databases, it is necessary to give the information of selected drugs fast, which should have the members of a set of chemical moieties matching a user-defined query moiety. Substructure searching is the process of identifying the members of a set of chemical moieties that match a specific query moiety. Testing for substructure searching was developed in the late 1950s. In graph theoretical terms, this problem corresponds to determining which graphs in a set are subgraph isomorphic to a specified query graph. Testing for subgraph isomorphism has been proved, in the general case, to be an NP-complete problem. For the purpose of overcoming this difficulty, there were computational approaches. On the 1990s, a US patent has been granted on an atom-centered indexing scheme, used by the RS3 system; this has the virtue that the indexes generated can be searched by direct text comparison. This system is commercially usedihttp://www.acelrys.com/rs3). We define the RS3 system's drawback and present a new indexing scheme. This system treats substructure searching with substring matching by means of expressing chemical structure as predefined strings. However, it has insufficient ‘recall’ and ‘precision’ because of a fatal shortage of the indexing scheme. Expressing 2D chemical structure into ID a string has limit. Therefore, we break 2D chemical structure into ID structure fragments. We present in this paper a new index technique to improve recall and precision surprisingly.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信