图在化学信息学和结构生物信息学中的应用

Eleanor J. Gardiner
{"title":"图在化学信息学和结构生物信息学中的应用","authors":"Eleanor J. Gardiner","doi":"10.4018/978-1-61350-053-8.ch017","DOIUrl":null,"url":null,"abstract":"INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …","PeriodicalId":227251,"journal":{"name":"Graph Data Management","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Graph applications in chemoinformatics and structural bioinformatics\",\"authors\":\"Eleanor J. Gardiner\",\"doi\":\"10.4018/978-1-61350-053-8.ch017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …\",\"PeriodicalId\":227251,\"journal\":{\"name\":\"Graph Data Management\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Graph Data Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/978-1-61350-053-8.ch017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Graph Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/978-1-61350-053-8.ch017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

化学的空间是非常大的。最近的估计表明,潜在的“类似药物”分子的数量在1012到10180之间(Gorse, 2006)。绝大多数这些分子从来没有,也永远不会被合成,但仍然需要方法来确定应该制造哪些潜在的化合物。一些大型制药/农化公司维护着包含数百万分子的公司数据库。新化学实体(NCEs)可能成为药物的发现取决于成功的挖掘。本章的重点是图论在化学信息学和结构生物信息学中的应用。化学图论的历史很长,可以追溯到19世纪60年代和凯库勒的结构理论。将分子的原子视为节点,将键视为标记图(分子图)的边(二维表示)是很自然的。本章将集中讨论利用此类图的计算机表示及其在二维和三维(其中边缘表示一对原子之间的三维空间中的距离)中的扩展而开发的算法,以及开发用于利用它们的算法。这些算法一般会被总结而不是详细说明。这些方法后来扩展到更大的大分子(如蛋白质);这些将不太详细地介绍。存储在这些数据库中的信息。这些信息可以是明确的(例如,分子可以用化学反应或活性数据注释),也可以隐含在分子的结构中。多年来,同构算法已经形成了这些数据库中分子对之间结构比较的基础,旨在提取这种隐式信息。本章的主要目的是向图论领域的从业者介绍化学信息学的概念,并展示图论技术在解决化学信息学问题方面的广泛应用。然而,化学信息学中的许多图论算法随后被用于结构生物信息学领域中大分子的结构比较。本章的第二个目的是提供这些应用程序的简要概述。现在将描述这一章的布局。在背景部分给出了化学信息学的一些定义,并将该主题置于药物发现过程的背景下。对结构生物信息学进行了定义,并介绍了必要的图论符号。这两个主要的…
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Graph applications in chemoinformatics and structural bioinformatics
INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信