图在化学信息学和结构生物信息学中的应用

Graph Data Management Pub Date : 1900-01-01 DOI:10.4018/978-1-61350-053-8.ch017

Eleanor J. Gardiner

{"title":"图在化学信息学和结构生物信息学中的应用","authors":"Eleanor J. Gardiner","doi":"10.4018/978-1-61350-053-8.ch017","DOIUrl":null,"url":null,"abstract":"INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …","PeriodicalId":227251,"journal":{"name":"Graph Data Management","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Graph applications in chemoinformatics and structural bioinformatics\",\"authors\":\"Eleanor J. Gardiner\",\"doi\":\"10.4018/978-1-61350-053-8.ch017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …\",\"PeriodicalId\":227251,\"journal\":{\"name\":\"Graph Data Management\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Graph Data Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/978-1-61350-053-8.ch017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Graph Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/978-1-61350-053-8.ch017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

化学的空间是非常大的。最近的估计表明，潜在的“类似药物”分子的数量在1012到10180之间(Gorse, 2006)。绝大多数这些分子从来没有，也永远不会被合成，但仍然需要方法来确定应该制造哪些潜在的化合物。一些大型制药/农化公司维护着包含数百万分子的公司数据库。新化学实体(NCEs)可能成为药物的发现取决于成功的挖掘。本章的重点是图论在化学信息学和结构生物信息学中的应用。化学图论的历史很长，可以追溯到19世纪60年代和凯库勒的结构理论。将分子的原子视为节点，将键视为标记图(分子图)的边(二维表示)是很自然的。本章将集中讨论利用此类图的计算机表示及其在二维和三维(其中边缘表示一对原子之间的三维空间中的距离)中的扩展而开发的算法，以及开发用于利用它们的算法。这些算法一般会被总结而不是详细说明。这些方法后来扩展到更大的大分子(如蛋白质);这些将不太详细地介绍。存储在这些数据库中的信息。这些信息可以是明确的(例如，分子可以用化学反应或活性数据注释)，也可以隐含在分子的结构中。多年来，同构算法已经形成了这些数据库中分子对之间结构比较的基础，旨在提取这种隐式信息。本章的主要目的是向图论领域的从业者介绍化学信息学的概念，并展示图论技术在解决化学信息学问题方面的广泛应用。然而，化学信息学中的许多图论算法随后被用于结构生物信息学领域中大分子的结构比较。本章的第二个目的是提供这些应用程序的简要概述。现在将描述这一章的布局。在背景部分给出了化学信息学的一些定义，并将该主题置于药物发现过程的背景下。对结构生物信息学进行了定义，并介绍了必要的图论符号。这两个主要的…

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Graph applications in chemoinformatics and structural bioinformatics

INTRODUCTION Chemistry space is exceedingly large. Recent estimates put the number of potentially 'drug-like' molecules at anything between 10 12 and 10 180 (Gorse, 2006). The overwhelming majority of these molecules never has been, and never will be, synthesized but methods are nevertheless required to determine which of these potential compounds should be made. Some large pharmaceutical/ agrochemical companies maintain corporate databases of millions of molecules. The discovery of New Chemical Entities (NCEs) which may become drugs depends on the successful mining ABSTRACT The focus of this chapter will be the uses of graph theory in chemoinformatics and in structural bio-informatics. There is a long history of chemical graph theory dating back to the 1860's and Kekule's structural theory. It is natural to regard the atoms of a molecule as nodes and the bonds as edges (2D representations) of a labeled graph (a molecular graph). This chapter will concentrate on the algorithms developed to exploit the computer representation of such graphs and their extensions in both two and three dimensions (where an edge represents the distance in 3D space between a pair of atoms), together with the algorithms developed to exploit them. The algorithms will generally be summarized rather than detailed. The methods were later extended to larger macromolecules (such as proteins); these will be covered in less detail. of the information stored in these databases. Such information may be explicit (e.g. the molecules may be annotated with chemical reaction or activity data) or may be implicit in the structure of a molecule. For many years isomorphism algorithms have formed the basis of structural comparison between pairs of molecules in these databases, designed to extract this implicit information. The main purpose of this chapter is to introduce the concept of chemoinformatics to practitioners from the field of graph theory and to demonstrate the widespread application of graph-theoretic techniques to the solving of chemoinformatics problems. However, many graph-theoretic algorithms from chemoinformatics have subsequently been adapted for the structural comparison of macromolecules in the field known as structural bioinformatics. A secondary aim of the chapter is to provide a brief overview of these applications. The layout of the chapter will now be described. In the Background section some definitions of chemoinformatics are given and the topic is placed within the context of the drug discovery process. Structural bioinformatics is also defined and the necessary graph theoretic notation is introduced. The two main …

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Graph Data Management

自引率

0.00%

发文量