用图上的函数编码蛋白质结构

Promita Bose, Xiaxia Yu, R. Harrison
{"title":"用图上的函数编码蛋白质结构","authors":"Promita Bose, Xiaxia Yu, R. Harrison","doi":"10.1109/BIBMW.2011.6112396","DOIUrl":null,"url":null,"abstract":"The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"1 1","pages":"338-344"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":"{\"title\":\"Encoding protein structure with functions on graphs\",\"authors\":\"Promita Bose, Xiaxia Yu, R. Harrison\",\"doi\":\"10.1109/BIBMW.2011.6112396\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.\",\"PeriodicalId\":6345,\"journal\":{\"name\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"volume\":\"1 1\",\"pages\":\"338-344\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBMW.2011.6112396\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2011.6112396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

摘要

机器学习和数据挖掘在蛋白质结构分析和预测中的应用是一个在计算机科学和生物学中具有潜在高影响的研究领域。蛋白质结构本质上是复杂的物体,具有清晰和模糊的混合特性。因此,为它们开发有效的表征本身就是一个研究问题,而量化和预测性质和结构在结构生物学中具有直接的重要性。本文的重点是开发一种紧凑、有效、高效和准确的蛋白质结构表示,该表示与广泛使用的机器学习工具(如SVM)兼容。基于Delaunay三角测量的图被用来表示结构,然后从这些图中构建函数来开发与氨基酸序列紧密结合的蛋白质结构的恒定大小表示。这些表示保留了足够的信息,对模型与实验结构的分类和模型质量的回归分析有价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Encoding protein structure with functions on graphs
The application of machine learning and datamining to the analysis and prediction of protein structure is a research area with potentially high impact in both computer science and biology. Proteins structures are inherently complicated objects with a mixture of crisp and fuzzy properties. Therefore developing effective representations for them is a research problem in itself, while quantifying and predicting properties and structure is of immediate importance in structural biology. This paper focuses on developing a compact, effective, efficient and accurate representation of protein structure that is compatible with widely used machine learning tools like the SVM. Graphs based on Delaunay triangulation are used to represent the structure, and then functions are constructed from these graphs to develop constant-size representations of protein structure that are tightly bound to the amino acid sequence. The representations preserve sufficient information to be valuable for model vs. experimental structure classification and regression analysis of model quality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信