通过功能域分析在计算机上发现癌症相关基因

H. Hsiao, J. Tsai, Peixuan Wu, Rouh-Mei Hu
{"title":"通过功能域分析在计算机上发现癌症相关基因","authors":"H. Hsiao, J. Tsai, Peixuan Wu, Rouh-Mei Hu","doi":"10.1109/MMSE.2003.1254458","DOIUrl":null,"url":null,"abstract":"Clinically, cancer is a complex family of diseases. From the viewpoint of molecular biology, cancer is a genetic disease resulting from abnormal gene expression due to DNA instability, such as translocation, amplification, deletion or point mutations. The purpose was first to perform functional analysis of a set of cancerous genes, and to discover or predict the other human genes that are hypothetically cancer-related. To achieve this goal, an approach consisting of three major components was proposed. Firstly, an automatic system has been developed to collect different data sets from the Internet, like human genes, proteins, and functional domains. Secondly, the functional domain compositions extracted from proteins were adopted for grouping the cancerous genes into a number of clusters by hierarchical clustering and association rule methods. The functional domains commonly existing in a cluster of genes represented the characteristic of that cluster. A second set of hypothetically cancerous genes predicted by single nucleotide polymorphism (SNP) was utilized for testing purpose. The experimental result indicated that a total accuracy of 81.45% was reached. It is anticipates that the proposed approach can be applied to other genetic diseases with minor modification.","PeriodicalId":322357,"journal":{"name":"Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"In silico discovery of cancer-related genes by functional domain analysis\",\"authors\":\"H. Hsiao, J. Tsai, Peixuan Wu, Rouh-Mei Hu\",\"doi\":\"10.1109/MMSE.2003.1254458\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clinically, cancer is a complex family of diseases. From the viewpoint of molecular biology, cancer is a genetic disease resulting from abnormal gene expression due to DNA instability, such as translocation, amplification, deletion or point mutations. The purpose was first to perform functional analysis of a set of cancerous genes, and to discover or predict the other human genes that are hypothetically cancer-related. To achieve this goal, an approach consisting of three major components was proposed. Firstly, an automatic system has been developed to collect different data sets from the Internet, like human genes, proteins, and functional domains. Secondly, the functional domain compositions extracted from proteins were adopted for grouping the cancerous genes into a number of clusters by hierarchical clustering and association rule methods. The functional domains commonly existing in a cluster of genes represented the characteristic of that cluster. A second set of hypothetically cancerous genes predicted by single nucleotide polymorphism (SNP) was utilized for testing purpose. The experimental result indicated that a total accuracy of 81.45% was reached. It is anticipates that the proposed approach can be applied to other genetic diseases with minor modification.\",\"PeriodicalId\":322357,\"journal\":{\"name\":\"Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings.\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MMSE.2003.1254458\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSE.2003.1254458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在临床上,癌症是一个复杂的疾病家族。从分子生物学的角度来看,癌症是由于DNA不稳定导致基因表达异常,如易位、扩增、缺失或点突变而引起的遗传性疾病。其目的首先是对一组癌症基因进行功能分析,并发现或预测假设与癌症相关的其他人类基因。为实现这一目标,提出了由三个主要部分组成的方法。首先,已经开发了一个自动系统,从互联网上收集不同的数据集,如人类基因、蛋白质和功能域。其次,利用从蛋白质中提取的功能域组合,采用层次聚类和关联规则方法将癌基因聚类成多个簇;一个基因簇中共同存在的功能域代表了该基因簇的特征。第二组由单核苷酸多态性(SNP)预测的假设癌基因被用于测试目的。实验结果表明,该方法的总准确率达到81.45%。预计所提出的方法可以应用于其他遗传疾病,稍加修改。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
In silico discovery of cancer-related genes by functional domain analysis
Clinically, cancer is a complex family of diseases. From the viewpoint of molecular biology, cancer is a genetic disease resulting from abnormal gene expression due to DNA instability, such as translocation, amplification, deletion or point mutations. The purpose was first to perform functional analysis of a set of cancerous genes, and to discover or predict the other human genes that are hypothetically cancer-related. To achieve this goal, an approach consisting of three major components was proposed. Firstly, an automatic system has been developed to collect different data sets from the Internet, like human genes, proteins, and functional domains. Secondly, the functional domain compositions extracted from proteins were adopted for grouping the cancerous genes into a number of clusters by hierarchical clustering and association rule methods. The functional domains commonly existing in a cluster of genes represented the characteristic of that cluster. A second set of hypothetically cancerous genes predicted by single nucleotide polymorphism (SNP) was utilized for testing purpose. The experimental result indicated that a total accuracy of 81.45% was reached. It is anticipates that the proposed approach can be applied to other genetic diseases with minor modification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信