S. Subramaniam, D. Tcheng, K. Hu, Harish Ragavan, L. Rendell
{"title":"Knowledge engineering for protein structure and motifs: design of a prototype system","authors":"S. Subramaniam, D. Tcheng, K. Hu, Harish Ragavan, L. Rendell","doi":"10.1109/SEKE.1992.227960","DOIUrl":null,"url":null,"abstract":"A knowledge base and learning system is designed to help biologists predict protein structure and function based on sequence information. The knowledge base contains diverse information about: (a) protein motif sequences and structures, (b) heuristics and programs for identifying protein features, and (c) molecular simulation programs. The learning system selects the most relevant information for a particular prediction task and optimally integrates the information to generate accurate and comprehensible hypotheses. Biologists define the objectives for learning such as accuracy and comprehensibility. To overcome the limitations of existing induction algorithms, techniques are developed for constructing new features based on the existing knowledge base. Optimization algorithms are used for determining the best combination of induction and feature construction strategies for a problem.<<ETX>>","PeriodicalId":191866,"journal":{"name":"Proceedings Fourth International Conference on Software Engineering and Knowledge Engineering","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Fourth International Conference on Software Engineering and Knowledge Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEKE.1992.227960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
A knowledge base and learning system is designed to help biologists predict protein structure and function based on sequence information. The knowledge base contains diverse information about: (a) protein motif sequences and structures, (b) heuristics and programs for identifying protein features, and (c) molecular simulation programs. The learning system selects the most relevant information for a particular prediction task and optimally integrates the information to generate accurate and comprehensible hypotheses. Biologists define the objectives for learning such as accuracy and comprehensibility. To overcome the limitations of existing induction algorithms, techniques are developed for constructing new features based on the existing knowledge base. Optimization algorithms are used for determining the best combination of induction and feature construction strategies for a problem.<>