Joao Carlos Silva de Souza, Suzana Gomes Claudino, Rodolfo da Silva Simoes, Patricia Rufino Oliveira, K. M. Honório
{"title":"药物化学数据分析中标签不平衡和不确定性处理的最新进展","authors":"Joao Carlos Silva de Souza, Suzana Gomes Claudino, Rodolfo da Silva Simoes, Patricia Rufino Oliveira, K. M. Honório","doi":"10.1109/SAI.2016.7555985","DOIUrl":null,"url":null,"abstract":"The discovery of new drugs is a very important area of study in medicinal chemistry. Developing a drug is not an easy task, as much time and money are needed to undertake all steps required for the development and test of new drugs. Amid this context, chemoinformatics is the area that has the role of interfacing between chemistry and computing, assisting in the process of identifying potential new drugs, through machine learning techniques for classification. This article will present the difficulties of classification found in chemoinformatics and approach machine learning techniques that, applied in the context of chemoinformatics, assist in treating issues related to uncertainty in data labeling and unbalanced classes, as they are common problems when using data sets of a chemical nature.","PeriodicalId":219896,"journal":{"name":"2016 SAI Computing Conference (SAI)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Recent advances for handling imbalancement and uncertainty in labelling in medicinal chemistry data analysis\",\"authors\":\"Joao Carlos Silva de Souza, Suzana Gomes Claudino, Rodolfo da Silva Simoes, Patricia Rufino Oliveira, K. M. Honório\",\"doi\":\"10.1109/SAI.2016.7555985\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The discovery of new drugs is a very important area of study in medicinal chemistry. Developing a drug is not an easy task, as much time and money are needed to undertake all steps required for the development and test of new drugs. Amid this context, chemoinformatics is the area that has the role of interfacing between chemistry and computing, assisting in the process of identifying potential new drugs, through machine learning techniques for classification. This article will present the difficulties of classification found in chemoinformatics and approach machine learning techniques that, applied in the context of chemoinformatics, assist in treating issues related to uncertainty in data labeling and unbalanced classes, as they are common problems when using data sets of a chemical nature.\",\"PeriodicalId\":219896,\"journal\":{\"name\":\"2016 SAI Computing Conference (SAI)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 SAI Computing Conference (SAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAI.2016.7555985\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 SAI Computing Conference (SAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAI.2016.7555985","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recent advances for handling imbalancement and uncertainty in labelling in medicinal chemistry data analysis
The discovery of new drugs is a very important area of study in medicinal chemistry. Developing a drug is not an easy task, as much time and money are needed to undertake all steps required for the development and test of new drugs. Amid this context, chemoinformatics is the area that has the role of interfacing between chemistry and computing, assisting in the process of identifying potential new drugs, through machine learning techniques for classification. This article will present the difficulties of classification found in chemoinformatics and approach machine learning techniques that, applied in the context of chemoinformatics, assist in treating issues related to uncertainty in data labeling and unbalanced classes, as they are common problems when using data sets of a chemical nature.