Sixth International Conference on Machine Learning and Applications (ICMLA 2007)最新文献_第5页

Clustering Categorical Data Based on Maximal Frequent Itemsets 基于最大频繁项集的分类数据聚类

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.11

Dadong Yu, Dongbo Liu, Rui Luo, Jianxin Wang

引用次数: 4

Using evolutionary sampling to mine imbalanced data 利用进化抽样挖掘不平衡数据

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.73

D. J. Drown, T. Khoshgoftaar, R. Narayanan

引用次数: 22

Modifying kernels using label information improves SVM classification performance 使用标签信息修改核可以提高SVM的分类性能

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.84

Martin Renqiang Min, A. Bonner, Zhaolei Zhang

引用次数: 34

Text Mining and Ontology Applications in Bioinformatics and GIS 文本挖掘和本体在生物信息学和GIS中的应用

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.122

S. Navathe

{"title":"Text Mining and Ontology Applications in Bioinformatics and GIS","authors":"S. Navathe","doi":"10.1109/ICMLA.2007.122","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.122","url":null,"abstract":"Informatics and computers have not yet become as pervasive in chemistry as they have in physics and biology. Drawing analogies from bioinformatics, key ingredients for progress in chemoinformatics are the availability of large, annotated databases of compounds and reactions, data structures and algorithms to efficiently search these databases, and computational methods to predict the physical, chemical, and biological properties of new compounds and reactions. We will describe the development of: (1) a large public database of compounds and reactions (ChemDB); (2) machine learning kernel methods to predict molecular properties; and (3) the applications of these methods to drug screening/design problems and the identification of new drug leads against a major disease. More broadly, we will discuss some of the challenges and opportunities for computer science, AI, and machine learning in chemistry. Abstract: This talk will present some general problem areas and solutions in two fields of applications of machine learning: bioinformatics and Geographic Information Systems (GIS). The bioinformatics arena is very broad and encompasses many problems such as gene finding in sequences, molecular pathway construction, protein structure prediction etc. We will outline our research on finding important keywords from the biomedical literature by statistical analysis and some natural language analysis. We have also incorporated ontologies such as UMLS (Unified Medical Language System) to determine relationships among biological and medical concepts. The primary goal of this work has been to interpret the long lists of genes that are derived in microarray experiments used to understand and treat diseases. We are able to cluster genes based on their functional similarity. We have also used lists of keywords as feature vectors to drive SVM models for a classification of literature. In particular, we have dealt with the classification of relevant literature for Public health at the CDC (Centers of Disease Control). We will briefly explain the discovery of biomarkers for cancer using a technique that combines SVM and gene ontology.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127788365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 34

Sparsity regularization path for semi-supervised SVM 半监督支持向量机的稀疏正则化路径

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.81

G. Gasso, Karina Zapien Arreola, S. Canu

引用次数: 6

Biomarker Identification by Knowledge-Driven Multi-Level ICA and Motif Analysis 基于知识驱动的多层次ICA和Motif分析的生物标志物鉴定

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.58

Li Chen, J. Xuan, Chen Wang, Y. Wang, I. Shih, Tian-Li Wang, Zhen Zhang, R. Clarke, E. Hoffman

引用次数: 9

An optimization method for selecting parameters in support vector machines 支持向量机参数选择的优化方法

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.38

Yulin Dong, Manghui Tu, Zhonghang Xia, Guangming Xing

引用次数: 12

An incremental viterbi algorithm 一种增量viterbi算法

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.49

J. Bobbin

引用次数: 3

Memory-based context-sensitive spelling correction at web scale 基于记忆的上下文敏感拼写纠正在网络规模

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.50

Andrew Carlson, Ian Fette

引用次数: 75

Learning with limited minority class data 使用有限的少数族裔课堂数据进行学习

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI: 10.1109/ICMLA.2007.76

T. Khoshgoftaar, Chris Seiffert, J. V. Hulse, Amri Napolitano, A. Folleco

引用次数: 103