Sixth International Conference on Machine Learning and Applications (ICMLA 2007)最新文献

筛选
英文 中文
Constructive neural network ensemble for regression tasks in high dimensional spaces 高维空间回归任务的构造神经网络集成
A. Schmitz, H. Hefazi
{"title":"Constructive neural network ensemble for regression tasks in high dimensional spaces","authors":"A. Schmitz, H. Hefazi","doi":"10.1109/ICMLA.2007.82","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.82","url":null,"abstract":"This research focuses on the development of constructive neural networks (NN)for regression tasks in high dimensional spaces. A constructive algorithm which is referred to as modified cascade correlation (MCC) has been developed. MCC has several improvements relative to the original algorithm. They include stopping the training when the minimum squared error on a small unseen dataset is reached. This method is known to improve the generalization ability of the NN, i.e. its ability to accurately predict cases not in the training set. The subject of this paper is to investigate committee networks trained with the MCC. A mathematical function is used to study the generalization properties of the network for input space dimension ranging from five to thirty. The study shows that \"ensemble averaged\" network committees greatly improve the generalization performance of the MCC algorithm. Areas of further research are outlined and include investigating other types of committees.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128855141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Alignment of Multiple Proteins with an Ensemble of Hidden Markov Models 基于隐马尔可夫模型的多重蛋白质比对
Jia Song, Chunmei Liu, Yinglei Song, Junfeng Qu
{"title":"Alignment of Multiple Proteins with an Ensemble of Hidden Markov Models","authors":"Jia Song, Chunmei Liu, Yinglei Song, Junfeng Qu","doi":"10.1109/ICMLA.2007.90","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.90","url":null,"abstract":"The alignment of multiple protein sequences is a problem of fundamental importance in bioinformatics. In general, the optimal alignment can be obtained through the optimization of an objective function. However, such an optimization task is often computationally intractible, most of the existing alignment tools thus use statistical or machine learning based methods to avoid direct optimizations. In this paper, we develop a new method that can progressively construct and update a set of alignments by adding sequences in certain order to each of the existing alignments. In particular, each of the existing alignments is modeled with a profile hidden markov model (HMM) and an added sequence is aligned to each of these profile HMMs. The profile HMMs in the set are then updated based on the alignments with leading alignment scores. We performed experiments on BaliBASE benchmarks to compare the performance of this new approach with that of other alignment tools. Our experiments showed that, by introducing an integer parameter that controls the number of profile HMMs in the set, we are able to efficiently explore the alignment space and significantly improve the alignment accuracy on sequences with low similarity.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"05 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128983236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Scalable optimal linear representation for face and object recognition 面向人脸和物体识别的可扩展最佳线性表示
Yiming Wu, Xiuwen Liu, W. Mio
{"title":"Scalable optimal linear representation for face and object recognition","authors":"Yiming Wu, Xiuwen Liu, W. Mio","doi":"10.1109/ICMLA.2007.110","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.110","url":null,"abstract":"Optimal component analysis (OCA) is a linear method for feature extraction and dimension reduction. It has been widely used in many applications such as face and object recognitions. The optimal basis of OCA is obtained through solving an optimization problem on a Grassmann manifold. However, one limitation of OCA is the computational cost becoming heavy when the number of training data is large, which prevents OCA from efficiently applying in many real applications. In this paper, a scalable OCA (S-OCA) that uses a two-stage strategy is developed to bridge this gap. In the first stage, we cluster the training data using K-means algorithm and the dimension of data is reduced into a low dimensional space. In the second stage, OCA search is performed in the reduced space and the gradient is updated using an numerical approximation. In the process of OCA gradient updating, instead of choosing the entire training data, S-OCA randomly chooses a small subset of the training images in each class to update the gradient. This achieves stochastic gradient updating and at the same time reduces the searching time of OCA in orders of magnitude. Experimental results on face and object datasets show efficiency of the S-OCA method, in term of both classification accuracy and computational complexity.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124321749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Model evaluation for prognostics: estimating cost saving for the end users 预测的模型评估:估计最终用户节省的成本
Chunsheng Yang, S. Létourneau
{"title":"Model evaluation for prognostics: estimating cost saving for the end users","authors":"Chunsheng Yang, S. Létourneau","doi":"10.1109/ICMLA.2007.59","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.59","url":null,"abstract":"Unexpected failures of complex equipment such as trains or aircraft introduce superfluous costs, disrupt operation, have an effect on consumer's satisfaction, and potentially decrease safety in practice. One of the objectives of prognostics and health management (PHM) systems is to help reduce the number of unexpected failures by continuously monitoring the components of interest and predicting their failures sufficiently in advance to allow for proper planning. In other words, PHM systems may help turn unexpected failures into expected ones. Recent research has demonstrated the usefulness of data mining to help build prognostic models for PHM but also has identified the need for new model evaluation methods that take into account the specificities of prognostic applications. This paper investigates this problem. First, it reviews classical and recent methods to evaluate data mining models and it explains their deficiencies with respect to prognostic applications. The paper then proposes a novel approach that overcomes these deficiencies. This approach integrates the various costs and benefits involved in prognostics to quantify the cost saving expected from a given prognostic model. From the end user's perspective, the formula is practical as it is easy to understand and requires realistic inputs. The paper illustrates the usefulness of the methods through a real-world case study involving data-mining prognostic models and realistic costs/benefits information. The results show the feasibility of the approach and its applicability to various prognostic applications.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"211 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115190579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
An OCR-independent character segmentation using shortest-path in grayscale document images 灰度文档图像中使用最短路径的字符分割
Jia Tse, Christopher Jones, Dean Curtis, E. Yfantis
{"title":"An OCR-independent character segmentation using shortest-path in grayscale document images","authors":"Jia Tse, Christopher Jones, Dean Curtis, E. Yfantis","doi":"10.1109/ICMLA.2007.21","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.21","url":null,"abstract":"An optical character recognition (OCR) system with a high recognition rate is challenging to develop. One of the major contributors to OCR errors is smeared characters. Several factors lead to the smearing of characters such as bad scanning quality and a poor binarization technique. Typical approaches to character segmentation falls into three major categories: image-based, recognition-based, and holistic-based. Among these approaches, the segmentation path can be linear or non-linear. Our paper proposes a non-linear approach to segment characters on grayscale document images. Our method first determines whether characters are smeared together using general character features. The correct segmentation path is found using a shortest path approach. We achieved a segmentation accuracy of 95% over a set of about 2,000 smeared characters.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"231 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115883548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Evolving kernel functions for SVMs by genetic programming 基于遗传规划的支持向量机核函数演化
L. Dioşan, A. Rogozan, J. Pécuchet
{"title":"Evolving kernel functions for SVMs by genetic programming","authors":"L. Dioşan, A. Rogozan, J. Pécuchet","doi":"10.1109/ICMLA.2007.70","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.70","url":null,"abstract":"hybrid model for evolving support vector machine (SVM) kernel functions is developed in this paper. The kernel expression is considered as a parameter of the SVM algorithm and the current approach tries to find the best expression for this SVM parameter. The model is a hybrid technique that combines a genetic programming (GP) algorithm and a support vector machine (SVM) algorithm. Each GP chromosome is a tree encoding the mathematical expression for the kernel function. The evolved kernel is compared to several human-designed kernels and to a previous genetic kernel on several datasets. Numerical experiments show that the SVM embedding our evolved kernel performs statistically better than standard kernels, but also than previous genetic kernel for all considered classification problems.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131528367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Combining multi-distributed mixture models and bayesian networks for semi-supervised learning 结合多分布混合模型和贝叶斯网络进行半监督学习
Manuel Stritt, L. Schmidt-Thieme, G. Poeppel
{"title":"Combining multi-distributed mixture models and bayesian networks for semi-supervised learning","authors":"Manuel Stritt, L. Schmidt-Thieme, G. Poeppel","doi":"10.1109/ICMLA.2007.31","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.31","url":null,"abstract":"In many real world scenarios, mixture models have successfully been used for analyzing features in data ([11, 13, 21]). Usually, multivariate Gaussian distributions for continuous data ([2, 8, 4]) or Bayesian networks for nominal data ([15, 16]) are applied. In this paper, we combine both approaches in a family of Bayesian models for continuous data that are able to handle univariate as well as multivariate nodes, different types of distributions, e.g. Gaussian as well as Poisson distributed nodes, and dependencies between nodes. The models we introduce can be used for unsupervised, semi-supervised as well as for fully supervised learning tasks. We evaluate our models empirically on generated synthetic data and on public datasets thereby showing that they outperform classifiers such as SVMs and logistic regression on mixture data.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130938213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Discover the power of social and hidden curriculum to decision making: experiments with enron email and movie newsgroups 发现社会和隐藏课程对决策的力量:安然电子邮件和电影新闻组的实验
Hung-Ching Chen, M. Goldberg, M. Magdon-Ismail, W. Wallace
{"title":"Discover the power of social and hidden curriculum to decision making: experiments with enron email and movie newsgroups","authors":"Hung-Ching Chen, M. Goldberg, M. Magdon-Ismail, W. Wallace","doi":"10.1109/ICMLA.2007.87","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.87","url":null,"abstract":"The power of social values that helps to surreptitiously shape or formulate our behavior patterns is not only inevitable, but also influential as the directions of our decision making can never seem to escape the impact of this hidden agent. Therefore, the search of such power agent can be validated through a machine learning approach that enables us to discover the agent dynamics in which drives the evolution of the social groups in a community. By doing so, we set up the problem by introducing a parameterized probabilistic model for the agent dynamics: the acts of an agent are determined by micro-laws with unknown parameters. Our approach is to identify the appropriate parameters in the model. To solve the problem, we develop heuristic expectation-maximization style algorithms for determining the micro-laws of a community based on either observed communication links between actors, or the observed evolution of social groups. We present the learning results from the synthetic data as well as the findings on real communities, e.g., Enron email and movie newsgroups.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128734859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enhanced recursive feature elimination 增强递归特征消除
Xue-wen Chen, Jong Cheol Jeong
{"title":"Enhanced recursive feature elimination","authors":"Xue-wen Chen, Jong Cheol Jeong","doi":"10.1109/ICMLA.2007.35","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.35","url":null,"abstract":"For classification with small training samples and high dimensionality, feature selection plays an important role in avoiding overfitting problems and improving classification performance. One of the commonly used feature selection methods for small samples problems is recursive feature elimination (RFE) method. RFE method utilizes the generalization capability embedded in support vector machines and is thus suitable for small samples problems. Despite its good performance, RFE tends to discard \"weak\" features, which may provide a significant improvement of performance when combined with other features. In this paper, we propose an enhanced recursive feature elimination (EnRFE) method for feature selection in small training sample classification. Our experimental results show that the proposed method outperforms the original RFE in terms of classification accuracy on various datasets.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127472156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 136
Tracking recurrent concept drift in streaming data using ensemble classifiers 使用集成分类器跟踪流数据中的循环概念漂移
S. Ramamurthy, R. Bhatnagar
{"title":"Tracking recurrent concept drift in streaming data using ensemble classifiers","authors":"S. Ramamurthy, R. Bhatnagar","doi":"10.1109/ICMLA.2007.109","DOIUrl":"https://doi.org/10.1109/ICMLA.2007.109","url":null,"abstract":"Streaming data may consist of multiple drifting concepts each having its own underlying data distribution. We present an ensemble learning based approach to handle the data streams having multiple underlying modes. We build a global set of classifiers from sequential data chunks; ensembles are then selected from this global set of classifiers, and new classifiers created if needed, to represent the current concept in the stream. The system is capable of performing any-time classification and to detect concept drift in the stream. In streaming data historic concepts are likely to reappear so we don't delete any of the historic classifiers. Instead, we judiciously select only pertinent classifiers from the global set while forming the ensemble set for a classification task.","PeriodicalId":448863,"journal":{"name":"Sixth International Conference on Machine Learning and Applications (ICMLA 2007)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114784522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 91
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信