{"title":"Computed Data-Geometry Based Supervised and Semi-supervised Learning in High Dimensional Data","authors":"Elizabeth P. Chou, F. Hsieh, J. Capitanio","doi":"10.1109/ICMLA.2013.56","DOIUrl":null,"url":null,"abstract":"In most high dimensional settings, constructing supervised or semi-supervised learning rules has been facing various critically difficult issues, such as no visualizing tools for empirical guidance, no valid distance measures, and no suitable variable selection methods for proper discrimination among data nodes. We attempt to alleviate all of these difficulties by computing data-geometry via a recently developed computational algorithm called Data Cloud geometry (DCG). The computed geometry is represented by a hierarchy of clusters providing a base for developing a divide-and-conquer version of a learning approach. We demonstrate the advantages of taking posteriori geometric information into learning rules construction by evaluating its performance with many illustrated examples and several real data sets compared to the performance resulting from the majority of commonly used techniques.","PeriodicalId":168867,"journal":{"name":"2013 12th International Conference on Machine Learning and Applications","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 12th International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2013.56","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In most high dimensional settings, constructing supervised or semi-supervised learning rules has been facing various critically difficult issues, such as no visualizing tools for empirical guidance, no valid distance measures, and no suitable variable selection methods for proper discrimination among data nodes. We attempt to alleviate all of these difficulties by computing data-geometry via a recently developed computational algorithm called Data Cloud geometry (DCG). The computed geometry is represented by a hierarchy of clusters providing a base for developing a divide-and-conquer version of a learning approach. We demonstrate the advantages of taking posteriori geometric information into learning rules construction by evaluating its performance with many illustrated examples and several real data sets compared to the performance resulting from the majority of commonly used techniques.