Breeding policies in evolutionary approximation of optimal subspace

Proceedings of the Second International Conference on Intelligent Processing and Manufacturing of Materials. IPMM'99 (Cat. No.99EX296) Pub Date : 1999-07-01 DOI:10.1109/IPMM.1999.792491

H.M. Huang, P.L. Leung

{"title":"Breeding policies in evolutionary approximation of optimal subspace","authors":"H.M. Huang, P.L. Leung","doi":"10.1109/IPMM.1999.792491","DOIUrl":null,"url":null,"abstract":"In very high dimension variable space (e.g. 30 or more), huge computations evenly hinder investigators to conduct any direct meaningful analysis. A traditional trick is firstly to conduct single variable analysis, then combine several top most single-fittest variables to approximate the optimal subspace. In this investigation, an evolutionary method for optimal subspace approximation is proposed. The breeding policies of this evolutionary approximation, its scalability and generalization have been intensively investigated. The studied object is a 30-D variable space which contains 6000 artificial individuals. In this data, except for 3 variables containing two donut-type data distributions, each with 3000 individuals, the remaining 27 variables only contain quasi-random data with the same value range as the donut data distributions. The donut distribution consist of two toroidal distributions (classes) which are interlocked like links in a chain. The cross-section of each distribution is a Gaussian function distributed with standard deviation delta. Even the Donut problem which possesses a variety of pathological traits can invalidate many non-complex analyses for classification. The goal of this investigation was to find the 3 donut variables within the optimal subspace of 30-D variable space in which most quasi-random variables emerge as noise variables. In order to reach this goal, various breeding policies were implemented and compared. Although no perfect solution for the approximation was found, various breeding policies and their impact on decreasing the error were studied. These were found to be relatively usable for reference and might be improved when used in a practical application.","PeriodicalId":194215,"journal":{"name":"Proceedings of the Second International Conference on Intelligent Processing and Manufacturing of Materials. IPMM'99 (Cat. No.99EX296)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Second International Conference on Intelligent Processing and Manufacturing of Materials. IPMM'99 (Cat. No.99EX296)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPMM.1999.792491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In very high dimension variable space (e.g. 30 or more), huge computations evenly hinder investigators to conduct any direct meaningful analysis. A traditional trick is firstly to conduct single variable analysis, then combine several top most single-fittest variables to approximate the optimal subspace. In this investigation, an evolutionary method for optimal subspace approximation is proposed. The breeding policies of this evolutionary approximation, its scalability and generalization have been intensively investigated. The studied object is a 30-D variable space which contains 6000 artificial individuals. In this data, except for 3 variables containing two donut-type data distributions, each with 3000 individuals, the remaining 27 variables only contain quasi-random data with the same value range as the donut data distributions. The donut distribution consist of two toroidal distributions (classes) which are interlocked like links in a chain. The cross-section of each distribution is a Gaussian function distributed with standard deviation delta. Even the Donut problem which possesses a variety of pathological traits can invalidate many non-complex analyses for classification. The goal of this investigation was to find the 3 donut variables within the optimal subspace of 30-D variable space in which most quasi-random variables emerge as noise variables. In order to reach this goal, various breeding policies were implemented and compared. Although no perfect solution for the approximation was found, various breeding policies and their impact on decreasing the error were studied. These were found to be relatively usable for reference and might be improved when used in a practical application.

查看原文本刊更多论文

最优子空间的进化逼近育种策略

在非常高维的变量空间中(例如30维或更多)，巨大的计算会阻碍研究人员进行任何直接有意义的分析。传统的方法是先进行单变量分析，然后结合几个最顶级的单拟合变量来近似最优子空间。在此研究中，提出了一种最优子空间逼近的进化方法。这种进化近似的育种策略、可扩展性和泛化性已经得到了深入的研究。研究对象是一个包含6000个人工个体的30维可变空间。在该数据中，除了3个变量包含两个甜甜圈型数据分布，每个有3000个个体外，其余27个变量只包含与甜甜圈数据分布相同取值范围的准随机数据。甜甜圈分布由两个环面分布(类)组成，这两个环面分布(类)像链中的链环一样互锁。每个分布的截面都是一个高斯函数，分布有标准差delta。即使是具有多种病理特征的Donut问题也会使许多非复杂的分类分析失效。本研究的目标是在30-D变量空间的最优子空间中找到3个甜甜圈变量，其中大多数准随机变量作为噪声变量出现。为了达到这一目标，实施了各种育种政策并进行了比较。虽然没有找到近似的完美解，但研究了各种育种策略及其对减小误差的影响。结果表明，这些方法具有一定的参考价值，并可在实际应用中加以改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Second International Conference on Intelligent Processing and Manufacturing of Materials. IPMM'99 (Cat. No.99EX296)

自引率

0.00%

发文量