{"title":"Feature Selection on Dynamometer Data for Reliability Analysis","authors":"Janell Duhaney, T. Khoshgoftaar, J. Sloan","doi":"10.1109/ICTAI.2011.173","DOIUrl":null,"url":null,"abstract":"An ocean turbine extracts the kinetic energy from ocean currents to generate electricity. Vibration signals from the turbine hold a wealth of information regarding its state, and detecting changes in these signals is crucial to the timely detection of faults. Wavelet transforms provide a means of analyzing these complex signals and extracting features which are representative of the signal. Feature selection techniques are needed once these wavelet features are extracted to eliminate redundant or useless features before the data is presented to a machine learning algorithm for pattern recognition and classification. This reduces the quantity of data to be processed and can often even increase the machine learner's ability to detect the current state of the machine. This paper empirically compares eight feature selection algorithms on wavelet transformed vibration data originating from an onshore test platform for an ocean turbine. A case study shows the classification performances of seven machine learners when trained on the datasets with varying numbers of features selected from the original set of all features. Our results highlight that by choosing an appropriate feature selection technique and applying it to selecting just the 3 most important features (3.33% of the original feature set), some classifiers such as the decision tree and random forest can correctly differentiate between faulty and nonfaulty states almost 100% of the time. These results also show the performance differences between different feature selection algorithms and classifier combinations.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2011.173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
An ocean turbine extracts the kinetic energy from ocean currents to generate electricity. Vibration signals from the turbine hold a wealth of information regarding its state, and detecting changes in these signals is crucial to the timely detection of faults. Wavelet transforms provide a means of analyzing these complex signals and extracting features which are representative of the signal. Feature selection techniques are needed once these wavelet features are extracted to eliminate redundant or useless features before the data is presented to a machine learning algorithm for pattern recognition and classification. This reduces the quantity of data to be processed and can often even increase the machine learner's ability to detect the current state of the machine. This paper empirically compares eight feature selection algorithms on wavelet transformed vibration data originating from an onshore test platform for an ocean turbine. A case study shows the classification performances of seven machine learners when trained on the datasets with varying numbers of features selected from the original set of all features. Our results highlight that by choosing an appropriate feature selection technique and applying it to selecting just the 3 most important features (3.33% of the original feature set), some classifiers such as the decision tree and random forest can correctly differentiate between faulty and nonfaulty states almost 100% of the time. These results also show the performance differences between different feature selection algorithms and classifier combinations.