{"title":"Ozone Day Prediction Using a Combination Method of Matrix Completion and Interactive Lasso","authors":"Jing Li, Chun-Xia Chen, Xue Jiang, Jinjia Wang","doi":"10.1109/IMCCC.2015.26","DOIUrl":null,"url":null,"abstract":"The missing data classification problem is one of the common problems in machine learning. Conventional method eliminates the samples with missing values. In this paper, matrix completion, as a new method is proposed for filling the missing data. And this method and two traditional methods, eliminating the samples with missing values and filling the missing data based on the sample similarity, are compared through experiments on the ozone classification data. In addition, the ozone day prediction depends on complex interaction information among data features, so the interactive lasso model is proposed for interaction feature selection and classification. The interactive lasso method is compared with the lasso and random forest (RF) methods. The final experimental results demonstrate our combination method. The classification accuracy of ozone day is approaching 100%.","PeriodicalId":438549,"journal":{"name":"2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fifth International Conference on Instrumentation and Measurement, Computer, Communication and Control (IMCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCCC.2015.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The missing data classification problem is one of the common problems in machine learning. Conventional method eliminates the samples with missing values. In this paper, matrix completion, as a new method is proposed for filling the missing data. And this method and two traditional methods, eliminating the samples with missing values and filling the missing data based on the sample similarity, are compared through experiments on the ozone classification data. In addition, the ozone day prediction depends on complex interaction information among data features, so the interactive lasso model is proposed for interaction feature selection and classification. The interactive lasso method is compared with the lasso and random forest (RF) methods. The final experimental results demonstrate our combination method. The classification accuracy of ozone day is approaching 100%.