Muting Ma , Mesut Yavuz , Matthew Hudnall , Qin Wang
{"title":"C2SLM: A Correlation-based Clustering-assisted Sparse Learning Model for Electric Vehicle Market Demand Forecasting","authors":"Muting Ma , Mesut Yavuz , Matthew Hudnall , Qin Wang","doi":"10.1016/j.patcog.2025.111984","DOIUrl":null,"url":null,"abstract":"<div><div>Battery electric vehicle (BEV) market demand forecasting is challenged by high-dimensional feature selection from a regression model. The Least Absolute Shrinkage and Selection Operator (Lasso) is a sparse learning model used for this purpose. However, due to the correlation structure of data, Lasso can become unstable in its feature selection. This instability is further amplified in econometric panel data because of unobserved heterogeneity. Motivated by this challenge, we propose a correlation-based clustering-assisted sparse learning model (C2SLM) based on Lasso for a BEV market demand forecasting problem in the U.S. state of Alabama. Three stages structure the C2SLM: (i) correlation-polarized feature generation, (ii) panel-based feature clustering, and (iii) clustering-assisted Lasso (caLasso). We specifically aggregate the data to capture heterogeneous market factors for the panel. We test the C2SLM against benchmark models with or without panel consideration. Our proposed model outperforms others and sheds significant insights into BEV policymaking. Additionally, our model performs consistently robustly on Gross Domestic Product (GDP) and S&P 500 stock trading market datasets.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"170 ","pages":"Article 111984"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325006442","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Battery electric vehicle (BEV) market demand forecasting is challenged by high-dimensional feature selection from a regression model. The Least Absolute Shrinkage and Selection Operator (Lasso) is a sparse learning model used for this purpose. However, due to the correlation structure of data, Lasso can become unstable in its feature selection. This instability is further amplified in econometric panel data because of unobserved heterogeneity. Motivated by this challenge, we propose a correlation-based clustering-assisted sparse learning model (C2SLM) based on Lasso for a BEV market demand forecasting problem in the U.S. state of Alabama. Three stages structure the C2SLM: (i) correlation-polarized feature generation, (ii) panel-based feature clustering, and (iii) clustering-assisted Lasso (caLasso). We specifically aggregate the data to capture heterogeneous market factors for the panel. We test the C2SLM against benchmark models with or without panel consideration. Our proposed model outperforms others and sheds significant insights into BEV policymaking. Additionally, our model performs consistently robustly on Gross Domestic Product (GDP) and S&P 500 stock trading market datasets.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.