{"title":"采用集合机器学习的线性模型中的特征重要性:法玛和弗伦奇五因子模型研究","authors":"Tae Yeon Kwon","doi":"10.1016/j.frl.2024.106406","DOIUrl":null,"url":null,"abstract":"<div><div>This study explores key considerations for interpreting feature influence and importance in Machine Learning (ML) for financial models that commonly assume linearity. Simulations demonstrate that ML techniques, including Random Forest, XGBoost, and CatBoost, may produce misleading feature importance ranks when the underlying model is linear. We empirically examine the Fama–French five-factor model using U.S. monthly data from July 1964 to June 2024. While the most important factors are consistently identified, the ranks of moderately important factors vary depending on the estimation method. These results highlight the need for a critical application of ML in financial modeling when the purpose is interpretability.</div></div>","PeriodicalId":12167,"journal":{"name":"Finance Research Letters","volume":"71 ","pages":"Article 106406"},"PeriodicalIF":6.9000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature importance in linear models with ensemble machine learning: A study of the Fama and French five-factor model\",\"authors\":\"Tae Yeon Kwon\",\"doi\":\"10.1016/j.frl.2024.106406\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study explores key considerations for interpreting feature influence and importance in Machine Learning (ML) for financial models that commonly assume linearity. Simulations demonstrate that ML techniques, including Random Forest, XGBoost, and CatBoost, may produce misleading feature importance ranks when the underlying model is linear. We empirically examine the Fama–French five-factor model using U.S. monthly data from July 1964 to June 2024. While the most important factors are consistently identified, the ranks of moderately important factors vary depending on the estimation method. These results highlight the need for a critical application of ML in financial modeling when the purpose is interpretability.</div></div>\",\"PeriodicalId\":12167,\"journal\":{\"name\":\"Finance Research Letters\",\"volume\":\"71 \",\"pages\":\"Article 106406\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2024-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Finance Research Letters\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1544612324014351\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Finance Research Letters","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1544612324014351","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
Feature importance in linear models with ensemble machine learning: A study of the Fama and French five-factor model
This study explores key considerations for interpreting feature influence and importance in Machine Learning (ML) for financial models that commonly assume linearity. Simulations demonstrate that ML techniques, including Random Forest, XGBoost, and CatBoost, may produce misleading feature importance ranks when the underlying model is linear. We empirically examine the Fama–French five-factor model using U.S. monthly data from July 1964 to June 2024. While the most important factors are consistently identified, the ranks of moderately important factors vary depending on the estimation method. These results highlight the need for a critical application of ML in financial modeling when the purpose is interpretability.
期刊介绍:
Finance Research Letters welcomes submissions across all areas of finance, aiming for rapid publication of significant new findings. The journal particularly encourages papers that provide insight into the replicability of established results, examine the cross-national applicability of previous findings, challenge existing methodologies, or demonstrate methodological contingencies.
Papers are invited in the following areas:
Actuarial studies
Alternative investments
Asset Pricing
Bankruptcy and liquidation
Banks and other Depository Institutions
Behavioral and experimental finance
Bibliometric and Scientometric studies of finance
Capital budgeting and corporate investment
Capital markets and accounting
Capital structure and payout policy
Commodities
Contagion, crises and interdependence
Corporate governance
Credit and fixed income markets and instruments
Derivatives
Emerging markets
Energy Finance and Energy Markets
Financial Econometrics
Financial History
Financial intermediation and money markets
Financial markets and marketplaces
Financial Mathematics and Econophysics
Financial Regulation and Law
Forecasting
Frontier market studies
International Finance
Market efficiency, event studies
Mergers, acquisitions and the market for corporate control
Micro Finance Institutions
Microstructure
Non-bank Financial Institutions
Personal Finance
Portfolio choice and investing
Real estate finance and investing
Risk
SME, Family and Entrepreneurial Finance