{"title":"随机对照试验对抗机器:机器学习预测方法能否恢复实验治疗效果?","authors":"Brian C. Prest, Casey J. Wichman, K. Palmer","doi":"10.1086/724518","DOIUrl":null,"url":null,"abstract":"We investigate how successfully machine-learning (ML) prediction algorithms can be used to estimate causal treatment effects in electricity demand applications with nonexperimental data. We use three prediction algorithms—XGBoost, random forests, and LASSO—to generate counterfactuals using observational data. Using those counterfactuals, we estimate nonexperimental treatment effects and compare them to experimental treatment effects from a randomized experiment for electricity customers who faced critical-peak pricing and information treatments. Our results show that nonexperimental treatment effects based on each algorithm replicate the true treatment effects, even when only using data from treated households. Additionally, when using both treatment households and nonexperimental comparison households, standard two-way fixed effects regressions replicate the experimental benchmark, suggesting little benefit from ML approaches over standard program evaluation methods in that setting.","PeriodicalId":47114,"journal":{"name":"Journal of the Association of Environmental and Resource Economists","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"RCTs against the Machine: Can Machine Learning Prediction Methods Recover Experimental Treatment Effects?\",\"authors\":\"Brian C. Prest, Casey J. Wichman, K. Palmer\",\"doi\":\"10.1086/724518\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We investigate how successfully machine-learning (ML) prediction algorithms can be used to estimate causal treatment effects in electricity demand applications with nonexperimental data. We use three prediction algorithms—XGBoost, random forests, and LASSO—to generate counterfactuals using observational data. Using those counterfactuals, we estimate nonexperimental treatment effects and compare them to experimental treatment effects from a randomized experiment for electricity customers who faced critical-peak pricing and information treatments. Our results show that nonexperimental treatment effects based on each algorithm replicate the true treatment effects, even when only using data from treated households. Additionally, when using both treatment households and nonexperimental comparison households, standard two-way fixed effects regressions replicate the experimental benchmark, suggesting little benefit from ML approaches over standard program evaluation methods in that setting.\",\"PeriodicalId\":47114,\"journal\":{\"name\":\"Journal of the Association of Environmental and Resource Economists\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2023-02-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Association of Environmental and Resource Economists\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1086/724518\",\"RegionNum\":3,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Association of Environmental and Resource Economists","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1086/724518","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
RCTs against the Machine: Can Machine Learning Prediction Methods Recover Experimental Treatment Effects?
We investigate how successfully machine-learning (ML) prediction algorithms can be used to estimate causal treatment effects in electricity demand applications with nonexperimental data. We use three prediction algorithms—XGBoost, random forests, and LASSO—to generate counterfactuals using observational data. Using those counterfactuals, we estimate nonexperimental treatment effects and compare them to experimental treatment effects from a randomized experiment for electricity customers who faced critical-peak pricing and information treatments. Our results show that nonexperimental treatment effects based on each algorithm replicate the true treatment effects, even when only using data from treated households. Additionally, when using both treatment households and nonexperimental comparison households, standard two-way fixed effects regressions replicate the experimental benchmark, suggesting little benefit from ML approaches over standard program evaluation methods in that setting.