{"title":"Leveraging machine learning methods to estimate heterogeneous effects: father absence in China as an example","authors":"Ran Liu","doi":"10.1080/21620555.2021.1948828","DOIUrl":null,"url":null,"abstract":"Abstract Individuals differ in their personal and environmental characteristics, and the same treatment or condition may affect individuals in different ways or magnitudes. Heterogeneity in effects thus has important implications for academic research and policymaking. However, it is difficult to uncover and estimate heterogenous effects using conventional parametric models without making assumptions based on limited information, and results can be difficult to interpret when involving a large number of moderators. To address these limitations, this paper introduces three supervised machine learning methods for estimating heterogeneous treatment effects with experimental and observational data: causal forest, Bayesian Additive Regression Trees (BART), and an ensemble approach called X-learner. These methods are first applied to simulated datasets and then implemented using empirical education survey data from China to estimate heterogeneous effects of father absence on student cognitive ability across a series of individual and family characteristics.","PeriodicalId":51780,"journal":{"name":"Chinese Sociological Review","volume":"54 1","pages":"223 - 251"},"PeriodicalIF":2.2000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/21620555.2021.1948828","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Sociological Review","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/21620555.2021.1948828","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIOLOGY","Score":null,"Total":0}
引用次数: 1
Abstract
Abstract Individuals differ in their personal and environmental characteristics, and the same treatment or condition may affect individuals in different ways or magnitudes. Heterogeneity in effects thus has important implications for academic research and policymaking. However, it is difficult to uncover and estimate heterogenous effects using conventional parametric models without making assumptions based on limited information, and results can be difficult to interpret when involving a large number of moderators. To address these limitations, this paper introduces three supervised machine learning methods for estimating heterogeneous treatment effects with experimental and observational data: causal forest, Bayesian Additive Regression Trees (BART), and an ensemble approach called X-learner. These methods are first applied to simulated datasets and then implemented using empirical education survey data from China to estimate heterogeneous effects of father absence on student cognitive ability across a series of individual and family characteristics.