{"title":"Optimal Matching with Matching Priority","authors":"M. Cannas, E. Sironi","doi":"10.3390/analytics3010009","DOIUrl":null,"url":null,"abstract":"Matching algorithms are commonly used to build comparable subsets (matchings) in observational studies. When a complete matching is not possible, some units must necessarily be excluded from the final matching. This may bias the final estimates comparing the two populations, and thus it is important to reduce the number of drops to avoid unsatisfactory results. Greedy matching algorithms may not reach the maximum matching size, thus dropping more units than necessary. Optimal matching algorithms do ensure a maximum matching size, but they implicitly assume that all units have the same matching priority. In this paper, we propose a matching strategy which is order optimal in the sense that it finds a maximum matching size which is consistent with a given matching priority. The strategy is based on an order-optimal matching algorithm originally proposed in connection with assignment problems by D. Gale. When a matching priority is given, the algorithm ensures that the discarded units have the lowest possible matching priority. We discuss the algorithm’s complexity and its relation with classic optimal matching. We illustrate its use with a problem in a case study concerning a comparison of female and male executives and a simulation.","PeriodicalId":512104,"journal":{"name":"Analytics","volume":"5 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/analytics3010009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Matching algorithms are commonly used to build comparable subsets (matchings) in observational studies. When a complete matching is not possible, some units must necessarily be excluded from the final matching. This may bias the final estimates comparing the two populations, and thus it is important to reduce the number of drops to avoid unsatisfactory results. Greedy matching algorithms may not reach the maximum matching size, thus dropping more units than necessary. Optimal matching algorithms do ensure a maximum matching size, but they implicitly assume that all units have the same matching priority. In this paper, we propose a matching strategy which is order optimal in the sense that it finds a maximum matching size which is consistent with a given matching priority. The strategy is based on an order-optimal matching algorithm originally proposed in connection with assignment problems by D. Gale. When a matching priority is given, the algorithm ensures that the discarded units have the lowest possible matching priority. We discuss the algorithm’s complexity and its relation with classic optimal matching. We illustrate its use with a problem in a case study concerning a comparison of female and male executives and a simulation.
在观察性研究中,匹配算法通常用于建立可比子集(匹配)。当不可能进行完全匹配时,一些单位必须从最终匹配中剔除。这可能会使比较两个人群的最终估计值出现偏差,因此必须减少剔除的数量,以避免出现令人不满意的结果。贪婪匹配算法可能达不到最大匹配规模,因此会放弃比必要更多的单位。最优匹配算法确实能确保最大匹配规模,但它们隐含的假设是,所有单元具有相同的匹配优先级。在本文中,我们提出了一种阶次最优匹配策略,它能找到与给定匹配优先级一致的最大匹配规模。该策略基于 D. Gale 最初针对分配问题提出的阶次最优匹配算法。当给出匹配优先级时,该算法确保被丢弃的单元具有尽可能低的匹配优先级。我们讨论了该算法的复杂性及其与经典最优匹配的关系。我们用一个案例研究中的问题来说明该算法的使用,该案例研究涉及女性和男性高管的比较和模拟。