Comparison of Nearest Neighbor and Caliper Algorithms in Outcome Propensity Score Matching to Study the Relationship between Type 2 Diabetes and Coronary Artery Disease

Q4 Medicine

Journal of Biostatistics and Epidemiology Pub Date : 2021-10-03 DOI:10.18502/jbe.v7i3.7297

Sara Sabbaghian Tousi, H. Tabesh, A. Saki, A. Tagipour, M. Tajfard

{"title":"Comparison of Nearest Neighbor and Caliper Algorithms in Outcome Propensity Score Matching to Study the Relationship between Type 2 Diabetes and Coronary Artery Disease","authors":"Sara Sabbaghian Tousi, H. Tabesh, A. Saki, A. Tagipour, M. Tajfard","doi":"10.18502/jbe.v7i3.7297","DOIUrl":null,"url":null,"abstract":"Introduction: Propensity score matching (PSM) is a method to reduce the impact of essential and confounders. When the number of confounders is high, there may be a problem of matching, in which, finding matched pairs for the case group is difficult, or impossible. The propensity score (PS) minimizes the effect of the confounders, and it is reduced to one dimension. There are various algorithms in the field of PSM. This study aimed to compared the nearest neighbor and caliper algorithms. \nMethods: Data obtained in this study were from patients undergoing angiography at Ghaem Hospital in Mashhad, between 2011-12. The study was a retrospective case-control using PSM. In total, 604 patients were included in the case and control groups. A logistic regression model was used to calculate the propensity score and adjust the variables, such as age, gender, Body Mass Index (BMI), systolic blood pressure, smoking status, and triglyceride. Then, the Odds Ratios (ORs) with 95% Confidence Intervals (CIs) for the raw data and two matching algorithms were determined to examine the relationship between type 2 diabetes and coronary artery disease (CAD). \nResults: Propensity score in the nearest neighbor and caliper algorithms matched the total number of 604 samples, 200 and 178 pairs, respectively. All variables were significantly different between the two groups before matching (P<0.05). The gender was significantly different between the two groups after matching using the nearest neighbor algorithm (P=0.002). No variables created a significant difference between the two groups after matching with the caliper algorithm. \nConclusion: Bias reduction in the caliper algorithm was greater than for the nearest neighbor algorithm for all variables except the triglyceride variable.","PeriodicalId":34310,"journal":{"name":"Journal of Biostatistics and Epidemiology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biostatistics and Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18502/jbe.v7i3.7297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 1

Abstract

Introduction: Propensity score matching (PSM) is a method to reduce the impact of essential and confounders. When the number of confounders is high, there may be a problem of matching, in which, finding matched pairs for the case group is difficult, or impossible. The propensity score (PS) minimizes the effect of the confounders, and it is reduced to one dimension. There are various algorithms in the field of PSM. This study aimed to compared the nearest neighbor and caliper algorithms. Methods: Data obtained in this study were from patients undergoing angiography at Ghaem Hospital in Mashhad, between 2011-12. The study was a retrospective case-control using PSM. In total, 604 patients were included in the case and control groups. A logistic regression model was used to calculate the propensity score and adjust the variables, such as age, gender, Body Mass Index (BMI), systolic blood pressure, smoking status, and triglyceride. Then, the Odds Ratios (ORs) with 95% Confidence Intervals (CIs) for the raw data and two matching algorithms were determined to examine the relationship between type 2 diabetes and coronary artery disease (CAD). Results: Propensity score in the nearest neighbor and caliper algorithms matched the total number of 604 samples, 200 and 178 pairs, respectively. All variables were significantly different between the two groups before matching (P<0.05). The gender was significantly different between the two groups after matching using the nearest neighbor algorithm (P=0.002). No variables created a significant difference between the two groups after matching with the caliper algorithm. Conclusion: Bias reduction in the caliper algorithm was greater than for the nearest neighbor algorithm for all variables except the triglyceride variable.

查看原文本刊更多论文

最近邻居算法和卡尺算法在结果倾向评分匹配中的比较研究2型糖尿病与冠状动脉疾病的关系

倾向评分匹配(PSM)是一种减少基本因素和混杂因素影响的方法。当混杂因素的数量很高时，可能会出现匹配问题，在这种情况下，为病例组找到匹配的配对是困难的，或者是不可能的。倾向得分(PS)最小化混杂因素的影响，并将其降至一维。在永磁同步领域有各种各样的算法。本研究旨在比较最近邻算法和卡尺算法。方法:本研究获得的数据来自2011- 2012年间在马什哈德Ghaem医院接受血管造影的患者。本研究采用PSM进行回顾性病例对照。病例组和对照组共604例患者。采用logistic回归模型计算倾向得分，并调整年龄、性别、体重指数(BMI)、收缩压、吸烟状况和甘油三酯等变量。然后，确定原始数据的95%置信区间的比值比(ORs)和两种匹配算法，以检查2型糖尿病与冠状动脉疾病(CAD)之间的关系。结果:最近邻和卡尺算法的倾向得分匹配的样本总数分别为604对、200对和178对。配对前两组间各项指标差异均有统计学意义(P<0.05)。采用最近邻算法匹配后，两组性别差异显著(P=0.002)。在与卡尺算法匹配后，两组之间没有变量产生显著差异。结论:除甘油三酯变量外，卡尺算法在所有变量上的偏倚减小都大于最近邻算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊