A Simulation Study Comparing Tree-Based Methods in Identifying Interactions of Continuous and Binary Variables for Prediction of Increased Risk of Disease

Sybil Prince Nelson
{"title":"A Simulation Study Comparing Tree-Based Methods in Identifying Interactions of Continuous and Binary Variables for Prediction of Increased Risk of Disease","authors":"Sybil Prince Nelson","doi":"10.37256/bsr.1120232148","DOIUrl":null,"url":null,"abstract":"Tree-based methods are commonly used to create models that predict an output based on several input variables. Classification and Regression Trees (CARTs) is a popular algorithm that builds tree-like graphs for predicting continuous and categorical dependent variables, but it has been shown to be biased toward the inclusion of continuous variables. Conditional inference is a technique used to alleviate this bias. C.Logic is an alternative tree-based method that uses Boolean logic to create classification trees. Previous research has shown that C.Logic is superior to CART in identifying interactions that lead to an increased risk of disease. No comparison has been made between the C.Logic package and CART with conditional inference as found in a package called Party. In this paper, a simulation study is used to compare the capability of these two algorithms to identify interactions between continuous and binary variables. It is found that while both methods succeed in identifying correct interactions, C.Logic is more effective. The C.Logic algorithm does a better job of alleviating the bias toward continuous variables when attempting to identify interacting variables that lead to an increased risk of disease.","PeriodicalId":298847,"journal":{"name":"Biostatistics Research","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biostatistics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37256/bsr.1120232148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Tree-based methods are commonly used to create models that predict an output based on several input variables. Classification and Regression Trees (CARTs) is a popular algorithm that builds tree-like graphs for predicting continuous and categorical dependent variables, but it has been shown to be biased toward the inclusion of continuous variables. Conditional inference is a technique used to alleviate this bias. C.Logic is an alternative tree-based method that uses Boolean logic to create classification trees. Previous research has shown that C.Logic is superior to CART in identifying interactions that lead to an increased risk of disease. No comparison has been made between the C.Logic package and CART with conditional inference as found in a package called Party. In this paper, a simulation study is used to compare the capability of these two algorithms to identify interactions between continuous and binary variables. It is found that while both methods succeed in identifying correct interactions, C.Logic is more effective. The C.Logic algorithm does a better job of alleviating the bias toward continuous variables when attempting to identify interacting variables that lead to an increased risk of disease.
比较基于树的方法在识别预测疾病风险增加的连续变量和二元变量的相互作用中的模拟研究
基于树的方法通常用于创建基于几个输入变量预测输出的模型。分类和回归树(cart)是一种流行的算法,它构建树状图来预测连续和分类因变量,但它已被证明偏向于包含连续变量。条件推理是一种用来减轻这种偏见的技术。C.Logic是另一种基于树的方法,它使用布尔逻辑创建分类树。先前的研究表明,C.Logic在识别导致疾病风险增加的相互作用方面优于CART。C.Logic软件包与CART之间没有进行比较,因为在一个名为Party的软件包中发现了条件推理。本文通过仿真研究比较了这两种算法识别连续变量和二元变量之间相互作用的能力。研究发现,虽然两种方法都能成功地识别出正确的相互作用,但C.Logic更有效。当试图识别导致疾病风险增加的相互作用变量时,C.Logic算法在减轻对连续变量的偏见方面做得更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信