Decision tree-based data mining approach for the evaluation of survival in primary malignant bone tumors: A surveillance, epidemiology and end results database study.

IF 1.3 4区 医学 Q3 ORTHOPEDICS
Dilek Yapar, Aliekber Yapar, Mehmet Ali Tokgöz, Uğur Bilge
{"title":"Decision tree-based data mining approach for the evaluation of survival in primary malignant bone tumors: A surveillance, epidemiology and end results database study.","authors":"Dilek Yapar, Aliekber Yapar, Mehmet Ali Tokgöz, Uğur Bilge","doi":"10.1177/10225536231189780","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to conduct a large-scale population-based study to understand the epidemiological characteristics of Primary Malignant Bone Tumors (PMBTs) and determine the prognostic factors by concurrently using the classical statistical method and data mining methods.</p><p><strong>Methods: </strong>Patients included in this study were extracted from the National Cancer Institute's Surveillance, Epidemiology and End Results (SEER) database: \"Incidence-SEER Research Data, 18 Registries, Nov 2020 Sub\". Patients with unclassified and incomplete information were excluded. This search algorithm resulted in a dataset comprising 6234 cases. Survival analyses were performed with Kaplan-Meier curves and the Log-rank test. Multivariate Cox regression analysis determined the independent prognostic factors of PMBT. A decision tree-based data mining technique was used in this study to confirm the prognostic factors.</p><p><strong>Results: </strong>5-years survival rate was 63.6% and 10-years survival rate was 55.3% in the patients with PMBT. Sex, age, median household income, histology, primary site, grade, stage, metastasis, and the total number of malignant tumors were determined as independent risk factors associated with overall survival (OS) in the multivariate COX regression analysis. The prognostic factors resulting in five terminal nodes in the decision tree (DT) included stage, age, and grade. The stage was the most important determining factor for vital status. The terminal node with the shortest number of surviving patients included 801 (72.3%) deaths in 1102 patients with distant stage, and hazard ratio was calculated as 5.4 (95% CI: 4.9-5.9; <i>p</i> < .001). These patients had a median survival of only 17 months.</p><p><strong>Conclusions: </strong>Rules extracted from DTs provide information about risk factors in specific patient groups and can be used by clinicians making decisions on individual patients. We recommend using DTs in combination with COX regression analysis to determine risk factors and the effect of these factors on survival.</p>","PeriodicalId":48794,"journal":{"name":"Journal of Orthopaedic Surgery","volume":"31 2","pages":"10225536231189780"},"PeriodicalIF":1.3000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Orthopaedic Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/10225536231189780","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aimed to conduct a large-scale population-based study to understand the epidemiological characteristics of Primary Malignant Bone Tumors (PMBTs) and determine the prognostic factors by concurrently using the classical statistical method and data mining methods.

Methods: Patients included in this study were extracted from the National Cancer Institute's Surveillance, Epidemiology and End Results (SEER) database: "Incidence-SEER Research Data, 18 Registries, Nov 2020 Sub". Patients with unclassified and incomplete information were excluded. This search algorithm resulted in a dataset comprising 6234 cases. Survival analyses were performed with Kaplan-Meier curves and the Log-rank test. Multivariate Cox regression analysis determined the independent prognostic factors of PMBT. A decision tree-based data mining technique was used in this study to confirm the prognostic factors.

Results: 5-years survival rate was 63.6% and 10-years survival rate was 55.3% in the patients with PMBT. Sex, age, median household income, histology, primary site, grade, stage, metastasis, and the total number of malignant tumors were determined as independent risk factors associated with overall survival (OS) in the multivariate COX regression analysis. The prognostic factors resulting in five terminal nodes in the decision tree (DT) included stage, age, and grade. The stage was the most important determining factor for vital status. The terminal node with the shortest number of surviving patients included 801 (72.3%) deaths in 1102 patients with distant stage, and hazard ratio was calculated as 5.4 (95% CI: 4.9-5.9; p < .001). These patients had a median survival of only 17 months.

Conclusions: Rules extracted from DTs provide information about risk factors in specific patient groups and can be used by clinicians making decisions on individual patients. We recommend using DTs in combination with COX regression analysis to determine risk factors and the effect of these factors on survival.

基于决策树的数据挖掘方法评估原发性恶性骨肿瘤的生存率:一项监测、流行病学和最终结果数据库研究。
目的:本研究旨在进行一项大规模的基于人群的研究,以了解原发性恶性骨肿瘤(PMBT)的流行病学特征,并同时使用经典统计方法和数据挖掘方法来确定预后因素。方法:纳入本研究的患者从国家癌症研究所的监测、流行病学和最终结果(SEER)数据库中提取:“发病率-流行病学研究数据,18个注册中心,2020年11月子”。未分类和信息不完整的患者被排除在外。该搜索算法产生了包括6234个案例的数据集。生存率分析采用Kaplan-Meier曲线和对数秩检验。多因素Cox回归分析确定了PMBT的独立预后因素。本研究采用基于决策树的数据挖掘技术来确认预后因素。结果:PMBT患者5年生存率63.6%,10年生存率55.3%。在多变量COX回归分析中,性别、年龄、家庭收入中位数、组织学、原发部位、分级、分期、转移和恶性肿瘤总数被确定为与总生存率(OS)相关的独立风险因素。在决策树(DT)中导致五个终端节点的预后因素包括阶段、年龄和等级。阶段是决定生命状态的最重要因素。存活患者数最短的终结点包括1102名远处分期患者中的801人(72.3%)死亡,危险比计算为5.4(95%CI:4.9-5.9;p<.001)。这些患者的中位生存期仅为17个月。结论:从DTs中提取的规则提供了有关特定患者群体中风险因素的信息,临床医生可以使用这些规则对个别患者做出决策。我们建议使用DTs结合COX回归分析来确定风险因素以及这些因素对生存率的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Orthopaedic Surgery
Journal of Orthopaedic Surgery ORTHOPEDICS-SURGERY
CiteScore
3.10
自引率
0.00%
发文量
91
审稿时长
13 weeks
期刊介绍: Journal of Orthopaedic Surgery is an open access peer-reviewed journal publishing original reviews and research articles on all aspects of orthopaedic surgery. It is the official journal of the Asia Pacific Orthopaedic Association. The journal welcomes and will publish materials of a diverse nature, from basic science research to clinical trials and surgical techniques. The journal encourages contributions from all parts of the world, but special emphasis is given to research of particular relevance to the Asia Pacific region.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信