Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Isidoro J Casanova, Manuel Campos, Jose M Juarez, Antonio Gomariz, Bernardo Canovas-Segura, Marta Lorente-Ros, Jose A Lorente
{"title":"Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare.","authors":"Isidoro J Casanova, Manuel Campos, Jose M Juarez, Antonio Gomariz, Bernardo Canovas-Segura, Marta Lorente-Ros, Jose A Lorente","doi":"10.1186/s12911-024-02566-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pattern mining techniques are helpful tools when extracting new knowledge in real practice, but the overwhelming number of patterns is still a limiting factor in the health-care domain. Current efforts concerning the definition of measures of interest for patterns are focused on reducing the number of patterns and quantifying their relevance (utility/usefulness). However, although the temporal dimension plays a key role in medical records, few efforts have been made to extract temporal knowledge about the patient's evolution from multivariate sequential patterns.</p><p><strong>Methods: </strong>In this paper, we propose a method to extract a new type of patterns in the clinical domain called Jumping Diagnostic Odds Ratio Sequential Patterns (JDORSP). The aim of this method is to employ the odds ratio to identify a concise set of sequential patterns that represent a patient's state with a statistically significant protection factor (i.e., a pattern associated with patients that survive) and those extensions whose evolution suddenly changes the patient's clinical state, thus making the sequential patterns a statistically significant risk factor (i.e., a pattern associated with patients that do not survive), or vice versa.</p><p><strong>Results: </strong>The results of our experiments highlight that our method reduces the number of sequential patterns obtained with state-of-the-art pattern reduction methods by over 95%. Only by achieving this drastic reduction can medical experts carry out a comprehensive clinical evaluation of the patterns that might be considered medical knowledge regarding the temporal evolution of the patients. We have evaluated the surprisingness and relevance of the sequential patterns with clinicians, and the most interesting fact is the high surprisingness of the extensions of the patterns that become a protection factor, that is, the patients that recover after several days of being at high risk of dying.</p><p><strong>Conclusions: </strong>Our proposed method with which to extract JDORSP generates a set of interpretable multivariate sequential patterns with new knowledge regarding the temporal evolution of the patients. The number of patterns is greatly reduced when compared to those generated by other methods and measures of interest. An additional advantage of this method is that it does not require any parameters or thresholds, and that the reduced number of patterns allows a manual evaluation.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11170878/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-024-02566-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Pattern mining techniques are helpful tools when extracting new knowledge in real practice, but the overwhelming number of patterns is still a limiting factor in the health-care domain. Current efforts concerning the definition of measures of interest for patterns are focused on reducing the number of patterns and quantifying their relevance (utility/usefulness). However, although the temporal dimension plays a key role in medical records, few efforts have been made to extract temporal knowledge about the patient's evolution from multivariate sequential patterns.

Methods: In this paper, we propose a method to extract a new type of patterns in the clinical domain called Jumping Diagnostic Odds Ratio Sequential Patterns (JDORSP). The aim of this method is to employ the odds ratio to identify a concise set of sequential patterns that represent a patient's state with a statistically significant protection factor (i.e., a pattern associated with patients that survive) and those extensions whose evolution suddenly changes the patient's clinical state, thus making the sequential patterns a statistically significant risk factor (i.e., a pattern associated with patients that do not survive), or vice versa.

Results: The results of our experiments highlight that our method reduces the number of sequential patterns obtained with state-of-the-art pattern reduction methods by over 95%. Only by achieving this drastic reduction can medical experts carry out a comprehensive clinical evaluation of the patterns that might be considered medical knowledge regarding the temporal evolution of the patients. We have evaluated the surprisingness and relevance of the sequential patterns with clinicians, and the most interesting fact is the high surprisingness of the extensions of the patterns that become a protection factor, that is, the patients that recover after several days of being at high risk of dying.

Conclusions: Our proposed method with which to extract JDORSP generates a set of interpretable multivariate sequential patterns with new knowledge regarding the temporal evolution of the patients. The number of patterns is greatly reduced when compared to those generated by other methods and measures of interest. An additional advantage of this method is that it does not require any parameters or thresholds, and that the reduced number of patterns allows a manual evaluation.

利用几率比对医疗保健中的时间演化,发现令人惊讶的新型多变量序列模式。
背景:模式挖掘技术是在实际工作中提取新知识的有用工具,但在医疗保健领域,模式数量过多仍然是一个限制因素。目前,有关模式兴趣度量定义的工作主要集中在减少模式数量和量化其相关性(实用性/有用性)上。然而,尽管时间维度在医疗记录中起着关键作用,但从多变量序列模式中提取有关患者演变的时间知识的工作却少之又少:在本文中,我们提出了一种在临床领域提取新型模式的方法,称为跳跃诊断赔率序列模式(JDORSP)。该方法的目的是利用几率比来识别一组简明的序列模式,这些模式代表了具有统计意义的保护因素的患者状态(即与存活患者相关的模式),以及那些其演变会突然改变患者临床状态的扩展,从而使序列模式成为具有统计意义的风险因素(即与不存活患者相关的模式),反之亦然:实验结果表明,我们的方法将最先进的模式缩减方法获得的序列模式数量减少了 95% 以上。只有通过这种大幅减少的方法,医学专家才能对模式进行全面的临床评估,这些模式可被视为有关患者时间演变的医学知识。我们与临床医生一起评估了序列模式的出奇性和相关性,最有趣的事实是模式扩展的出奇性很高,这些模式成为了一个保护因素,也就是说,患者在面临死亡高风险数天后恢复了健康:我们提出的提取 JDORSP 的方法可以生成一组可解释的多变量序列模式,并提供有关患者时间演变的新知识。与其他方法和相关指标相比,该方法生成的模式数量大大减少。这种方法的另一个优点是,它不需要任何参数或阈值,而且由于模式数量减少,可以进行人工评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信