临床数据挖掘:问题、陷阱和解决方案

Emanuel Weitschek, G. Felici, P. Bertolazzi
{"title":"临床数据挖掘:问题、陷阱和解决方案","authors":"Emanuel Weitschek, G. Felici, P. Bertolazzi","doi":"10.1109/DEXA.2013.42","DOIUrl":null,"url":null,"abstract":"The wide spread of electronic data collection in medical environments leads to an exponential growth of clinical data extracted from heterogeneous patient samples. Collecting, managing, integrating and analyzing these data are essential activities in order to shed light on diseases and on related therapies. The major issues in clinical data analysis are the incompleteness (missing values), the different adopted measure scales, the integration of the disparate collection procedures. Therefore, the main challenges are in managing clinical data, in discovering patients interactions, and in integrating the different data sources. The final goal is to extract relevant information from huge amounts of clinical data. Therefore, the analysis of clinical data requires new effective and efficient methods to extract compact and relevant information: the interdisciplinary field of data mining, which guides the automated knowledge discovery process, is a natural way to approach the complex task of clinical data analysis. Data mining deals with structured and unstructured data, that are, respectively, data for which we can give a model or not. For example, in clinical contexts it is important to highlight those trials (variables) that are frequent in a particular disease diagnosis. The objective of this work is to study and apply methods to manage and retrieve relevant information in clinical data sets. A practical analysis from real patient data collected from several dementia clinical departments in Italy is reported as example of clinical data mining. The particular field of logic classification, where a data model is computed in form of propositional logic formulas, is investigated for clinical data mining and compared to other techniques, showing that it is a successful approach to compute a compact data model for clinical knowledge discovery.","PeriodicalId":428515,"journal":{"name":"2013 24th International Workshop on Database and Expert Systems Applications","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Clinical Data Mining: Problems, Pitfalls and Solutions\",\"authors\":\"Emanuel Weitschek, G. Felici, P. Bertolazzi\",\"doi\":\"10.1109/DEXA.2013.42\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The wide spread of electronic data collection in medical environments leads to an exponential growth of clinical data extracted from heterogeneous patient samples. Collecting, managing, integrating and analyzing these data are essential activities in order to shed light on diseases and on related therapies. The major issues in clinical data analysis are the incompleteness (missing values), the different adopted measure scales, the integration of the disparate collection procedures. Therefore, the main challenges are in managing clinical data, in discovering patients interactions, and in integrating the different data sources. The final goal is to extract relevant information from huge amounts of clinical data. Therefore, the analysis of clinical data requires new effective and efficient methods to extract compact and relevant information: the interdisciplinary field of data mining, which guides the automated knowledge discovery process, is a natural way to approach the complex task of clinical data analysis. Data mining deals with structured and unstructured data, that are, respectively, data for which we can give a model or not. For example, in clinical contexts it is important to highlight those trials (variables) that are frequent in a particular disease diagnosis. The objective of this work is to study and apply methods to manage and retrieve relevant information in clinical data sets. A practical analysis from real patient data collected from several dementia clinical departments in Italy is reported as example of clinical data mining. The particular field of logic classification, where a data model is computed in form of propositional logic formulas, is investigated for clinical data mining and compared to other techniques, showing that it is a successful approach to compute a compact data model for clinical knowledge discovery.\",\"PeriodicalId\":428515,\"journal\":{\"name\":\"2013 24th International Workshop on Database and Expert Systems Applications\",\"volume\":\"79 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 24th International Workshop on Database and Expert Systems Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DEXA.2013.42\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 24th International Workshop on Database and Expert Systems Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.2013.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25

摘要

医疗环境中电子数据收集的广泛传播导致从异质患者样本中提取的临床数据呈指数增长。收集、管理、整合和分析这些数据是阐明疾病和相关疗法的基本活动。临床数据分析的主要问题是数据不完整(缺失值)、采用不同的测量量表、不同收集程序的整合。因此,主要的挑战在于管理临床数据,发现患者之间的相互作用,以及整合不同的数据源。最终目标是从海量的临床数据中提取相关信息。因此,临床数据分析需要新的有效和高效的方法来提取紧凑和相关的信息:跨学科的数据挖掘领域,指导自动化的知识发现过程,是解决临床数据分析复杂任务的自然途径。数据挖掘处理结构化数据和非结构化数据,这两种数据分别是我们可以为其提供模型或不提供模型的数据。例如,在临床环境中,强调在特定疾病诊断中频繁出现的试验(变量)是很重要的。这项工作的目的是研究和应用方法来管理和检索临床数据集中的相关信息。从意大利几个痴呆症临床科室收集的真实患者数据进行实际分析,作为临床数据挖掘的例子。在逻辑分类的特定领域中,数据模型以命题逻辑公式的形式计算,研究了临床数据挖掘的数据模型,并与其他技术进行了比较,表明它是计算临床知识发现的紧凑数据模型的成功方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Clinical Data Mining: Problems, Pitfalls and Solutions
The wide spread of electronic data collection in medical environments leads to an exponential growth of clinical data extracted from heterogeneous patient samples. Collecting, managing, integrating and analyzing these data are essential activities in order to shed light on diseases and on related therapies. The major issues in clinical data analysis are the incompleteness (missing values), the different adopted measure scales, the integration of the disparate collection procedures. Therefore, the main challenges are in managing clinical data, in discovering patients interactions, and in integrating the different data sources. The final goal is to extract relevant information from huge amounts of clinical data. Therefore, the analysis of clinical data requires new effective and efficient methods to extract compact and relevant information: the interdisciplinary field of data mining, which guides the automated knowledge discovery process, is a natural way to approach the complex task of clinical data analysis. Data mining deals with structured and unstructured data, that are, respectively, data for which we can give a model or not. For example, in clinical contexts it is important to highlight those trials (variables) that are frequent in a particular disease diagnosis. The objective of this work is to study and apply methods to manage and retrieve relevant information in clinical data sets. A practical analysis from real patient data collected from several dementia clinical departments in Italy is reported as example of clinical data mining. The particular field of logic classification, where a data model is computed in form of propositional logic formulas, is investigated for clinical data mining and compared to other techniques, showing that it is a successful approach to compute a compact data model for clinical knowledge discovery.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信