Improving information fusion on multimodal clinical data in classification settings

International Workshop on Health Text Mining and Information Analysis Pub Date : 1900-01-01 DOI:10.18653/v1/2022.louhi-1.18

Sneha Jha, Erik Mayer, Mauricio Barahona

{"title":"Improving information fusion on multimodal clinical data in classification settings","authors":"Sneha Jha, Erik Mayer, Mauricio Barahona","doi":"10.18653/v1/2022.louhi-1.18","DOIUrl":null,"url":null,"abstract":"Clinical data often exists in different forms across the lifetime of a patient’s interaction with the healthcare system - structured, unstructured or semi-structured data in the form of laboratory readings, clinical notes, diagnostic codes, imaging and audio data of various kinds, and other observational data. Formulating a representation model that aggregates information from these heterogeneous sources may allow us to jointly model on data with more predictive signal than noise and help inform our model with useful constraints learned from better data. Multimodal fusion approaches help produce representations combined from heterogeneous modalities, which can be used for clinical prediction tasks. Representations produced through different fusion techniques require different training strategies. We investigate the advantage of adding narrative clinical text to structured modalities to classification tasks in the clinical domain. We show that while there is a competitive advantage in combined representations of clinical data, the approach can be helped by training guidance customized to each modality. We show empirical results across binary/multiclass settings, single/multitask settings and unified/multimodal learning rate settings for early and late information fusion of clinical data.","PeriodicalId":448872,"journal":{"name":"International Workshop on Health Text Mining and Information Analysis","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Health Text Mining and Information Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2022.louhi-1.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Clinical data often exists in different forms across the lifetime of a patient’s interaction with the healthcare system - structured, unstructured or semi-structured data in the form of laboratory readings, clinical notes, diagnostic codes, imaging and audio data of various kinds, and other observational data. Formulating a representation model that aggregates information from these heterogeneous sources may allow us to jointly model on data with more predictive signal than noise and help inform our model with useful constraints learned from better data. Multimodal fusion approaches help produce representations combined from heterogeneous modalities, which can be used for clinical prediction tasks. Representations produced through different fusion techniques require different training strategies. We investigate the advantage of adding narrative clinical text to structured modalities to classification tasks in the clinical domain. We show that while there is a competitive advantage in combined representations of clinical data, the approach can be helped by training guidance customized to each modality. We show empirical results across binary/multiclass settings, single/multitask settings and unified/multimodal learning rate settings for early and late information fusion of clinical data.

查看原文本刊更多论文

改进多模式临床数据在分类设置中的信息融合

在患者与医疗保健系统交互的整个生命周期中，临床数据通常以不同的形式存在——结构化、非结构化或半结构化数据，包括实验室读数、临床记录、诊断代码、各种成像和音频数据以及其他观察数据。制定一个聚合来自这些异构源的信息的表示模型，可以让我们在具有更多预测信号(而不是噪声)的数据上联合建模，并帮助我们利用从更好的数据中学到的有用约束告知我们的模型。多模态融合方法有助于从异构模态组合产生表征，可用于临床预测任务。通过不同的融合技术产生的表示需要不同的训练策略。我们调查的优势，增加叙事临床文本结构化模式的分类任务在临床领域。我们表明，虽然临床数据的组合表示具有竞争优势，但该方法可以通过针对每种模式定制的培训指导来帮助。我们展示了临床数据早期和晚期信息融合的二元/多类别设置、单一/多任务设置和统一/多模式学习率设置的实证结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Workshop on Health Text Mining and Information Analysis

自引率

0.00%

发文量