Symptom Recognition in Medical Conversations Via multi- Instance Learning and Prompt.

IF 5.7 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

Journal of Medical Systems Pub Date : 2025-08-20 DOI:10.1007/s10916-025-02240-w

Hua Wang, Xue-Feng Bai, Xiu-Tao Cui, Gang Chen, Guo-Ming Fan, Guo-Lian Wei, Ye-Ping Zheng, Jing-Jing Wu, Sheng-Sheng Gao

{"title":"Symptom Recognition in Medical Conversations Via multi- Instance Learning and Prompt.","authors":"Hua Wang, Xue-Feng Bai, Xiu-Tao Cui, Gang Chen, Guo-Ming Fan, Guo-Lian Wei, Ye-Ping Zheng, Jing-Jing Wu, Sheng-Sheng Gao","doi":"10.1007/s10916-025-02240-w","DOIUrl":null,"url":null,"abstract":"<p><p>With the widespread adoption of electronic health record (EHR) systems, there is a crucial need for automatic extraction of key symptom information from medical dialogue to support intelligent medical record generation. However, symptom recognition in such dialogues remains challenging because (a) symptom clues are scattered across multi-turn, unstructured conversations, (b) patient descriptions are often informal and deviate from standardized terminology, and (c) many symptom statements are ambiguous or negated, making them difficult for conventional models to interpret. To address these challenges, we propose a novel symptom identification approach that combines multi-instance learning (MIL) with prompt-guided attention for fine-grained symptom identification. In our framework, each conversation is treated as a bag of utterances. A MIL-based model aggregates information across utterances to improve recall and pinpoints which specific utterances mention each symptom, thus enabling sentence-level symptom recognition. Concurrently, a prompt-guided attention strategy leverages standardized symptom terminology as prior knowledge to guide the model in recognizing synonyms, implicit symptom mentions, and negations, thereby improving precision. We further employ R-Drop regularization to enhance robustness against noisy inputs. Experiments on public medical-dialogue datasets demonstrate that our method significantly outperforms existing techniques, achieving an 85.93% F1-score (with 85.09% precision and 86.83% recall) - about 8% points higher than a strong multi-label classification baseline. Notably, our model accurately identifies the specific utterances corresponding to each symptom mention (symptom-utterance pairs), highlighting its fine-grained extraction capability. Ablation studies confirm that the MIL component boosts recall, while the prompt-guided attention component reduces false positives. By precisely locating symptom information within conversations, our approach effectively tackles the issues of dispersed data and inconsistent expressions. This fine-grained symptom documentation capability represents a promising advancement for automated medical information extraction, more intelligent EHR systems, and diagnostic decision support.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"107"},"PeriodicalIF":5.7000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02240-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

With the widespread adoption of electronic health record (EHR) systems, there is a crucial need for automatic extraction of key symptom information from medical dialogue to support intelligent medical record generation. However, symptom recognition in such dialogues remains challenging because (a) symptom clues are scattered across multi-turn, unstructured conversations, (b) patient descriptions are often informal and deviate from standardized terminology, and (c) many symptom statements are ambiguous or negated, making them difficult for conventional models to interpret. To address these challenges, we propose a novel symptom identification approach that combines multi-instance learning (MIL) with prompt-guided attention for fine-grained symptom identification. In our framework, each conversation is treated as a bag of utterances. A MIL-based model aggregates information across utterances to improve recall and pinpoints which specific utterances mention each symptom, thus enabling sentence-level symptom recognition. Concurrently, a prompt-guided attention strategy leverages standardized symptom terminology as prior knowledge to guide the model in recognizing synonyms, implicit symptom mentions, and negations, thereby improving precision. We further employ R-Drop regularization to enhance robustness against noisy inputs. Experiments on public medical-dialogue datasets demonstrate that our method significantly outperforms existing techniques, achieving an 85.93% F1-score (with 85.09% precision and 86.83% recall) - about 8% points higher than a strong multi-label classification baseline. Notably, our model accurately identifies the specific utterances corresponding to each symptom mention (symptom-utterance pairs), highlighting its fine-grained extraction capability. Ablation studies confirm that the MIL component boosts recall, while the prompt-guided attention component reduces false positives. By precisely locating symptom information within conversations, our approach effectively tackles the issues of dispersed data and inconsistent expressions. This fine-grained symptom documentation capability represents a promising advancement for automated medical information extraction, more intelligent EHR systems, and diagnostic decision support.

查看原文本刊更多论文

基于多实例学习和提示的医学会话症状识别。

随着电子病历（EHR）系统的广泛采用，迫切需要从医疗对话中自动提取关键症状信息，以支持智能病历的生成。然而，在这样的对话中，症状识别仍然具有挑战性，因为(a)症状线索分散在多回合、非结构化的对话中，(b)患者描述通常是非正式的，偏离了标准化术语，(c)许多症状陈述含糊不清或被否定，使得传统模型难以解释。为了解决这些挑战，我们提出了一种新的症状识别方法，该方法将多实例学习（MIL）与快速引导注意力相结合，用于细粒度症状识别。在我们的框架中，每一次对话都被视为一个话语包。基于mil的模型聚合了话语之间的信息，以提高召回率，并确定哪些特定的话语提到了每个症状，从而实现句子级的症状识别。同时，提示引导注意策略利用标准化症状术语作为先验知识来指导模型识别同义词、隐式症状提及和否定，从而提高准确性。我们进一步使用R-Drop正则化来增强对噪声输入的鲁棒性。在公共医疗对话数据集上的实验表明，我们的方法显著优于现有的技术，达到了85.93%的f1得分（准确率85.09%，召回率86.83%），比强多标签分类基线高出约8%。值得注意的是，我们的模型准确地识别了对应于每个症状提及的特定话语（症状-话语对），突出了其细粒度提取能力。消融研究证实MIL成分促进回忆，而提示引导注意成分减少误报。通过精确定位对话中的症状信息，我们的方法有效地解决了数据分散和表达不一致的问题。这种细粒度的症状记录功能代表了自动化医疗信息提取、更智能的EHR系统和诊断决策支持的一个有希望的进步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Systems 医学-卫生保健

CiteScore

11.60

自引率

1.90%

发文量

审稿时长

4.8 months

期刊介绍： Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.