Input System for a GPT Model Simulating Doctor-Patient Interactions During Medical Consultation.

Jonathan Kambire, Seydou Golo Barro, Pascal Staccini
{"title":"Input System for a GPT Model Simulating Doctor-Patient Interactions During Medical Consultation.","authors":"Jonathan Kambire, Seydou Golo Barro, Pascal Staccini","doi":"10.3233/SHTI251562","DOIUrl":null,"url":null,"abstract":"<p><p>The introduction of the Licence-Master-Doctorate (LMD) system in African higher education has significantly reshaped university organization, particularly in health-related fields, by exacerbating structural challenges such as the shortage of faculty and inadequate infrastructure. In this context, the present work aims to construct a structured dialogical corpus designed for the training of a customized GPT-2 model, with the goal of simulating medical consultations and supporting the training of medical students. The methodology combines the use of reliable medical sources, the controlled generation of dialogues using existing artificial intelligence systems, and role-playing exercises involving medical students, with detailed annotation of clinical, emotional, and behavioral metadata. The final corpus comprises over 36 million tokens for pre-training and more than 8,326 simulated dialogues for fine-tuning, covering the most prevalent pathologies in Burkina Faso. This multilingual and culturally contextualized approach represents a significant departure from dominant Western corpora, laying the groundwork for a medical conversational model adapted to African realities. While the model is still in training, the complete results will be presented at a later stage. Nevertheless, the collected data already constitute a valuable resource for the development of realistic, diverse, and reusable educational simulators across various medical training contexts.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"360-364"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251562","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The introduction of the Licence-Master-Doctorate (LMD) system in African higher education has significantly reshaped university organization, particularly in health-related fields, by exacerbating structural challenges such as the shortage of faculty and inadequate infrastructure. In this context, the present work aims to construct a structured dialogical corpus designed for the training of a customized GPT-2 model, with the goal of simulating medical consultations and supporting the training of medical students. The methodology combines the use of reliable medical sources, the controlled generation of dialogues using existing artificial intelligence systems, and role-playing exercises involving medical students, with detailed annotation of clinical, emotional, and behavioral metadata. The final corpus comprises over 36 million tokens for pre-training and more than 8,326 simulated dialogues for fine-tuning, covering the most prevalent pathologies in Burkina Faso. This multilingual and culturally contextualized approach represents a significant departure from dominant Western corpora, laying the groundwork for a medical conversational model adapted to African realities. While the model is still in training, the complete results will be presented at a later stage. Nevertheless, the collected data already constitute a valuable resource for the development of realistic, diverse, and reusable educational simulators across various medical training contexts.

模拟医疗咨询过程中医患互动的GPT模型输入系统。
在非洲高等教育中采用执照-硕士-博士(LMD)制度,加剧了师资短缺和基础设施不足等结构性挑战,从而大大改变了大学组织,特别是在卫生领域。在此背景下,本研究旨在构建一个结构化对话语料库,用于定制GPT-2模型的培训,目的是模拟医疗咨询并支持医学生的培训。该方法结合了可靠医学资源的使用,使用现有人工智能系统控制对话的生成,以及涉及医学生的角色扮演练习,以及临床、情感和行为元数据的详细注释。最终的语料库包括超过3600万个用于预训练的代币和超过8326个用于微调的模拟对话,涵盖了布基纳法索最普遍的病症。这种多语言和文化语境化的方法代表了对占主导地位的西方语料库的重大背离,为适应非洲现实的医学对话模式奠定了基础。虽然该模型仍在训练中,但完整的结果将在稍后阶段呈现。尽管如此,收集到的数据已经构成了在各种医学培训背景下开发现实的、多样化的和可重复使用的教育模拟器的宝贵资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信