Ariana Genovese, Sahar Borna, Cesar A Gomez-Cabello, Syed Ali Haider, Srinivasagam Prabha, Antonio J Forte, Benjamin R Veenstra
{"title":"Artificial intelligence in clinical settings: a systematic review of its role in language translation and interpretation.","authors":"Ariana Genovese, Sahar Borna, Cesar A Gomez-Cabello, Syed Ali Haider, Srinivasagam Prabha, Antonio J Forte, Benjamin R Veenstra","doi":"10.21037/atm-24-162","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Addressing language barriers through accurate interpretation is crucial for providing quality care and establishing trust. While the ability of artificial intelligence (AI) to translate medical documentation has been studied, its role for patient-provider communication is less explored. This review evaluates AI's effectiveness in clinical translation by assessing accuracy, usability, satisfaction, and feedback on its use.</p><p><strong>Methods: </strong>A systematic search was conducted on July 11, 2024, across Cumulated Index in Nursing and Allied Health Literature (CINAHL), Institute of Electrical and Electronics Engineers (IEEE) Xplore, PubMed, Scopus, Web of Science, and Google Scholar. Inclusion criteria required AI to translate clinical information for a real or theoretical consultation. Exclusion criteria included reviews, correspondence, educational materials, non-peer-reviewed or retracted reports, non-English translations, pre-2016 publications, and reports on sign language or patient education. Search strings representing AI, language interpretation, and healthcare were used. Two investigators independently conducted the screening, extraction, synthesis of results, and bias assessments using Risk Of Bias In Non-randomized Studies - of Interventions (ROBINS-I), Mixed Methods Appraisal Tool (MMAT), and the Joanna Briggs Institute (JBI) Critical Appraisal Checklist for Qualitative Research. A third investigator resolved conflicts.</p><p><strong>Results: </strong>Of 1,095 reports, 9 studies were analyzed, evaluating AI translation platforms Google Translate, Microsoft Translator, Apple iTranslate, AwezaMed, Pocketalk W, and the Asynchronous Telepsychiatry (ATP) App. Investigations occurred in the US, France, Switzerland, and South Africa, with publications from 2019-2024. AI medical translation shows promise, typically providing accurate translations for brief communications in limited languages, though human translation is often necessary. Accuracy scores ranged from 83-97.8% when translating from English, and 36-76% when translating to English. Usability scores were 76.7-96.7%. Patients were more satisfied than clinicians, with 84-96.6% and 53.8-86.7% satisfied, respectively. Clinicians were hesitant to use AI due to questions of respect, quality, reliability, and misunderstanding. AI is being used as a last-resort option, to assist fluent, non-certified providers and lay interpreters, and for brief communications.</p><p><strong>Conclusions: </strong>Limitations include few languages tested, unidirectional translation, simulation, and evolving translation tools. AI shows promise in clinical translation, but the complexity of medical consultations requires a balanced approach combining AI and human translation services for quality care.</p>","PeriodicalId":8216,"journal":{"name":"Annals of translational medicine","volume":"12 6","pages":"117"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11729812/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of translational medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/atm-24-162","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/17 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Addressing language barriers through accurate interpretation is crucial for providing quality care and establishing trust. While the ability of artificial intelligence (AI) to translate medical documentation has been studied, its role for patient-provider communication is less explored. This review evaluates AI's effectiveness in clinical translation by assessing accuracy, usability, satisfaction, and feedback on its use.
Methods: A systematic search was conducted on July 11, 2024, across Cumulated Index in Nursing and Allied Health Literature (CINAHL), Institute of Electrical and Electronics Engineers (IEEE) Xplore, PubMed, Scopus, Web of Science, and Google Scholar. Inclusion criteria required AI to translate clinical information for a real or theoretical consultation. Exclusion criteria included reviews, correspondence, educational materials, non-peer-reviewed or retracted reports, non-English translations, pre-2016 publications, and reports on sign language or patient education. Search strings representing AI, language interpretation, and healthcare were used. Two investigators independently conducted the screening, extraction, synthesis of results, and bias assessments using Risk Of Bias In Non-randomized Studies - of Interventions (ROBINS-I), Mixed Methods Appraisal Tool (MMAT), and the Joanna Briggs Institute (JBI) Critical Appraisal Checklist for Qualitative Research. A third investigator resolved conflicts.
Results: Of 1,095 reports, 9 studies were analyzed, evaluating AI translation platforms Google Translate, Microsoft Translator, Apple iTranslate, AwezaMed, Pocketalk W, and the Asynchronous Telepsychiatry (ATP) App. Investigations occurred in the US, France, Switzerland, and South Africa, with publications from 2019-2024. AI medical translation shows promise, typically providing accurate translations for brief communications in limited languages, though human translation is often necessary. Accuracy scores ranged from 83-97.8% when translating from English, and 36-76% when translating to English. Usability scores were 76.7-96.7%. Patients were more satisfied than clinicians, with 84-96.6% and 53.8-86.7% satisfied, respectively. Clinicians were hesitant to use AI due to questions of respect, quality, reliability, and misunderstanding. AI is being used as a last-resort option, to assist fluent, non-certified providers and lay interpreters, and for brief communications.
Conclusions: Limitations include few languages tested, unidirectional translation, simulation, and evolving translation tools. AI shows promise in clinical translation, but the complexity of medical consultations requires a balanced approach combining AI and human translation services for quality care.
期刊介绍:
The Annals of Translational Medicine (Ann Transl Med; ATM; Print ISSN 2305-5839; Online ISSN 2305-5847) is an international, peer-reviewed Open Access journal featuring original and observational investigations in the broad fields of laboratory, clinical, and public health research, aiming to provide practical up-to-date information in significant research from all subspecialties of medicine and to broaden the readers’ vision and horizon from bench to bed and bed to bench. It is published quarterly (April 2013- Dec. 2013), monthly (Jan. 2014 - Feb. 2015), biweekly (March 2015-) and openly distributed worldwide. Annals of Translational Medicine is indexed in PubMed in Sept 2014 and in SCIE in 2018. Specific areas of interest include, but not limited to, multimodality therapy, epidemiology, biomarkers, imaging, biology, pathology, and technical advances related to medicine. Submissions describing preclinical research with potential for application to human disease, and studies describing research obtained from preliminary human experimentation with potential to further the understanding of biological mechanism underlying disease are encouraged. Also warmly welcome are studies describing public health research pertinent to clinic, disease diagnosis and prevention, or healthcare policy. With a focus on interdisciplinary academic cooperation, ATM aims to expedite the translation of scientific discovery into new or improved standards of management and health outcomes practice.