{"title":"Trust and accuracy in AI: Optometrists favor multimodal AI systems over unimodal for glaucoma diagnosis in collaborative environment","authors":"Faisal Ghaffar , Yousuf Zia Islam , Nadine Furtado , Catherine Burns","doi":"10.1016/j.compbiomed.2025.111132","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>User trust and decision accuracy are crucial for the successful collaboration of humans and Artificial Intelligence (AI) systems, especially in clinical settings such as glaucoma diagnosis. Both trust and accuracy are influenced by the data modality used by AI systems, which directly impacts the effectiveness of human-AI collaboration.</div></div><div><h3>Objective:</h3><div>The objective of this study is to discover the modality of an AI system that aligns best with an optometrist’s mental model. This was achieved by comparing trust levels between unimodal and multimodal AI systems used for glaucoma diagnosis. Additionally, we explore the impact of modality on various targets of user trust and user performance.</div></div><div><h3>Methods:</h3><div>We conducted a within-subject study with 20 optometrists, who were presented with both unimodal and multimodal AI mock-up systems specifically designed for glaucoma diagnosis. Trust was measured across five key targets using a 5 point Likert scale questionnaires. Statistical analysis was performed to assess trust differences between the unimodal and multimodal AI systems. Optometrist performance was evaluated based on the alignment of their decisions with those of the unimodal and multimodal AI systems.</div></div><div><h3>Results:</h3><div>The results showed that the multimodal system had a higher average trust rating of 3.98 on a Likert scale, indicating greater trust compared to the unimodal system, which had an average trust rating of 3.00. This difference was statistically significant (<em>p</em><span><math><mo><</mo></math></span>.001), with further analysis revealing significant variation across all trust targets (<em>p</em><span><math><mo><</mo></math></span>.001). Additionally, optometrists demonstrated higher F1 scores with the multimodal system (43.1%) compared to the unimodal system (37.3%), while accuracy remained comparable between the two systems (63.0% for multimodal and 63.3% for unimodal). However, major differences across these metrics were observed in relation to optometrist’s expertise.</div></div><div><h3>Conclusions:</h3><div>Multimodal AI systems, which use the same data modality as optometrists and align more closely with their mental models and decision-making processes, result in higher user trust and improved diagnostic performance. Therefore, for effective human-AI collaboration in healthcare, specifically for glaucoma diagnosis, AI systems should be designed to utilize the same data sources as the human counterparts, ensuring consistency and improving both trust and decision accuracy.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"198 ","pages":"Article 111132"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525014854","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background:
User trust and decision accuracy are crucial for the successful collaboration of humans and Artificial Intelligence (AI) systems, especially in clinical settings such as glaucoma diagnosis. Both trust and accuracy are influenced by the data modality used by AI systems, which directly impacts the effectiveness of human-AI collaboration.
Objective:
The objective of this study is to discover the modality of an AI system that aligns best with an optometrist’s mental model. This was achieved by comparing trust levels between unimodal and multimodal AI systems used for glaucoma diagnosis. Additionally, we explore the impact of modality on various targets of user trust and user performance.
Methods:
We conducted a within-subject study with 20 optometrists, who were presented with both unimodal and multimodal AI mock-up systems specifically designed for glaucoma diagnosis. Trust was measured across five key targets using a 5 point Likert scale questionnaires. Statistical analysis was performed to assess trust differences between the unimodal and multimodal AI systems. Optometrist performance was evaluated based on the alignment of their decisions with those of the unimodal and multimodal AI systems.
Results:
The results showed that the multimodal system had a higher average trust rating of 3.98 on a Likert scale, indicating greater trust compared to the unimodal system, which had an average trust rating of 3.00. This difference was statistically significant (p.001), with further analysis revealing significant variation across all trust targets (p.001). Additionally, optometrists demonstrated higher F1 scores with the multimodal system (43.1%) compared to the unimodal system (37.3%), while accuracy remained comparable between the two systems (63.0% for multimodal and 63.3% for unimodal). However, major differences across these metrics were observed in relation to optometrist’s expertise.
Conclusions:
Multimodal AI systems, which use the same data modality as optometrists and align more closely with their mental models and decision-making processes, result in higher user trust and improved diagnostic performance. Therefore, for effective human-AI collaboration in healthcare, specifically for glaucoma diagnosis, AI systems should be designed to utilize the same data sources as the human counterparts, ensuring consistency and improving both trust and decision accuracy.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.