An investigation into the causes of race bias in artificial intelligence-based cine cardiac magnetic resonance segmentation.
Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Sebastien Roujol, Theodore Barfoot, Shaheim Ogbomo-Harmitt, Miaojing Shi, Andrew King
European heart journal. Digital health, 6(3), 350-358. Published 24 February 2025. DOI: 10.1093/ehjdh/ztaf008. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12088717/pdf/
Aims: Artificial intelligence (AI) methods are being used increasingly for the automated segmentation of cine cardiac magnetic resonance (CMR) imaging. However, these methods have been shown to be subject to race bias; i.e. they exhibit different levels of performance for different races depending on the (im)balance of the data used to train the AI model. In this paper, we investigate the source of this bias, seeking to understand its root cause(s).
Methods and results: We trained AI models to perform race classification on cine CMR images and/or segmentations from White and Black subjects in the UK Biobank and found that classification accuracy was higher for images than for segmentations. Interpretability methods showed that the models were primarily looking at non-heart regions. Cropping the images tightly around the heart caused classification accuracy to drop to near-chance level. Visualizing the latent space of AI segmentation models showed that race information was encoded in the models. Training segmentation models on cropped images reduced, but did not eliminate, the bias. A number of possible confounders for the bias in segmentation model performance were identified for Black subjects, but none for White subjects.
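Two of the steps above can be made concrete with short, hedged sketches in Python. First, a minimal illustration of cropping an image tightly around the heart, assuming 2D short-axis slices stored as NumPy arrays and a ground-truth segmentation mask whose non-zero pixels cover the heart labels; the function name crop_to_heart and the margin parameter are hypothetical and not the authors' implementation:

import numpy as np

def crop_to_heart(image, mask, margin=10):
    # Crop `image` to the bounding box of the non-zero labels in `mask`,
    # padded by `margin` pixels on each side; both arrays are 2D (H, W).
    ys, xs = np.nonzero(mask)
    if ys.size == 0:                      # no heart labels in this slice
        return image
    y0 = max(ys.min() - margin, 0)
    y1 = min(ys.max() + margin + 1, image.shape[0])
    x0 = max(xs.min() - margin, 0)
    x1 = min(xs.max() + margin + 1, image.shape[1])
    return image[y0:y1, x0:x1]

# Dummy example: a 256 x 256 slice with a circular "heart" label of radius 30.
image = np.random.rand(256, 256).astype(np.float32)
yy, xx = np.mgrid[:256, :256]
mask = ((yy - 128) ** 2 + (xx - 128) ** 2 < 30 ** 2).astype(np.uint8)
print(crop_to_heart(image, mask).shape)   # approx. (79, 79): heart plus margin only

Second, a sketch of the latent-space inspection: project features from a segmentation model's bottleneck to two dimensions and colour the points by race to see whether race information clusters. The random arrays below are stand-ins for real encoder activations and race labels, and PCA is used only as one possible projection method:

import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

features = np.random.rand(200, 512)        # stand-in for bottleneck activations
race = np.random.randint(0, 2, size=200)   # 0 = White, 1 = Black (dummy labels)
xy = PCA(n_components=2).fit_transform(features)
for label, name in [(0, "White"), (1, "Black")]:
    plt.scatter(xy[race == label, 0], xy[race == label, 1], s=8, label=name)
plt.legend()
plt.title("Segmentation-model latent space coloured by race (illustrative)")
plt.show()

If race information is encoded in the model, the two groups would tend to separate in such a plot; with the random stand-in data used here they will not.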
Conclusion: Distributional differences between annotated CMR data from White and Black subjects, which can lead to bias in trained AI segmentation models, are predominantly image-based, not segmentation-based. Most of the differences occur in areas outside the heart, such as subcutaneous fat. These findings will be important for researchers investigating the performance of AI models across different races.