Raghav Mehta, Angelos Filos, Ujjwal Baid, C. Sako, Richard McKinley, M. Rebsamen, K. Datwyler, Raphael Meier, P. Radojewski, G. Murugesan, S. Nalawade, Chandan Ganesh, B. Wagner, F. Yu, B. Fei, A. Madhuranthakam, J. Maldjian, L. Daza, Catalina G'omez, P. Arbel'aez, Chengliang Dai, Shuo Wang, Hadrien Raynaud, Yuanhan Mo, E. Angelini, Yike Guo, Wenjia Bai, Subhashis Banerjee, L. Pei, A. Murat, Sarahi Rosas-Gonz'alez, Illyess Zemmoura, C. Tauber, Minh H. Vu, T. Nyholm, T. Lofstedt, Laura Mora Ballestar, Verónica Vilaplana, Hugh McHugh, G. M. Talou, Alan Wang, J. Patel, Ken Chang, K. Hoebel, M. Gidwani, N. Arun, Sharut Gupta, M. Aggarwal, Praveer Singh, E. Gerstner, Jayashree Kalpathy-Cramer, Nicolas Boutry, Alexis Huard, L. Vidyaratne, Md Monibor Rahman, K. Iftekharuddin, J. Chazalon, É. Puybareau, G. Tochon, Jun Ma, M. Cabezas, X. Lladó, A. Oliver, Liliana Valencia, S. Valverde, Mehdi Amian, M. Soltaninejad, A. Myronenko, Ali Hatamizadeh, Xuejing Feng, Q. Dou, N. Tustison, Craig Meyer, Nisarg A. Shah, S. Ta
{"title":"MICCAI BraTS 2020脑肿瘤分割中量化不确定性的挑战-排名指标和基准结果分析","authors":"Raghav Mehta, Angelos Filos, Ujjwal Baid, C. Sako, Richard McKinley, M. Rebsamen, K. Datwyler, Raphael Meier, P. Radojewski, G. Murugesan, S. Nalawade, Chandan Ganesh, B. Wagner, F. Yu, B. Fei, A. Madhuranthakam, J. Maldjian, L. Daza, Catalina G'omez, P. Arbel'aez, Chengliang Dai, Shuo Wang, Hadrien Raynaud, Yuanhan Mo, E. Angelini, Yike Guo, Wenjia Bai, Subhashis Banerjee, L. Pei, A. Murat, Sarahi Rosas-Gonz'alez, Illyess Zemmoura, C. Tauber, Minh H. Vu, T. Nyholm, T. Lofstedt, Laura Mora Ballestar, Verónica Vilaplana, Hugh McHugh, G. M. Talou, Alan Wang, J. Patel, Ken Chang, K. Hoebel, M. Gidwani, N. Arun, Sharut Gupta, M. Aggarwal, Praveer Singh, E. Gerstner, Jayashree Kalpathy-Cramer, Nicolas Boutry, Alexis Huard, L. Vidyaratne, Md Monibor Rahman, K. Iftekharuddin, J. Chazalon, É. Puybareau, G. Tochon, Jun Ma, M. Cabezas, X. Lladó, A. Oliver, Liliana Valencia, S. Valverde, Mehdi Amian, M. Soltaninejad, A. Myronenko, Ali Hatamizadeh, Xuejing Feng, Q. Dou, N. Tustison, Craig Meyer, Nisarg A. Shah, S. Ta","doi":"10.59275/j.melba.2022-354b","DOIUrl":null,"url":null,"abstract":"Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying the reliability of DL model predictions in the form of uncertainties could enable clinical review of the most uncertain regions, thereby building trust and paving the way toward clinical translation. Several uncertainty estimation methods have recently been introduced for DL medical image segmentation tasks. Developing scores to evaluate and compare the performance of uncertainty measures will assist the end-user in making more informed decisions. In this study, we explore and evaluate a score developed during the BraTS 2019 and BraTS 2020 task on uncertainty quantification (QU-BraTS) and designed to assess and rank uncertainty estimates for brain tumor multi-compartment segmentation. This score (1) rewards uncertainty estimates that produce high confidence in correct assertions and those that assign low confidence levels at incorrect assertions, and (2) penalizes uncertainty measures that lead to a higher percentage of under-confident correct assertions. We further benchmark the segmentation uncertainties generated by 14 independent participating teams of QU-BraTS 2020, all of which also participated in the main BraTS segmentation task. Overall, our findings confirm the importance and complementary value that uncertainty estimates provide to segmentation algorithms, highlighting the need for uncertainty quantification in medical image analyses. Finally, in favor of transparency and reproducibility, our evaluation code is made publicly available at https://github.com/RagMeh11/QU-BraTS.","PeriodicalId":75083,"journal":{"name":"The journal of machine learning for biomedical imaging","volume":"86 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Metrics and Benchmarking Results\",\"authors\":\"Raghav Mehta, Angelos Filos, Ujjwal Baid, C. Sako, Richard McKinley, M. Rebsamen, K. Datwyler, Raphael Meier, P. Radojewski, G. Murugesan, S. Nalawade, Chandan Ganesh, B. Wagner, F. Yu, B. Fei, A. Madhuranthakam, J. Maldjian, L. Daza, Catalina G'omez, P. Arbel'aez, Chengliang Dai, Shuo Wang, Hadrien Raynaud, Yuanhan Mo, E. Angelini, Yike Guo, Wenjia Bai, Subhashis Banerjee, L. Pei, A. Murat, Sarahi Rosas-Gonz'alez, Illyess Zemmoura, C. Tauber, Minh H. Vu, T. Nyholm, T. Lofstedt, Laura Mora Ballestar, Verónica Vilaplana, Hugh McHugh, G. M. Talou, Alan Wang, J. Patel, Ken Chang, K. Hoebel, M. Gidwani, N. Arun, Sharut Gupta, M. Aggarwal, Praveer Singh, E. Gerstner, Jayashree Kalpathy-Cramer, Nicolas Boutry, Alexis Huard, L. Vidyaratne, Md Monibor Rahman, K. Iftekharuddin, J. Chazalon, É. Puybareau, G. Tochon, Jun Ma, M. Cabezas, X. Lladó, A. Oliver, Liliana Valencia, S. Valverde, Mehdi Amian, M. Soltaninejad, A. Myronenko, Ali Hatamizadeh, Xuejing Feng, Q. Dou, N. Tustison, Craig Meyer, Nisarg A. Shah, S. Ta\",\"doi\":\"10.59275/j.melba.2022-354b\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying the reliability of DL model predictions in the form of uncertainties could enable clinical review of the most uncertain regions, thereby building trust and paving the way toward clinical translation. Several uncertainty estimation methods have recently been introduced for DL medical image segmentation tasks. Developing scores to evaluate and compare the performance of uncertainty measures will assist the end-user in making more informed decisions. In this study, we explore and evaluate a score developed during the BraTS 2019 and BraTS 2020 task on uncertainty quantification (QU-BraTS) and designed to assess and rank uncertainty estimates for brain tumor multi-compartment segmentation. This score (1) rewards uncertainty estimates that produce high confidence in correct assertions and those that assign low confidence levels at incorrect assertions, and (2) penalizes uncertainty measures that lead to a higher percentage of under-confident correct assertions. We further benchmark the segmentation uncertainties generated by 14 independent participating teams of QU-BraTS 2020, all of which also participated in the main BraTS segmentation task. Overall, our findings confirm the importance and complementary value that uncertainty estimates provide to segmentation algorithms, highlighting the need for uncertainty quantification in medical image analyses. Finally, in favor of transparency and reproducibility, our evaluation code is made publicly available at https://github.com/RagMeh11/QU-BraTS.\",\"PeriodicalId\":75083,\"journal\":{\"name\":\"The journal of machine learning for biomedical imaging\",\"volume\":\"86 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The journal of machine learning for biomedical imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59275/j.melba.2022-354b\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of machine learning for biomedical imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59275/j.melba.2022-354b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Metrics and Benchmarking Results
Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying the reliability of DL model predictions in the form of uncertainties could enable clinical review of the most uncertain regions, thereby building trust and paving the way toward clinical translation. Several uncertainty estimation methods have recently been introduced for DL medical image segmentation tasks. Developing scores to evaluate and compare the performance of uncertainty measures will assist the end-user in making more informed decisions. In this study, we explore and evaluate a score developed during the BraTS 2019 and BraTS 2020 task on uncertainty quantification (QU-BraTS) and designed to assess and rank uncertainty estimates for brain tumor multi-compartment segmentation. This score (1) rewards uncertainty estimates that produce high confidence in correct assertions and those that assign low confidence levels at incorrect assertions, and (2) penalizes uncertainty measures that lead to a higher percentage of under-confident correct assertions. We further benchmark the segmentation uncertainties generated by 14 independent participating teams of QU-BraTS 2020, all of which also participated in the main BraTS segmentation task. Overall, our findings confirm the importance and complementary value that uncertainty estimates provide to segmentation algorithms, highlighting the need for uncertainty quantification in medical image analyses. Finally, in favor of transparency and reproducibility, our evaluation code is made publicly available at https://github.com/RagMeh11/QU-BraTS.