可转移性指标的性能不能转化为医疗任务

DART@MICCAI Pub Date : 2023-08-14 DOI:10.48550/arXiv.2308.07444

Levy G. Chaves, Alceu Bissoto, Eduardo Valle, S. Avila

{"title":"可转移性指标的性能不能转化为医疗任务","authors":"Levy G. Chaves, Alceu Bissoto, Eduardo Valle, S. Avila","doi":"10.48550/arXiv.2308.07444","DOIUrl":null,"url":null,"abstract":"Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to efficiently calculate a score that correlates with the architecture accuracy on any target dataset. However, since transferability scores have not been evaluated on medical datasets, their use in this context remains uncertain, preventing them from benefiting practitioners. We fill that gap in this work, thoroughly evaluating seven transferability scores in three medical applications, including out-of-distribution scenarios. Despite promising results in general-purpose datasets, our results show that no transferability score can reliably and consistently estimate target performance in medical contexts, inviting further work in that direction.","PeriodicalId":106540,"journal":{"name":"DART@MICCAI","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Performance of Transferability Metrics does not Translate to Medical Tasks\",\"authors\":\"Levy G. Chaves, Alceu Bissoto, Eduardo Valle, S. Avila\",\"doi\":\"10.48550/arXiv.2308.07444\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to efficiently calculate a score that correlates with the architecture accuracy on any target dataset. However, since transferability scores have not been evaluated on medical datasets, their use in this context remains uncertain, preventing them from benefiting practitioners. We fill that gap in this work, thoroughly evaluating seven transferability scores in three medical applications, including out-of-distribution scenarios. Despite promising results in general-purpose datasets, our results show that no transferability score can reliably and consistently estimate target performance in medical contexts, inviting further work in that direction.\",\"PeriodicalId\":106540,\"journal\":{\"name\":\"DART@MICCAI\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DART@MICCAI\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2308.07444\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DART@MICCAI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2308.07444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

迁移学习通过从大数据集获得的知识在小数据集上进行深度学习(DL)，从而提高医学图像分析的性能。随着深度学习体系结构数量的爆炸式增长，用尽全力尝试所有候选方案变得不可行，从而促使人们选择更便宜的替代方案。可转移性评分方法作为一种诱人的解决方案出现，允许有效地计算与任何目标数据集上的架构准确性相关的分数。然而，由于可转移性评分尚未在医疗数据集上进行评估，因此它们在此背景下的使用仍然不确定，从而使它们无法使从业者受益。我们在这项工作中填补了这一空白，全面评估了三种医疗应用中的七个可转移性分数，包括非分布场景。尽管在通用数据集中取得了令人鼓舞的结果，但我们的研究结果表明，没有可转移性评分可以可靠和一致地估计医疗环境中的目标性能，这需要在该方向上进一步开展工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The Performance of Transferability Metrics does not Translate to Medical Tasks

Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to efficiently calculate a score that correlates with the architecture accuracy on any target dataset. However, since transferability scores have not been evaluated on medical datasets, their use in this context remains uncertain, preventing them from benefiting practitioners. We fill that gap in this work, thoroughly evaluating seven transferability scores in three medical applications, including out-of-distribution scenarios. Despite promising results in general-purpose datasets, our results show that no transferability score can reliably and consistently estimate target performance in medical contexts, inviting further work in that direction.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

DART@MICCAI

自引率

0.00%

发文量