A. Tarkhan, Trung-Kien Nguyen, N. Simon, T. Bengtsson, Paolo Ocampo, Jian Dai
{"title":"Attention-Based Deep Multiple Instance Learning with Adaptive Instance Sampling","authors":"A. Tarkhan, Trung-Kien Nguyen, N. Simon, T. Bengtsson, Paolo Ocampo, Jian Dai","doi":"10.1109/ISBI52829.2022.9761661","DOIUrl":null,"url":null,"abstract":"One challenge of training deep neural networks with gigapixel whole-slide images (WSIs) in computational pathology is the lack of annotation at pixel level or regional level due to the high cost and time-consuming labeling effort. Multiple instance learning (MIL) and its attention-based versions are typical weakly supervised learning methods, which allow us to use slide-level labels directly, without the need for pixel or region labels, thus reducing the cost of annotation. However, training a deep neural network with thousands of image regions (patches) per slide is computationally expensive, and it needs a lot of time for convergence. This paper proposes a fast adaptive attention-based deep MIL approach. This approach adaptively selects image regions that are highly predictive of outcome and ignores image regions with little or no information. We empirically show that our proposed approach outperforms the random sampling approach while it is faster than the standard attention-based MIL method (which uses all image regions for training).","PeriodicalId":6827,"journal":{"name":"2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)","volume":"29 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISBI52829.2022.9761661","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
One challenge of training deep neural networks with gigapixel whole-slide images (WSIs) in computational pathology is the lack of annotation at pixel level or regional level due to the high cost and time-consuming labeling effort. Multiple instance learning (MIL) and its attention-based versions are typical weakly supervised learning methods, which allow us to use slide-level labels directly, without the need for pixel or region labels, thus reducing the cost of annotation. However, training a deep neural network with thousands of image regions (patches) per slide is computationally expensive, and it needs a lot of time for convergence. This paper proposes a fast adaptive attention-based deep MIL approach. This approach adaptively selects image regions that are highly predictive of outcome and ignores image regions with little or no information. We empirically show that our proposed approach outperforms the random sampling approach while it is faster than the standard attention-based MIL method (which uses all image regions for training).