Mohammad Alsharid, Harshita Sharma, Lior Drukker, Aris T Papageorgiou, J Alison Noble
{"title":"Weakly Supervised Captioning of Ultrasound Images.","authors":"Mohammad Alsharid, Harshita Sharma, Lior Drukker, Aris T Papageorgiou, J Alison Noble","doi":"10.1007/978-3-031-12053-4_14","DOIUrl":"https://doi.org/10.1007/978-3-031-12053-4_14","url":null,"abstract":"<p><p>Medical image captioning models generate text to describe the semantic contents of an image, aiding the non-experts in understanding and interpretation. We propose a weakly-supervised approach to improve the performance of image captioning models on small image-text datasets by leveraging a large anatomically-labelled image classification dataset. Our method generates pseudo-captions (weak labels) for caption-less but anatomically-labelled (class-labelled) images using an encoder-decoder sequence-to-sequence model. The augmented dataset is used to train an image-captioning model in a weakly supervised learning manner. For fetal ultrasound, we demonstrate that the proposed augmentation approach outperforms the baseline on semantics and syntax-based metrics, with nearly twice as much improvement in value on <i>BLEU-1</i> and <i>ROUGE-L</i>. Moreover, we observe that superior models are trained with the proposed data augmentation, when compared with the existing regularization techniques. This work allows seamless automatic annotation of images that lack human-prepared descriptive captions for training image-captioning models. Using pseudo-captions in the training data is particularly useful for medical image captioning when significant time and effort of medical experts is required to obtain real image captions.</p>","PeriodicalId":74147,"journal":{"name":"Medical image understanding and analysis : 26th annual conference, MIUA 2022, Cambridge, UK, July 27-29, 2022, proceedings. Medical Image Understanding and Analysis (Conference) (26th : 2022 : Cambridge, England)","volume":"13413 ","pages":"187-198"},"PeriodicalIF":0.0,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7614238/pdf/EMS159395.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9736100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"STAMP: A Self-training Student-Teacher Augmentation-Driven Meta Pseudo-Labeling Framework for 3D Cardiac MRI Image Segmentation.","authors":"S M Kamrul Hasan, Cristian Linte","doi":"10.1007/978-3-031-12053-4_28","DOIUrl":"10.1007/978-3-031-12053-4_28","url":null,"abstract":"<p><p>Medical image segmentation has significantly benefitted thanks to deep learning architectures. Furthermore, semi-supervised learning (SSL) has led to a significant improvement in overall model performance by leveraging abundant unlabeled data. Nevertheless, one shortcoming of pseudo-labeled based semi-supervised learning is pseudo-labeling bias, whose mitigation is the focus of this work. Here we propose a simple, yet effective SSL framework for image segmentation-<i>STAMP</i> (<i>Student-Teacher A</i>ugmentation-driven consistency regularization via <i>M</i>eta <i>P</i>seudo-Labeling). The proposed method uses self-training (through meta pseudo-labeling) in concert with a Teacher network that instructs the Student network by generating pseudo-labels given unlabeled input data. Unlike pseudo-labeling methods, for which the Teacher network remains unchanged, meta pseudo-labeling methods allow the Teacher network to constantly adapt in response to the performance of the Student network on the labeled dataset, hence enabling the Teacher to identify more effective pseudo-labels to instruct the Student. Moreover, to improve generalization and reduce error rate, we apply both strong and weak <i>data augmentation</i> policies, to ensure the segmentor outputs a consistent probability distribution regardless of the augmentation level. Our extensive experimentation with varied quantities of labeled data in the training sets demonstrates the effectiveness of our model in segmenting the left atrial cavity from Gadolinium-enhanced magnetic resonance (GE-MR) images. By exploiting unlabeled data with weak and strong augmentation effectively, our proposed model yielded a statistically significant 2.6% improvement <math><mo>(</mo> <mi>p</mi> <mo><</mo> <mn>0.001</mn> <mo>)</mo></math> in Dice and a 4.4% improvement <math><mo>(</mo> <mi>p</mi> <mo><</mo> <mn>0.001</mn> <mo>)</mo></math> in Jaccard over other state-of-the-art SSL methods using only 10% labeled data for training.</p>","PeriodicalId":74147,"journal":{"name":"Medical image understanding and analysis : 26th annual conference, MIUA 2022, Cambridge, UK, July 27-29, 2022, proceedings. Medical Image Understanding and Analysis (Conference) (26th : 2022 : Cambridge, England)","volume":"13413 ","pages":"371-386"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134897/pdf/nihms-1892744.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9455789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}