Generative AI and foundation models in medical image.

IF 1.5 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Radiological Physics and Technology Pub Date : 2025-10-06 DOI:10.1007/s12194-025-00968-1

Masahiro Oda

{"title":"Generative AI and foundation models in medical image.","authors":"Masahiro Oda","doi":"10.1007/s12194-025-00968-1","DOIUrl":null,"url":null,"abstract":"<p><p>In recent years, generative AI has attracted significant public attention, and its use has been rapidly expanding across a wide range of domains. From creative tasks such as text summarization, idea generation, and source code generation, to the streamlining of medical support tasks like diagnostic report generation and summarization, AI is now deeply involved in many areas. Today's breadth of AI applications is clearly distinct from what was seen before generative AI gained widespread recognition. Representative generative AI services include DALL·E 3 (OpenAI, California, USA) and Stable Diffusion (Stability AI, London, England, UK) for image generation, ChatGPT (OpenAI, California, USA), and Gemini (Google, California, USA) for text generation. The rise of generative AI has been influenced by advances in deep learning models and the scaling up of data, models, and computational resources based on the Scaling Laws. Moreover, the emergence of foundation models, which are trained on large-scale datasets and possess general-purpose knowledge applicable to various downstream tasks, is creating a new paradigm in AI development. These shifts brought about by generative AI and foundation models also profoundly impact medical image processing, fundamentally changing the framework for AI development in healthcare. This paper provides an overview of diffusion models used in image generation AI and large language models (LLMs) used in text generation AI, and introduces their applications in medical support. This paper also discusses foundation models, which are gaining attention alongside generative AI, including their construction methods and applications in the medical field. Finally, the paper explores how to develop foundation models and high-performance AI for medical support by fully utilizing national data and computational resources.</p>","PeriodicalId":46252,"journal":{"name":"Radiological Physics and Technology","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiological Physics and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s12194-025-00968-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, generative AI has attracted significant public attention, and its use has been rapidly expanding across a wide range of domains. From creative tasks such as text summarization, idea generation, and source code generation, to the streamlining of medical support tasks like diagnostic report generation and summarization, AI is now deeply involved in many areas. Today's breadth of AI applications is clearly distinct from what was seen before generative AI gained widespread recognition. Representative generative AI services include DALL·E 3 (OpenAI, California, USA) and Stable Diffusion (Stability AI, London, England, UK) for image generation, ChatGPT (OpenAI, California, USA), and Gemini (Google, California, USA) for text generation. The rise of generative AI has been influenced by advances in deep learning models and the scaling up of data, models, and computational resources based on the Scaling Laws. Moreover, the emergence of foundation models, which are trained on large-scale datasets and possess general-purpose knowledge applicable to various downstream tasks, is creating a new paradigm in AI development. These shifts brought about by generative AI and foundation models also profoundly impact medical image processing, fundamentally changing the framework for AI development in healthcare. This paper provides an overview of diffusion models used in image generation AI and large language models (LLMs) used in text generation AI, and introduces their applications in medical support. This paper also discusses foundation models, which are gaining attention alongside generative AI, including their construction methods and applications in the medical field. Finally, the paper explores how to develop foundation models and high-performance AI for medical support by fully utilizing national data and computational resources.

查看原文本刊更多论文

生成式AI与医学图像中的基础模型。

近年来，生成式人工智能引起了公众的广泛关注，它的应用已经迅速扩展到广泛的领域。从文本摘要、创意生成和源代码生成等创造性任务，到诊断报告生成和摘要等医疗支持任务的简化，人工智能现在深入到许多领域。今天人工智能应用的广度与生成式人工智能获得广泛认可之前的情况明显不同。具有代表性的生成式AI服务包括用于图像生成的DALL·e3 （OpenAI, California， USA）和Stable Diffusion (Stability AI, London, England， UK)，用于文本生成的ChatGPT （OpenAI, California， USA）和Gemini（谷歌，California， USA）。生成式人工智能的兴起受到深度学习模型的进步以及基于缩放定律的数据、模型和计算资源的扩展的影响。此外，基础模型的出现正在创造人工智能开发的新范式，这些模型是在大规模数据集上训练的，具有适用于各种下游任务的通用知识。生成式人工智能和基础模型带来的这些变化也深刻影响着医学图像处理，从根本上改变了人工智能在医疗保健领域的发展框架。本文概述了图像生成AI中使用的扩散模型和文本生成AI中使用的大语言模型（llm），并介绍了它们在医疗支持中的应用。本文还讨论了与生成式人工智能一起受到关注的基础模型，包括其构建方法和在医学领域的应用。最后，探讨如何充分利用国家数据和计算资源，开发医疗保障基础模型和高性能人工智能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Radiological Physics and Technology RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

3.00

自引率

12.50%

发文量

期刊介绍： The purpose of the journal Radiological Physics and Technology is to provide a forum for sharing new knowledge related to research and development in radiological science and technology, including medical physics and radiological technology in diagnostic radiology, nuclear medicine, and radiation therapy among many other radiological disciplines, as well as to contribute to progress and improvement in medical practice and patient health care.