Yue Zou, Zhenhao Li, Menghan Zhang, Ziwei Li, Xiaojie Yin, Long Yang, Weigang Hu, Jiazhou Wang
{"title":"用矢量量化变分自编码器生成模型预测放射治疗中的解剖变异","authors":"Yue Zou, Zhenhao Li, Menghan Zhang, Ziwei Li, Xiaojie Yin, Long Yang, Weigang Hu, Jiazhou Wang","doi":"10.1002/mp.18120","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Anatomical variations during radiotherapy fractions can lead to deviations in radiation delivery. Predicting these changes may benefit adaptive radiotherapy (ART) in nasopharyngeal cancer.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>This study proposes a vector quantized variational autoencoder (VQ-VAE) based generative model to predict anatomical changes in nasopharyngeal cancer patients.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>The model integrates a VQ-VAE with Adaptive Instance Normalization (AdaIN). VQ-VAE encodes anatomical structures from planning CT images, while a convolutional neural network (CNN) extracts latent codes that capture potential anatomical variations. AdaIN then modulates the VQ-VAE's latent space to generate daily CT images that reflect these anatomical changes. The model was trained and validated on 522 CT images from 90 nasopharyngeal cancer patients and tested using 102 CT images from 18 patients. The quality of the generated images was evaluated through visual inspection, while the model's accuracy was assessed by comparing the predicted and actual volumes of the parotid and submandibular glands at both individual and population levels.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>For individual patients, Mann–Whitney and Kruskal–Wallis tests found no significant differences in organ-at-risk (OAR) volume distributions between generated and actual daily CT images. At the population level, predicted mean ROI volumes (parotid glands: 26.9 ± 2.1 cm<sup>3</sup>; submandibular glands: 7.0 ± 0.71 cm<sup>3</sup>) closely matched ground truth values (parotid: 29.5 ± 3.2 cm<sup>3</sup>; submandibular: 7.2 ± 0.67 cm<sup>3</sup>) and outperformed the previous Daily Anatomy Model (DAM) model (parotid: 20.4 ± 1.9 cm<sup>3</sup>; submandibular: 6.2 ± 0.6 cm<sup>3</sup>). The Pearson correlation coefficients between actual and generated daily CT ROI volumes were 0.92, 0.87, 0.89, and 0.93 for right parotid, left parotid, right submandibular, and left submandibular, respectively.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>The VQ-VAE model effectively predicts anatomical changes during radiotherapy based on planning CT, demonstrating its potential to inform adaptive decision-making in radiotherapy.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 9","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting anatomical variations in radiotherapy with a vector quantized variational autoencoder generative model\",\"authors\":\"Yue Zou, Zhenhao Li, Menghan Zhang, Ziwei Li, Xiaojie Yin, Long Yang, Weigang Hu, Jiazhou Wang\",\"doi\":\"10.1002/mp.18120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>Anatomical variations during radiotherapy fractions can lead to deviations in radiation delivery. Predicting these changes may benefit adaptive radiotherapy (ART) in nasopharyngeal cancer.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>This study proposes a vector quantized variational autoencoder (VQ-VAE) based generative model to predict anatomical changes in nasopharyngeal cancer patients.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>The model integrates a VQ-VAE with Adaptive Instance Normalization (AdaIN). VQ-VAE encodes anatomical structures from planning CT images, while a convolutional neural network (CNN) extracts latent codes that capture potential anatomical variations. AdaIN then modulates the VQ-VAE's latent space to generate daily CT images that reflect these anatomical changes. The model was trained and validated on 522 CT images from 90 nasopharyngeal cancer patients and tested using 102 CT images from 18 patients. The quality of the generated images was evaluated through visual inspection, while the model's accuracy was assessed by comparing the predicted and actual volumes of the parotid and submandibular glands at both individual and population levels.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>For individual patients, Mann–Whitney and Kruskal–Wallis tests found no significant differences in organ-at-risk (OAR) volume distributions between generated and actual daily CT images. At the population level, predicted mean ROI volumes (parotid glands: 26.9 ± 2.1 cm<sup>3</sup>; submandibular glands: 7.0 ± 0.71 cm<sup>3</sup>) closely matched ground truth values (parotid: 29.5 ± 3.2 cm<sup>3</sup>; submandibular: 7.2 ± 0.67 cm<sup>3</sup>) and outperformed the previous Daily Anatomy Model (DAM) model (parotid: 20.4 ± 1.9 cm<sup>3</sup>; submandibular: 6.2 ± 0.6 cm<sup>3</sup>). The Pearson correlation coefficients between actual and generated daily CT ROI volumes were 0.92, 0.87, 0.89, and 0.93 for right parotid, left parotid, right submandibular, and left submandibular, respectively.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>The VQ-VAE model effectively predicts anatomical changes during radiotherapy based on planning CT, demonstrating its potential to inform adaptive decision-making in radiotherapy.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 9\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://aapm.onlinelibrary.wiley.com/doi/10.1002/mp.18120\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://aapm.onlinelibrary.wiley.com/doi/10.1002/mp.18120","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Predicting anatomical variations in radiotherapy with a vector quantized variational autoencoder generative model
Background
Anatomical variations during radiotherapy fractions can lead to deviations in radiation delivery. Predicting these changes may benefit adaptive radiotherapy (ART) in nasopharyngeal cancer.
Purpose
This study proposes a vector quantized variational autoencoder (VQ-VAE) based generative model to predict anatomical changes in nasopharyngeal cancer patients.
Methods
The model integrates a VQ-VAE with Adaptive Instance Normalization (AdaIN). VQ-VAE encodes anatomical structures from planning CT images, while a convolutional neural network (CNN) extracts latent codes that capture potential anatomical variations. AdaIN then modulates the VQ-VAE's latent space to generate daily CT images that reflect these anatomical changes. The model was trained and validated on 522 CT images from 90 nasopharyngeal cancer patients and tested using 102 CT images from 18 patients. The quality of the generated images was evaluated through visual inspection, while the model's accuracy was assessed by comparing the predicted and actual volumes of the parotid and submandibular glands at both individual and population levels.
Results
For individual patients, Mann–Whitney and Kruskal–Wallis tests found no significant differences in organ-at-risk (OAR) volume distributions between generated and actual daily CT images. At the population level, predicted mean ROI volumes (parotid glands: 26.9 ± 2.1 cm3; submandibular glands: 7.0 ± 0.71 cm3) closely matched ground truth values (parotid: 29.5 ± 3.2 cm3; submandibular: 7.2 ± 0.67 cm3) and outperformed the previous Daily Anatomy Model (DAM) model (parotid: 20.4 ± 1.9 cm3; submandibular: 6.2 ± 0.6 cm3). The Pearson correlation coefficients between actual and generated daily CT ROI volumes were 0.92, 0.87, 0.89, and 0.93 for right parotid, left parotid, right submandibular, and left submandibular, respectively.
Conclusions
The VQ-VAE model effectively predicts anatomical changes during radiotherapy based on planning CT, demonstrating its potential to inform adaptive decision-making in radiotherapy.
期刊介绍:
Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments
Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.