Emile Saillard, Aurélie Levillain, David Mitton, Jean-Baptiste Pialat, Cyrille Confavreux, Hélène Follet, Thomas Grenier
{"title":"Enhanced segmentation of femoral bone metastasis in CT scans of patients using synthetic data generation with 3D diffusion models","authors":"Emile Saillard, Aurélie Levillain, David Mitton, Jean-Baptiste Pialat, Cyrille Confavreux, Hélène Follet, Thomas Grenier","doi":"arxiv-2409.11011","DOIUrl":null,"url":null,"abstract":"Purpose: Bone metastasis have a major impact on the quality of life of\npatients and they are diverse in terms of size and location, making their\nsegmentation complex. Manual segmentation is time-consuming, and expert\nsegmentations are subject to operator variability, which makes obtaining\naccurate and reproducible segmentations of bone metastasis on CT-scans a\nchallenging yet important task to achieve. Materials and Methods: Deep learning\nmethods tackle segmentation tasks efficiently but require large datasets along\nwith expert manual segmentations to generalize on new images. We propose an\nautomated data synthesis pipeline using 3D Denoising Diffusion Probabilistic\nModels (DDPM) to enchance the segmentation of femoral metastasis from CT-scan\nvolumes of patients. We used 29 existing lesions along with 26 healthy femurs\nto create new realistic synthetic metastatic images, and trained a DDPM to\nimprove the diversity and realism of the simulated volumes. We also\ninvestigated the operator variability on manual segmentation. Results: We\ncreated 5675 new volumes, then trained 3D U-Net segmentation models on real and\nsynthetic data to compare segmentation performance, and we evaluated the\nperformance of the models depending on the amount of synthetic data used in\ntraining. Conclusion: Our results showed that segmentation models trained with\nsynthetic data outperformed those trained on real volumes only, and that those\nmodels perform especially well when considering operator variability.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Bone metastasis have a major impact on the quality of life of
patients and they are diverse in terms of size and location, making their
segmentation complex. Manual segmentation is time-consuming, and expert
segmentations are subject to operator variability, which makes obtaining
accurate and reproducible segmentations of bone metastasis on CT-scans a
challenging yet important task to achieve. Materials and Methods: Deep learning
methods tackle segmentation tasks efficiently but require large datasets along
with expert manual segmentations to generalize on new images. We propose an
automated data synthesis pipeline using 3D Denoising Diffusion Probabilistic
Models (DDPM) to enchance the segmentation of femoral metastasis from CT-scan
volumes of patients. We used 29 existing lesions along with 26 healthy femurs
to create new realistic synthetic metastatic images, and trained a DDPM to
improve the diversity and realism of the simulated volumes. We also
investigated the operator variability on manual segmentation. Results: We
created 5675 new volumes, then trained 3D U-Net segmentation models on real and
synthetic data to compare segmentation performance, and we evaluated the
performance of the models depending on the amount of synthetic data used in
training. Conclusion: Our results showed that segmentation models trained with
synthetic data outperformed those trained on real volumes only, and that those
models perform especially well when considering operator variability.