MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography

Authors: Daniel Barco (Centre for Artificial Intelligence), Marc Stadelmann (Centre for Artificial Intelligence), Martin Oswald (Centre for Artificial Intelligence), Ivo Herzig (Institute of Applied Mathematics and Physics), Lukas Lichtensteiger (Institute of Applied Mathematics and Physics), Pascal Paysan (Varian Medical Systems Imaging Lab, Baden, Switzerland), Igor Peterlik (Varian Medical Systems Imaging Lab, Baden, Switzerland), Michal Walczak (Varian Medical Systems Imaging Lab, Baden, Switzerland), Bjoern Menze (Biomedical Image Analysis and Machine Learning, University of Zurich, Zurich, Switzerland), Frank-Peter Schilling (Centre for Artificial Intelligence)

We present MInDI-3D (Medical Inversion by Direct Iteration in 3D), the first 3D conditional diffusion-based model for real-world sparse-view Cone Beam Computed Tomography (CBCT) artefact removal, aiming to reduce imaging radiation exposure. A key contribution is extending the "InDI" concept from 2D to a full 3D volumetric approach for medical images, implementing an iterative denoising process that refines the CBCT volume directly from sparse-view input. A further contribution is the generation of a large pseudo-CBCT dataset (16,182) from chest CT volumes of the CT-RATE public dataset to robustly train MInDI-3D. We performed a comprehensive evaluation, including quantitative metrics, scalability analysis, generalisation tests, and a clinical assessment by 11 clinicians. Our results show MInDI-3D's effectiveness: with only 50 projections, it achieves a PSNR gain over uncorrected scans of 12.96 dB on the CT-RATE pseudo-CBCT test set and 6.10 dB on the independent real-world test set, enabling an 8x reduction in imaging radiation exposure. We demonstrate its scalability by showing that performance improves with more training data. Importantly, MInDI-3D matches the performance of a 3D U-Net on real-world scans from 16 cancer patients across distortion and task-based metrics. It also generalises to new CBCT scanner geometries. Clinicians rated our model as sufficient for patient positioning across all anatomical sites and found it preserved lung tumour boundaries well.
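
The iterative refinement and the PSNR comparison mentioned in the abstract can be illustrated with a minimal sketch. It assumes the update rule of the original InDI formulation, x_(t-δ) = (δ/t)·F(x_t, t) + (1 - δ/t)·x_t, applied to a 3D array; the abstract does not give MInDI-3D's actual network, step schedule, or preprocessing. The names `indi_restore`, `restorer`, and `psnr` are hypothetical, and `restorer` stands in for a trained 3D model.

```python
import numpy as np


def indi_restore(degraded, restorer, num_steps=10):
    """Refine a degraded volume with an InDI-style iteration (illustrative only).

    degraded  : 3D array, e.g. a sparse-view CBCT reconstruction (state at t = 1).
    restorer  : callable (volume, t) -> estimate of the clean volume.
                Hypothetical stand-in for a trained 3D network.
    num_steps : number of equal steps from t = 1 down to t = 0.
    """
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    x = np.asarray(degraded, dtype=np.float64).copy()
    delta = 1.0 / num_steps
    for k in range(num_steps, 0, -1):
        t = k * delta                 # current time, from 1 down to delta
        estimate = restorer(x, t)     # current guess of the clean volume
        w = 1.0 / k                   # equals delta / t for equal steps
        x = w * estimate + (1.0 - w) * x  # small step towards the estimate
    return x


def psnr(reference, test, data_range=1.0):
    """Peak signal-to-noise ratio in dB; data_range is an assumed intensity span."""
    reference = np.asarray(reference, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    mse = np.mean((reference - test) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(data_range ** 2 / mse)
```

Under these assumptions, a PSNR gain such as the one reported would be computed as `psnr(reference, restored) - psnr(reference, degraded)`, where `reference` is the full-view reconstruction. With a single step the loop reduces to one direct prediction; more steps trade compute for gradual refinement.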

Subjects: Computer Vision and Pattern Recognition, Artificial Intelligence

Published: 2025-08-13 08:49:18 UTC

2508.09616
