No Free Lunch for Synthetic Images under Data Scarcity Conditions

Authors: Borja Arroyo Galende, Alejandro Almodóvar, Patricia A. Apellániz, Juan Parras, Silvia Uribe, Santiago Zazo

This study investigates the trade-offs between fidelity, privacy, and utility in synthetic data generation under conditions of data scarcity and privacy sensitivity. We propose an evaluation framework that jointly assesses these three dimensions and apply it to three widely used generative models: VAE, GAN, and DDPM. The evaluation spans three image datasets, MNIST, OCTMNIST, and OrganAMNIST, covering both general-purpose and medical imaging domains. The three models behave notably differently when differential privacy mechanisms are introduced during training. GAN and DDPM demonstrate greater robustness, maintaining higher fidelity and downstream utility across a range of noise levels, while VAE degrades more rapidly as privacy constraints increase. This study highlights the importance of a multidimensional evaluation of deep generative models, whose behaviour differs significantly when privacy techniques are applied.

Subjects: Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning

Published: 2026-06-01 12:45:02 UTC

2606.07640
