2502.09520

Total: 1

#1 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization [PDF2] [Copy] [Kimi2] [REL]

Authors: Francesco Pezone, Sergio Barbarossa, Giuseppe Caire

This work introduces Semantically Masked VQ-GAN (SQ-GAN), a novel approach integrating generative models to optimize image compression for semantic/task-oriented communications. SQ-GAN employs off-the-shelf semantic semantic segmentation and a new specifically developed semantic-conditioned adaptive mask module (SAMM) to selectively encode semantically significant features of the images. SQ-GAN outperforms state-of-the-art image compression schemes such as JPEG2000 and BPG across multiple metrics, including perceptual quality and semantic segmentation accuracy on the post-decoding reconstructed image, at extreme low compression rates expressed in bits per pixel.

Subjects: Computer Vision and Pattern Recognition , Image and Video Processing

Publish: 2025-02-13 17:35:57 UTC