Zhao_MExD_An_Expert-Infused_Diffusion_Model_for_Whole-Slide_Image_Classification@CVPR2025@CVF

Total: 1

#1 MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification [PDF2] [Copy] [Kimi] [REL]

Authors: Jianwei Zhao, Xin Li, Fan Yang, Qiang Zhai, Ao Luo, Yang Zhao, Hong Cheng, Huazhu Fu

Whole Slide Image (WSI) classification poses unique challenges due to the vast image size and numerous non-informative regions, which introduce noise and cause data imbalance during feature aggregation. To address these issues, we propose MExD, an Expert-Infused Diffusion Model that combines the strengths of a Mixture-of-Experts (MoE) mechanism with a diffusion model for enhanced classification. MExD balances patch feature distribution through a novel MoE-based aggregator that selectively emphasizes relevant information, effectively filtering noise, addressing data imbalance, and extracting essential features. These features are then integrated via a diffusion-based generative process to directly yield the class distribution for the WSI. Moving beyond conventional discriminative approaches, MExD represents the first generative strategy in WSI classification, capturing fine-grained details for robust and precise results. Our MExD is validated on three widely-used benchmarks—Camelyon16, TCGA-NSCLC, and BRACS—consistently achieving state-of-the-art performance in both binary and multi-class tasks. The model and code will be made publicly available upon acceptance.

Subject: CVPR.2025 - Poster