2506.16418

Total: 1

#1 Efficient Transformations in Deep Learning Convolutional Neural Networks [PDF2] [Copy] [Kimi] [REL]

Authors: Berk Yilmaz, Daniel Fidel Harvey, Prajit Dhuri

This study investigates the integration of signal processing transformations -- Fast Fourier Transform (FFT), Walsh-Hadamard Transform (WHT), and Discrete Cosine Transform (DCT) -- within the ResNet50 convolutional neural network (CNN) model for image classification. The primary objective is to assess the trade-offs between computational efficiency, energy consumption, and classification accuracy during training and inference. Using the CIFAR-100 dataset (100 classes, 60,000 images), experiments demonstrated that incorporating WHT significantly reduced energy consumption while improving accuracy. Specifically, a baseline ResNet50 model achieved a testing accuracy of 66%, consuming an average of 25,606 kJ per model. In contrast, a modified ResNet50 incorporating WHT in the early convolutional layers achieved 74% accuracy, and an enhanced version with WHT applied to both early and late layers achieved 79% accuracy, with an average energy consumption of only 39 kJ per model. These results demonstrate the potential of WHT as a highly efficient and effective approach for energy-constrained CNN applications.

Subjects: Computer Vision and Pattern Recognition , Artificial Intelligence , Image and Video Processing , Signal Processing

Publish: 2025-06-19 15:54:59 UTC