WingBeats and Snapshots: Fusing Sound and Vision for Mosquito Monitoring (Student Abstract)

42196@AAAI

Total: 1

#1 WingBeats and Snapshots: Fusing Sound and Vision for Mosquito Monitoring (Student Abstract) [PDF] [Copy] [Kimi] [REL]

Authors: Ahana Chanda, Akshay Agarwal

Accurate identification of mosquito species is crucial for controlling vector-borne diseases, yet visual or acoustic methods alone are often insufficient. We propose a multimodal deep-learning framework that combines high-resolution images with wingbeat audio using a SwinV2 vision transformer and an Audio Spectrogram Transformer, thereby capturing complementary cues. On a six-species dataset, it achieves 97% accuracy, comparable to the best single-modality baseline, and is designed to improve robustness under noise or environmental variation, demonstrating the value of integrating multiple data sources for reliable mosquito surveillance.

Subject: AAAI.2026 - Student Abstract and Poster Program