2410.22197

Total: 1

#1 Class-Aware Contrastive Optimization for Imbalanced Text Classification [PDF5] [Copy] [Kimi7] [REL]

Authors: Grigorii Khvatskii, Nuno Moniz, Khoa Doan, Nitesh V Chawla

The unique characteristics of text data make classification tasks a complex problem. Advances in unsupervised and semi-supervised learning and autoencoder architectures addressed several challenges. However, they still struggle with imbalanced text classification tasks, a common scenario in real-world applications, demonstrating a tendency to produce embeddings with unfavorable properties, such as class overlap. In this paper, we show that leveraging class-aware contrastive optimization combined with denoising autoencoders can successfully tackle imbalanced text classification tasks, achieving better performance than the current state-of-the-art. Concretely, our proposal combines reconstruction loss with contrastive class separation in the embedding space, allowing a better balance between the truthfulness of the generated embeddings and the model's ability to separate different classes. Compared with an extensive set of traditional and state-of-the-art competing methods, our proposal demonstrates a notable increase in performance across a wide variety of text datasets.

Subject: Computation and Language

Publish: 2024-10-29 16:34:08 UTC