Learning Structured Representation for Text Classification via Reinforcement Learning

#1 Learning Structured Representation for Text Classification via Reinforcement Learning [PDF] [Copy] [Kimi] [REL]

Authors: Tianyang Zhang, Minlie Huang, Li Zhao

Representation learning is a fundamental problem in natural language processing. This paper studies how to learn a structured representation for text classification. Unlike most existing representation models that either use no structure or rely on pre-specified structures, we propose a reinforcement learning (RL) method to learn sentence representation by discovering optimized structures automatically. We demonstrate two attempts to build structured representation: Information Distilled LSTM (ID-LSTM) and Hierarchically Structured LSTM (HS-LSTM). ID-LSTM selects only important, task-relevant words, and HS-LSTM discovers phrase structures in a sentence. Structure discovery in the two representation models is formulated as a sequential decision problem: current decision of structure discovery affects following decisions, which can be addressed by policy gradient RL. Results show that our method can learn task-friendly representations by identifying important words or task-relevant structures without explicit structure annotations, and thus yields competitive performance.

Subject: AAAI.2018 - NLP and Text Mining

12047@AAAI

#1 Learning Structured Representation for Text Classification via Reinforcement Learning [PDF] [Copy] [Kimi] [REL]