2022.iwslt-1.20@ACL

Total: 1

#1 The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation [PDF] [Copy] [Kimi1]

Authors: Yinglu Li ; Minghan Wang ; Jiaxin Guo ; Xiaosong Qiao ; Yuxia Wang ; Daimeng Wei ; Chang Su ; Yimeng Chen ; Min Zhang ; Shimin Tao ; Hao Yang ; Ying Qin

This paper describes the HW-TSC’s designation of the Offline Speech Translation System submitted for IWSLT 2022 Evaluation. We explored both cascade and end-to-end system on three language tracks (en-de, en-zh and en-ja), and we chose the cascade one as our primary submission. For the automatic speech recognition (ASR) model of cascade system, there are three ASR models including Conformer, S2T-Transformer and U2 trained on the mixture of five datasets. During inference, transcripts are generated with the help of domain controlled generation strategy. Context-aware reranking and ensemble based anti-interference strategy are proposed to produce better ASR outputs. For machine translation part, we pretrained three translation models on WMT21 dataset and fine-tuned them on in-domain corpora. Our cascade system shows competitive performance than the known offline systems in the industry and academia.