Anda belum login :: 16 Apr 2025 21:19 WIB
Detail
ArtikelSource-Error Aware Phrase-Based Decoding for Robust Conversational Spoken Language Translation  
Oleh: Ananthakrishnan, Sankaranarayanan ; Wei Chen ; Kumar, Rohit ; Mehay, Dennis
Jenis: Article from Proceeding
Dalam koleksi: Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Germany: Dec. 5-6, 2013
Fulltext: Source-Error Aware Phrase.pdf (11.79MB)
Isi artikelSpoken language translation (SLT) systems typically follow a pipeline architecture, in which the best automatic speech recognition (ASR) hypothesis of an input utterance is fed into a statistical machine translation (SMT) system. Conversational speech often generates unrecoverable ASR errors owing to its rich vocabulary (e.g. out-of-vocabulary (OOV) named entities). In this paper, we study the possibility of alleviating the impact of unrecoverable ASR errors on translation performance by minimizing the contextual effects of incorrect source words in target hypotheses. Our approach is driven by locally-derived penalties applied to bilingual phrase pairs as well as target language model (LM) likelihoods in the vicinity of source errors. With oracle word error labels on an OOV word-rich English-to- Iraqi Arabic translation task, we show statistically significant relative improvements of 3.2%BLEUand 2.0% METEOR over an error-agnostic baseline SMT system. We then investigate the impact of imperfect source error labels on error-aware translation performance. Simulation experiments reveal that modest translation improvements are to be gained with this approach even when the source error labels are noisy.
Opini AndaKlik untuk menuliskan opini Anda tentang koleksi ini!

Kembali
design
 
Process time: 0.015625 second(s)