Anda belum login :: 16 Apr 2025 21:26 WIB
Home
|
Logon
Hidden
»
Administration
»
Collection Detail
Detail
CRF-based Disfluency Detection using Semantic Features for German to English Spoken Language Translation
Oleh:
Cho, Eunah
;
Ha, Thanh-Le
;
Waibel, Alex
Jenis:
Article from Proceeding
Dalam koleksi:
Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, Germany: Dec. 5-6, 2013
Fulltext:
CRF-based Disfluency Detection.pdf
(11.79MB)
Isi artikel
Disfluencies in speech pose severe difficulties in machine translation of spontaneous speech. This paper presents our conditional random field (CRF)-based speech disfluency detection system developed on German to improve spoken language translation performance. In order to detect speech disfluencies considering syntactics and semantics of speech utterances, we carried out a CRF-based approach using information learned from the word representation and the phrase table used for machine translation. The word representation is gained using recurrent neural networks and projected words are clustered using the k-means algorithm. Using the output from the model trained with the word representations and phrase table information, we achieve an improvement of 1.96 BLEU points on the lecture test set. By keeping or removing humanannotated disfluencies, we show an upper bound and lower bound of translation quality. In an oracle experiment we gain 3.16 BLEU points of improvement on the lecture test set, compared to the same set with all disfluencies.
Opini Anda
Klik untuk menuliskan opini Anda tentang koleksi ini!
Kembali
Process time: 0.015625 second(s)