[1910.00254] Multilingual End-to-End Speech Translation
https://arxiv.org/abs/1910.0025401.10.2019 · In this paper, we propose a simple yet effective framework for multilingual end-to-end speech translation (ST), in which speech utterances in source languages are directly translated to the desired target languages with a universal sequence-to-sequence architecture. While multilingual models have shown to be useful for automatic speech recognition (ASR) and …
Tutorial Proposal: End-to-End Speech Translation - ACL Anthology
aclanthology.org › 2021Speech translation is the translation of speech in one language typically to text in another, traditionally accomplished through a combination of automatic speech recognition and machine translation. Speech translation has attracted interest for many years, but the recent successful applications of deep learning to both individual tasks have enabled new opportunities through joint modeling, in what we today call ‘end-to-end speech translation.’.
[1802.04200] End-to-End Automatic Speech Translation of ...
https://arxiv.org/abs/1802.0420012.02.2018 · We investigate end-to-end speech-to-text translation on a corpus of audiobooks specifically augmented for this task. Previous works investigated the extreme case where source language transcription is not available during learning nor decoding, but we also study a midway case where source language transcription is available at training time only. In this case, a …