Comparative Study of Transformer and RNN in Speech Applications
Original link: https://arxiv.org/pdf/1909.06317.pdf

Abstract: Sequence-to-sequence models are widely used in end-to-end speech processing tasks such as Automatic Speech Recognition (ASR), Speech Translation (ST), and Text-to-Speech (TTS). This paper focuses on a novel sequence-to-sequence model called the Transformer, which has achieved state-of-the-art performance in neural machine translation and other natural language processing applications. We conducted an in-depth …