Speech Recognition Method Based on Multi-Task Loss with Additional Language Model

Speech Recognition Method Based on Multi-Task Loss with Additional Language Model

Click the blue text to follow us DOI:10.3969/j.issn.1671-7775.2023.05.010 Open Science (Resource Service) Identifier Code (OSID): Citation Format: Liu Yongli, Zhang Shaoyang, Wang Yuheng, et al. Speech Recognition Method Based on Multi-Task Loss with Additional Language Model[J]. Journal of Jiangsu University (Natural Science Edition), 2023, 44(5):564-569. Fund Project: Shaanxi Provincial Key Industry Innovation Chain (Group) Project … Read more

Conformer: A Hybrid CNN-Transformer Model for Improved Feature Representation

Conformer: A Hybrid CNN-Transformer Model for Improved Feature Representation

Follow our public account to discover the beauty of CV technology 0 Introduction In Convolutional Neural Networks (CNN), convolution operations excel at extracting local features, but there are certain limitations in capturing global feature representations. In Vision Transformers, cascading self-attention modules can capture long-range feature dependencies but tend to overlook the details of local features. … Read more