Key Components of Artificial Intelligence Technology

Key Components of Artificial Intelligence Technology

Artificial intelligence, as the hottest technology in the current field of science and technology, has attracted the attention of many people both inside and outside the industry. However, the information we focus on daily is mostly about the investment and financing trends in the field of artificial intelligence, the dynamics of AI unicorn companies, the … Read more

Will Speech Recognition Accuracy Ever Reach 100%?

Will Speech Recognition Accuracy Ever Reach 100%?

Illustration by Jay Bendt Written by Wade Roush Translated by Zhao Jianlin Looking back to 2010, Matt Thompson predicted in a commentary for NPR that “in the near future, automatic speech transcription technology will become fast, easy to use, and free.” He referred to that moment as the “speech singularity,” cleverly borrowing from inventor Ray … Read more

Mozilla Open Source Speech Recognition Model and Dataset

Mozilla Open Source Speech Recognition Model and Dataset

Selected from Mozilla Translated by Machine Heart Contributor: Liu Xiaokun Mozilla has great expectations for the potential of speech recognition, but there are still significant barriers to innovation in this field. These challenges prompted the company to launch the DeepSpeech and Common Voice projects. Recently, they released their open-source speech recognition model for the first … Read more

From Speech Recognition to Image Recognition: How AI ‘Sees’ and ‘Hears’

From Speech Recognition to Image Recognition: How AI 'Sees' and 'Hears'

Introduction With the continuous advancement of artificial intelligence technology, the AI’s abilities to “hear” and “see” are becoming increasingly powerful. From speech recognition to image recognition, AI can not only interact with us through sound but also understand and analyze the surrounding world through vision. These technologies have not only changed the way we interact … Read more

Baidu Proposes New Framework for Speech Recognition Using GAN

Baidu Proposes New Framework for Speech Recognition Using GAN

Selected from arXiv Authors: Anuroop Sriram et al. Translated by Machine Heart Contributors: Li Yazhou, Li Zenan Baidu recently published a paper proposing the use of Generative Adversarial Networks (GAN) to achieve a robust speech recognition system. The authors state that the new framework does not rely on the domain-specific knowledge or simplified assumptions often … Read more

Enhancing Online Speech Recognition Efficiency with Upgraded Algorithms

Enhancing Online Speech Recognition Efficiency with Upgraded Algorithms

Recently, Alibaba algorithm expert Kun Cheng participated in the ICASSP 2017 conference with the paper titled Improving Latency-Controlled BLSTM Acoustic Models for Online Speech Recognition. Author Kun Cheng communicating with attendees The research of this paper is based on the premise that to achieve better speech recognition accuracy, the Latency-controlled BLSTM model was used in … Read more

An In-Depth Analysis of Baidu’s Speech Recognition and Wake-Up Technology

An In-Depth Analysis of Baidu's Speech Recognition and Wake-Up Technology

With the popularization of artificial intelligence, speech has become an important interaction method, especially since Baidu’s speech recognition and wake-up technology was launched, it has attracted widespread attention from developers. On August 6, at the 65th “Analysis and Practice of Baidu Speech Recognition and Wake-Up Technology” salon jointly held by Baidu Developer Center and InfoQ, … Read more

Overview of Unresolved Issues in Speech Recognition

Overview of Unresolved Issues in Speech Recognition

Excerpt from Awni Translation by Machine Heart Contributors:Nurhachu Null,Lu Xue Since the application of deep learning in the field of speech recognition, the word error rate has significantly decreased. However, speech recognition has not yet reached human-level performance and still faces multiple unresolved issues. This article discusses various aspects of the unresolved problems in speech … Read more