What Is Multimodal Interaction?

The Internet of Things (IoT), which has gained significant popularity in recent years, can be considered a prototype of ubiquitous computing. Small, inexpensive networked devices are distributed throughout daily life and serve users by interconnecting with one another. Computing devices will no longer rely solely on command lines and graphical interfaces for … Read more

Easily Convert Text and Voice

Text and voice each have their strengths: text spares the ears, while voice spares the eyes. Neither is simply better than the other, so we often take what we need from each. However, if you need text but only have a voice recording or a video containing speech, conversion is necessary. Conversely, the same … Read more

Voice Recognition System Application in Peking University People’s Hospital Radiology Department

e-Medical Pang Tao: The application of voice recognition technology at Peking University People’s Hospital (hereinafter referred to as “PKUPH”) began with the Radiology Department. As the first doctor to engage with and … Read more

Ultra-High Sensitivity Fiber Optic Microphone with Corrugated Graphene Diaphragm for Voice Recognition

1 Introduction: To avoid the interference of unexpected background noise and obtain high-fidelity voice signals, voice recognition urgently requires acoustic sensors with high sensitivity, flat frequency response, and high signal-to-noise ratio (SNR). Graphene oxide (GO) has gained widespread attention due to its controllable thickness and high tensile strength. However, low mechanical sensitivity (SM) caused by undesirable … Read more

Baidu AI Series: Open Capabilities

One original article every week, focusing on 5G, IoT, and artificial intelligence. In the previous articles, we detailed Huawei’s AI capabilities and strategy. Starting today, we will explore Baidu’s AI capabilities and strategy, one article per week. Everyone is welcome to join the … Read more

Exploration of Mobile Electronic Medical Records Based on Voice Recognition

Introduction: With the rapid development of the internet and the widespread adoption of mobile terminals, voice recognition technology is increasingly used in hospital information systems. This article explores how to combine voice recognition technology, mobile smart terminals, and electronic medical record data entry to improve … Read more

Application and Development of Intelligent Voice Recognition Technology in Commercial Banks in the FinTech Era

Authors | Wang Yanbo, Gui Xiaoke, Yang Xuan (China Minsheng Bank); Du Xinkai (ZTE Corporation); Lu Jiahui (Wuhan University). Influenced by multiple factors such as interest rate liberalization, the rapid development of internet finance, and the economy entering a new era, the traditional operating model of domestic banks is facing … Read more

Voice Recognition Technology: A New Era of Medical Informatization

e-Medical Pang Tao: Driven by the continued deepening of informatization and strong policy support, the medical industry is gradually moving from informatization toward intelligent systems. As a technology that makes information production … Read more

How Intelligent Voice Technology Empowers Media Integration

Artificial intelligence technologies represented by voice recognition and voice synthesis have been widely applied in fields such as telecommunications, media, and government services. 01 Methods. News Voice Synthesis Technology: Voice synthesis technology converts text into speech so that it can be read aloud fluently. It is also known as Text to Speech … Read more

Voice Recognition Control System Based on Python

The topic is simple: use voice recognition to identify spoken words, then move a graphic according to what was said. For example, if you say ‘up’, the graphic on the canvas moves upwards. This article uses the Baidu recognition API (because it’s free). Here’s a flowchart I created: Without further … Read more
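
The control step described in this excerpt (a recognized word moves the graphic) could look roughly like the sketch below. This is not the linked article's code: the names `MOVES` and `apply_command`, the step size, and the assumption that canvas y-coordinates grow downward are all hypothetical. The recognition call itself is left out, so the function simply takes whatever text the recognizer returned.

```python
# Hypothetical mapping from a recognized word to a (dx, dy) direction.
# Assumption: canvas y-coordinates grow downward, so "up" decreases y.
MOVES = {
    "up": (0, -1),
    "down": (0, 1),
    "left": (-1, 0),
    "right": (1, 0),
}


def apply_command(position, recognized_text, step=10):
    """Return the new (x, y) after applying the first direction word found.

    `recognized_text` stands in for the string returned by a speech
    recognizer; if it contains no known direction, the position is unchanged.
    """
    x, y = position
    for word in recognized_text.lower().split():
        if word in MOVES:
            dx, dy = MOVES[word]
            return (x + dx * step, y + dy * step)
    return position
```

For instance, calling `apply_command((100, 100), "up")` moves the point ten pixels toward the top of the canvas, while unrecognized text leaves it where it is.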