Source: ZHUAN ZHI
This article is an introduction to a paper, suggested reading time is 5 minutes.
This article discusses three important themes in the fields of graphics and vision: multimodal facial animation, multimodal human body animation, and multimodal digital human image construction, introducing their methodologies and representative works.
Multimodal digital humans refer to realistic virtual humans that possess multimodal cognitive and interactive abilities, along with human-like thinking and behavioral logic. In recent years, with the cross-integration and vigorous development of fields such as computer vision and natural language processing, significant progress has been made in related technologies. This article discusses three important themes in graphics and vision: multimodal facial animation, multimodal human body animation, and multimodal digital human image construction, introducing their methodologies and representative works. Under the theme of multimodal facial animation, it introduces relevant works on voice-driven and expression-driven faces. Under the theme of multimodal human body animation, it discusses human animation generation based on Recurrent Neural Networks (RNN), Transformer-based methods, and denoising diffusion models. Under the theme of multimodal digital human image construction, it introduces visual-language similarity-guided virtual image construction, multimodal denoising diffusion model-guided virtual image construction, and three-dimensional multimodal virtual human generation models. This article categorizes and summarizes representative works in these directions and looks forward to potential future research directions.
About Us
Data Pie THU is a public account focused on data science, backed by the Tsinghua University Big Data Research Center, sharing cutting-edge data science and big data technology innovation research dynamics, continuously disseminating data science knowledge, striving to build a platform for gathering data talents, and creating the strongest army in China’s big data sector.
Sina Weibo: @Data Pie THU
WeChat Video Account: Data Pie THU
Today’s Headlines: Data Pie THU