Text and Visual: Introduction to Multiple Visual/Video BERT Papers
Reprinted from WeChat Official Account: AI Technology Review Author: Yang Xiaofan Since the success of Google’s BERT model in 2018, more and more researchers have drawn on BERT’s ideas for tasks beyond pure text, developing various visual/video (Visual/Video) fusion BERT models.Here we introduce the original VideoBERT paper and six other recent V-BERT papers (sorted in … Read more