Enhancing Multi-Modal Data: MixGen from Amazon’s Li Mu Team

Enhancing Multi-Modal Data: MixGen from Amazon's Li Mu Team

Follow our public account to discover the beauty of CV technology This article shares the paper「MixGen: A New Multi-Modal Data Augmentation」, how to perform data augmentation on multi-modal data? The Amazon Li Mu team proposed a simple and effective MixGen, significantly improving performance across multiple multi-modal tasks! Details are as follows: Paper link: https://arxiv.org/abs/2206.08358 Code … Read more