DeepSeek-VL: A Preliminary Exploration of Multimodal Models

DeepSeek-VL: A Preliminary Exploration of Multimodal Models

Following the release of large models for language, code, mathematics, etc., DeepSeek has brought another early achievement on the journey towards AGI… DeepSeekVL, jointly expanding training data, model architecture, and training strategies, attempts to build the strongest open-source 7B and 1.3B multimodal models. Highlights Data: Multi-source multimodal data enhances the model’s general cross-modal capabilities, mixing … Read more