Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Обновлено 2024-11-09 13:45:59 +03:00
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Обновлено 2024-07-25 14:07:31 +03:00
Multitask Multilingual Multimodal Pre-training
Обновлено 2021-05-13 09:56:36 +03:00