Ongoing research training transformer language models at scale, including: BERT & GPT-2
Updated 2024-09-04 08:42:52 +03:00
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Updated 2024-01-09 18:03:18 +03:00
Large-scale pretraining for dialogue
Updated 2022-10-18 02:41:52 +03:00