Ongoing research training transformer language models at scale, including: BERT & GPT-2
Updated 2024-10-18 13:31:05 +03:00
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Updated 2024-01-09 18:03:18 +03:00
Large-scale pretraining for dialogue
machine-learning
pytorch
transformer
dialogue
data-processing
dialogpt
gpt-2
text-data
text-generation
Updated 2022-10-18 02:41:52 +03:00