This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Обновлено 2024-07-31 00:01:52 +03:00
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Обновлено 2024-01-09 18:03:18 +03:00
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Обновлено 2023-07-25 17:21:55 +03:00
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Обновлено 2023-06-12 21:59:00 +03:00
Обновлено 2022-08-04 22:59:22 +03:00
A crash course for training speech recognition models using DeepSpeech.
Обновлено 2021-05-16 08:55:34 +03:00
This is a list of open-source projects at Microsoft Research NLP Group
Обновлено 2020-09-30 01:11:02 +03:00
Generate language models from OSCAR corpora
Обновлено 2020-03-27 19:12:25 +03:00