This tool helps automatically generate grammatically valid synthetic code-mixed data using linguistic theories such as Equivalence Constraint Theory and Matrix Language Theory.
natural-language-processing
python3
code-switching
linguistics
synthetic-data-generation
code-mixing
data-generation
language-modeling
Updated 2024-07-31 00:01:52 +03:00
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Updated 2024-01-09 18:03:18 +03:00
A library & tools to evaluate predictive language models.
nlp
language-model
evaluation
evaluation-toolkit
language-model-evaluation
lm-challenge
next-word-prediction
ngram-model
prediction-model
research-tool
Updated 2023-08-09 18:22:29 +03:00
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
deep-learning
natural-language-processing
representation-learning
transformers
language-model
natural-language-understanding
pretraining
contrastive-learning
pretrained-language-model
Updated 2023-07-25 17:21:55 +03:00
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
nlp
pytorch
azure-machine-learning
language-model
bert
tuning
finetuning
pretraining
azureml-bert
bert-model
pretrained-models
Updated 2023-06-12 21:59:00 +03:00
The implementation of DeBERTa
representation-learning
self-attention
deeplearning
bert
transformer-encoder
language-model
natural-language-understanding
roberta
Updated 2023-03-25 13:22:59 +03:00
Updated 2022-08-04 22:59:22 +03:00
A crash course for training speech recognition models using DeepSpeech.
Updated 2021-05-16 08:55:34 +03:00
A list of open-source projects from the Microsoft Research NLP Group.
Updated 2020-09-30 01:11:02 +03:00
Generate language models from OSCAR corpora
Updated 2020-03-27 19:12:25 +03:00