A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
machine-learning
deep-learning
data-science
python
automated-machine-learning
natural-language-processing
hyperparameter-optimization
automl
jupyter-notebook
timeseries-forecasting
tuning
classification
finetuning
hyperparam
natural-language-generation
random-forest
regression
scikit-learn
tabular-data
Обновлено 2024-11-20 10:51:18 +03:00
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
machine-learning
deep-learning
nlp
natural-language-processing
privacy
ner
transformers
pii
named-entity-recognition
spacy
flair
Обновлено 2024-10-27 15:51:02 +03:00
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
natural-language-processing
python3
code-switching
linguistics
synthetic-data-generation
code-mixing
data-generation
language-modeling
Обновлено 2024-07-31 00:01:52 +03:00
Azure Search Cognitive Skill to extract technical and business skills from text
Обновлено 2024-04-25 08:03:40 +03:00
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
deep-learning
natural-language-processing
representation-learning
transformers
language-model
natural-language-understanding
pretraining
contrastive-learning
pretrained-language-model
Обновлено 2023-07-25 17:21:55 +03:00
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
deep-learning
pytorch
natural-language-processing
artificial-intelligence
dnn
question-answering
model-compression
text-classification
knowledge-distillation
text-matching
qna
sequence-labeling
Обновлено 2023-07-22 06:07:54 +03:00
Multi-Task Deep Neural Networks for Natural Language Understanding
Обновлено 2023-06-13 00:28:35 +03:00
The implementation of DeBERTa
representation-learning
self-attention
deeplearning
bert
transformer-encoder
language-model
natural-language-understanding
roberta
Обновлено 2023-03-25 13:22:59 +03:00
Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Обновлено 2022-11-28 22:10:04 +03:00
Natural Language Processing Best Practices & Examples
machine-learning
deep-learning
nlp
natural-language-processing
best-practices
text-classification
natural-language-understanding
nlu
azure-ml
text
mlflow
pretrained-models
nli
natural-language
natural-language-inference
sota
transfomer
Обновлено 2022-08-29 16:59:54 +03:00
Datasets, tools, and benchmarks for representation learning of code.
machine-learning
deep-learning
data-science
ml
python
tensorflow
neural-networks
open-data
datasets
cnn
machine-learning-on-source-code
natural-language-processing
nlp
nlp-machine-learning
bert
programming-language-theory
representation-learning
rnn
self-attention
data
Обновлено 2022-01-31 12:25:07 +03:00
This is a list of open-source projects at Microsoft Research NLP Group
Обновлено 2020-09-30 01:11:02 +03:00
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
natural-language-processing
python3
code-mixing
code-switching
language-identification
language-tags
linguistics
mallet
Обновлено 2020-08-12 02:05:32 +03:00
Unsupervised factor-based text tokenizer for natural-language processing applications
Обновлено 2020-07-24 22:30:59 +03:00