Обзор - Git

microsoft / FLAML

Jupyter Notebook 0 0

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

machine-learning deep-learning data-science python automated-machine-learning natural-language-processing hyperparameter-optimization automl jupyter-notebook timeseries-forecasting tuning classification finetuning hyperparam natural-language-generation random-forest regression scikit-learn tabular-data

Обновлено 2024-11-20 10:51:18 +03:00

microsoft / presidio-research

Python 0 0

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

machine-learning deep-learning nlp natural-language-processing privacy ner transformers pii named-entity-recognition spacy flair

Обновлено 2024-10-27 15:51:02 +03:00

microsoft / CodeMixed-Text-Generator

JavaScript 0 0

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

natural-language-processing python3 code-switching linguistics synthetic-data-generation code-mixing data-generation language-modeling

Обновлено 2024-07-31 00:01:52 +03:00

microsoft / SkillsExtractorCognitiveSearch

Python 0 0

Azure Search Cognitive Skill to extract technical and business skills from text

natural-language-processing cognitive-search azure-search named-entity-recognition

Обновлено 2024-04-25 08:03:40 +03:00

microsoft / COCO-LM

Python 0 0

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

deep-learning natural-language-processing representation-learning transformers language-model natural-language-understanding pretraining contrastive-learning pretrained-language-model

Обновлено 2023-07-25 17:21:55 +03:00

microsoft / NeuronBlocks

Python 0 0

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

deep-learning pytorch natural-language-processing artificial-intelligence dnn question-answering model-compression text-classification knowledge-distillation text-matching qna sequence-labeling

Обновлено 2023-07-22 06:07:54 +03:00

microsoft / MT-DNN

Python 0 0

Multi-Task Deep Neural Networks for Natural Language Understanding

natural-language-processing natural-language-understanding mt-dnn multi-task-learning

Обновлено 2023-06-13 00:28:35 +03:00

microsoft / DeBERTa

Python 0 0

The implementation of DeBERTa

representation-learning self-attention deeplearning bert transformer-encoder language-model natural-language-understanding roberta

Обновлено 2023-03-25 13:22:59 +03:00

microsoft / cookiecutter-spacy-fastapi

Python 0 0

Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker

natural-language-processing cognitive-search azure-search fastapi spacy

Обновлено 2022-11-28 22:10:04 +03:00

microsoft / nlp-recipes

Python 0 0

Natural Language Processing Best Practices & Examples

machine-learning deep-learning nlp natural-language-processing best-practices text-classification natural-language-understanding nlu azure-ml text mlflow pretrained-models nli natural-language natural-language-inference sota transfomer

Обновлено 2022-08-29 16:59:54 +03:00

github / CodeSearchNet

Jupyter Notebook 0 0

Datasets, tools, and benchmarks for representation learning of code.

Обновлено 2022-01-31 12:25:07 +03:00

microsoft / MSR-NLP-Projects

Markdown 0 0

This is a list of open-source projects at Microsoft Research NLP Group

pytorch nlp natural-language-processing dialogue grounded-generation language-models

Обновлено 2020-09-30 01:11:02 +03:00

microsoft / LID-tool

Python 0 0

This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.

natural-language-processing python3 code-mixing code-switching language-identification language-tags linguistics mallet

Обновлено 2020-08-12 02:05:32 +03:00

microsoft / factored-segmenter

C# 0 0

Unsupervised factor-based text tokenizer for natural-language processing applications

Обновлено 2020-07-24 22:30:59 +03:00