MLOS is a Data Science powered infrastructure and methodology to democratize and automate Performance Engineering. MLOS enables continuous, instance-based, robust, and trackable systems optimization.
Обновлено 2024-09-18 03:45:43 +03:00
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Обновлено 2024-09-17 20:41:42 +03:00
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
deep-learning
pytorch
gpu
compression
billion-parameters
data-parallelism
inference
mixture-of-experts
model-parallelism
pipeline-parallelism
trillion-parameters
zero
Обновлено 2024-09-17 19:29:18 +03:00
Collection of dockerized ETL jobs managed by data engineering.
Обновлено 2024-09-17 19:14:31 +03:00
LookML Generator for Glean and Mozilla Data
Обновлено 2024-09-17 18:47:31 +03:00
Best Practices on Recommendation Systems
azure
microsoft
machine-learning
python
deep-learning
kubernetes
data-science
artificial-intelligence
jupyter-notebook
tutorial
operationalization
ranking
rating
recommendation
recommendation-algorithm
recommendation-engine
recommendation-system
recommender
Обновлено 2024-09-17 05:40:34 +03:00
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
hacktoberfest
microsoft
python
privacy
transformers
anonymization
anonymization-service
data-anonymization
data-loss-prevention
data-masking
data-protection
de-identification
dlp
pii
pii-anonymization
pii-anonymization-service
presidio
privacy-protection
text-anonymization
Обновлено 2024-09-16 21:32:09 +03:00
A Mozilla release management tool to send reminders to Firefox developers and improve Bugzilla metadata
Обновлено 2024-09-16 16:38:49 +03:00
Delivering data to Firefox
Обновлено 2024-09-16 13:06:13 +03:00
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.
machine-learning
deep-learning
python
platform
research
finance
algorithmic-trading
auto-quant
fintech
investment
paper
quant
quant-dataset
quant-models
quantitative-finance
quantitative-trading
research-paper
stock-data
Обновлено 2024-09-12 18:44:27 +03:00
Sharing Updatable Models (SUM) on Blockchain
machine-learning
python
react
ai
ml
artificial-intelligence
node
economics
blockchain
ethereum
prediction-mar
prediction-market
smart-contracts
truffle
Обновлено 2024-09-11 00:34:15 +03:00
A database of the imagesets and catalog files constituting the core WWT data corpus.
Обновлено 2024-09-10 17:06:25 +03:00
Extension (Magic) to Jupyter notebook and Jupyter lab, that enable notebook experience working with Kusto, ApplicationInsights, and LogAnalytics data.
Обновлено 2024-09-09 22:33:11 +03:00
Kusto client libraries for Python
Обновлено 2024-09-05 12:08:39 +03:00
Обновлено 2024-09-05 02:49:55 +03:00
Tools for parsing the metadata for Mozilla's glean telemetry SDK
Обновлено 2024-09-02 13:54:19 +03:00
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
machine-learning
data-science
causal-inference
causality
treatment-effects
bayesian-networks
causal-machine-learning
causal-models
do-calculus
graphical-models
python3
Обновлено 2024-08-29 21:13:51 +03:00
Library to access and aggregate several Mozilla data sources.
Обновлено 2024-08-19 16:44:31 +03:00
Structured data files for topics covered by GitHub's Transparency Report
Обновлено 2024-08-14 23:32:50 +03:00
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.
microsoft
machine-learning
computer-vision
video
benchmark
dataset
classification
few-shot-learning
meta-learning
object-recognition
Обновлено 2024-08-13 03:27:45 +03:00
Azure Storage Data Plane SDK supporting multiple API versions
Обновлено 2024-08-08 10:34:32 +03:00
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
machine-learning
deep-learning
nlp
natural-language-processing
privacy
ner
transformers
pii
named-entity-recognition
spacy
flair
Обновлено 2024-08-07 19:17:23 +03:00
Microsoft Azure Data Lake Store Filesystem Library for Python
Обновлено 2024-08-01 22:48:50 +03:00
Python package for graph statistics
Обновлено 2024-07-23 01:36:30 +03:00
Azure Storage transfer tool and data movement library
azure
python
azure-storage
python-library
docker-image
azure-blob
azure-blob-storage
azure-file
data-movement
file-transfer
Обновлено 2024-07-18 21:53:17 +03:00
Обновлено 2024-07-16 22:35:10 +03:00
A python library for intelligently building networks and network embeddings, and for analyzing connected data.
Обновлено 2024-07-11 21:49:18 +03:00
Dataset of Government Open Source Policies
Обновлено 2024-07-06 05:15:10 +03:00
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
machine-learning
deep-learning
data-science
python
tensorflow
mlops
pytorch
hyperparameter-optimization
hyperparameter-tuning
machine-learning-algorithms
model-compression
nas
neural-architecture-search
neural-network
automated-machine-learning
automl
bayesian-optimization
deep-neural-network
distributed
feature-engineering
Обновлено 2024-07-03 13:54:08 +03:00