PostgreSQL Tools Service that provides PostgreSQL Server data management capabilities.
Updated 2024-11-23 03:43:00 +03:00
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
machine-learning
data-science
causal-inference
causality
treatment-effects
bayesian-networks
causal-machine-learning
causal-models
do-calculus
graphical-models
python3
Updated 2024-11-22 19:51:17 +03:00
LookML Generator for Glean and Mozilla Data
Updated 2024-11-22 19:26:50 +03:00
Tools for parsing the metadata for Mozilla's Glean telemetry SDK
Updated 2024-11-22 18:07:30 +03:00
MLOS is a Data Science powered infrastructure and methodology to democratize and automate Performance Engineering. MLOS enables continuous, instance-based, robust, and trackable systems optimization.
Updated 2024-11-22 04:25:26 +03:00
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
machine-learning
deep-learning
nlp
natural-language-processing
privacy
ner
transformers
pii
named-entity-recognition
spacy
flair
Updated 2024-11-22 01:17:04 +03:00
Context-aware, pluggable, and customizable data protection and de-identification SDK for text and images
hacktoberfest
microsoft
python
privacy
transformers
anonymization
anonymization-service
data-anonymization
data-loss-prevention
data-masking
data-protection
de-identification
dlp
pii
pii-anonymization
pii-anonymization-service
presidio
privacy-protection
text-anonymization
Updated 2024-11-22 00:54:06 +03:00
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Updated 2024-11-21 10:30:46 +03:00
Delivering data to Firefox
Updated 2024-11-20 19:47:40 +03:00
A Mozilla release management tool to send reminders to Firefox developers and improve Bugzilla metadata
Updated 2024-11-20 18:30:26 +03:00
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
deep-learning
pytorch
gpu
compression
billion-parameters
data-parallelism
inference
mixture-of-experts
model-parallelism
pipeline-parallelism
trillion-parameters
zero
Updated 2024-11-20 04:04:47 +03:00
[ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.
Updated 2024-11-19 03:32:30 +03:00
Measure how our data deviates from a normal distribution
Updated 2024-11-19 00:24:07 +03:00
Best Practices on Recommendation Systems
azure
microsoft
machine-learning
python
deep-learning
kubernetes
data-science
artificial-intelligence
jupyter-notebook
tutorial
operationalization
ranking
rating
recommendation
recommendation-algorithm
recommendation-engine
recommendation-system
recommender
Updated 2024-11-18 12:48:34 +03:00
Collection of dockerized ETL jobs managed by data engineering.
Updated 2024-11-14 18:35:16 +03:00
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and RL.
machine-learning
deep-learning
python
platform
research
finance
algorithmic-trading
auto-quant
fintech
investment
paper
quant
quant-dataset
quant-models
quantitative-finance
quantitative-trading
research-paper
stock-data
Updated 2024-11-13 06:41:06 +03:00
A database of the imagesets and catalog files constituting the core WWT data corpus.
Updated 2024-11-07 23:10:32 +03:00
Updated 2024-11-06 04:21:03 +03:00
Data access package for the SubseasonalClimateUSA dataset
Updated 2024-11-05 16:57:54 +03:00
Kusto client libraries for Python
Updated 2024-11-03 16:04:30 +03:00
Sharing Updatable Models (SUM) on Blockchain
machine-learning
python
react
ai
ml
artificial-intelligence
node
economics
blockchain
ethereum
prediction-market
smart-contracts
truffle
Updated 2024-10-31 16:38:58 +03:00
Library to access and aggregate several Mozilla data sources.
Updated 2024-10-24 19:20:58 +03:00
Azure Storage Data Plane SDK supporting multiple API versions
Updated 2024-10-14 09:05:04 +03:00
Python package for graph statistics
Updated 2024-10-09 19:41:04 +03:00
Data loading process for OSDU on Azure
Updated 2024-10-02 23:54:05 +03:00
Structured data files for topics covered by GitHub's Transparency Report
Updated 2024-09-30 19:42:54 +03:00
Query Kusto like a pro from the comfort of your Jupyter notebook
Updated 2024-09-25 12:17:26 +03:00
Extension (magic) for Jupyter Notebook and JupyterLab that enables a notebook experience for working with Kusto, ApplicationInsights, and LogAnalytics data.
Updated 2024-09-09 22:33:11 +03:00
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.
microsoft
machine-learning
computer-vision
video
benchmark
dataset
classification
few-shot-learning
meta-learning
object-recognition
Updated 2024-08-13 03:27:45 +03:00