TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Обновлено 2024-11-21 10:30:46 +03:00
Обновлено 2024-11-06 04:21:03 +03:00
A high-performance modern set of graph rendering components, which enables users to visualize large graph datasets on the web.
Обновлено 2024-08-29 15:29:55 +03:00
Notebooks and documentation for AI-for-Earth-managed datasets on Azure
Обновлено 2024-07-25 14:51:04 +03:00
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
nlp
ml
ner
named-entity-recognition
entity-extraction
entity-linking
entity-resolution
grn
language-understanding
linkingpark
nlp-resources
unitrans
xl-ner
bertel
can-ner
cross-lingual-ner
entity-disambiguation
Обновлено 2024-03-16 09:53:11 +03:00
Unity's privacy-preserving human-centric synthetic data generator
unity
unity3d
deep-learning
computer-vision
pose-estimation
object-detection
synthetic-data
perception
synthetic-dataset-generation
billing-5160
synthetic-datasets
applied-ml-research
human-activity-recognition
human-centric-ml
human-pose-estimation
icml-2022
labeling
owner-machine-learning
synthetic-data-generation
transfer-learning
Обновлено 2024-03-05 04:05:37 +03:00
Normalized Trend Filtering for Biomedical Datasets
Обновлено 2023-07-07 01:07:14 +03:00
Tools to compare metrics between datasets, accounting for population differences and invariant features.
Обновлено 2023-07-07 00:36:40 +03:00
Open Datasets example notebooks
Обновлено 2022-11-28 22:35:18 +03:00
Synthetic Dataset Insights
Обновлено 2022-09-23 22:35:49 +03:00
Datasets, tools, and benchmarks for representation learning of code.
machine-learning
deep-learning
data-science
ml
python
tensorflow
neural-networks
open-data
datasets
cnn
machine-learning-on-source-code
natural-language-processing
nlp
nlp-machine-learning
bert
programming-language-theory
representation-learning
rnn
self-attention
data
Обновлено 2022-01-31 12:25:07 +03:00
Create WTML from lists of HiPS datasets
Обновлено 2021-12-27 18:51:34 +03:00
A model to merge firm data across datasets utilizing exact & non-exact firm identifiers
Обновлено 2021-08-27 20:01:20 +03:00
Обновлено 2021-05-21 20:47:45 +03:00
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
azure
data
r
visualization
workshop-materials
anomaly-detection
azure-databricks
databricks-notebooks
timeseries-forecasting
time-series-analysis
sparksql
311-data
aiforsocialgood
anomalydiscovery
datascience-machinelearning
eda
geospatial
leaflet
opendata
sparkr
Обновлено 2021-05-03 23:14:01 +03:00
Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)
Обновлено 2020-10-13 21:31:30 +03:00
Обновлено 2020-04-08 02:00:58 +03:00
Hive import statement generator for Parquet datasets
Обновлено 2018-11-29 01:22:00 +03:00
INACTIVE - http://mzl.la/ghe-archive - a wrapper around Promises to load & combine multiple datasets
Обновлено 2017-08-24 20:05:09 +03:00