Client library for Azure Databricks
Обновлено 2024-11-06 11:31:07 +03:00
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
azure
docker
react
nodejs
iot
cosmosdb
spark
hdinsight
big-data
spark-streaming
spark-sql
streaming
apache-spark
servicefabric
kafka
streaming-data
iothub
sparksql
eventhub
kafka-streams
Обновлено 2024-11-04 10:02:02 +03:00
Simple and Distributed Machine Learning
azure
microsoft
machine-learning
deep-learning
ai
data-science
opencv
ml
cognitive-services
spark
http
big-data
lightgbm
databricks
onnx
apache-spark
pyspark
scala
model-deployment
synapse
Обновлено 2024-10-25 12:11:14 +03:00
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
azure
microsoft
spark
event-hubs
eventhubs
streaming
databricks
stream
apache-spark
bigdata
apache
spark-streaming
kafka
scala
connector
structured-streaming
ingestion
continuous
real-time
Обновлено 2024-06-11 23:11:35 +03:00
End-to-end proof of concept showing core MLOps practices to develop, deploy and monitor a machine learning model for an employee retention workload using Databricks and Kubernetes on Microsoft Azure.
Обновлено 2024-05-28 07:06:17 +03:00
Apache Spark Connector for Azure Cosmos DB
spark
jupyter-notebook
cosmos-db
azure-cosmos-db
apache-spark
databricks
pyspark
databricks-notebooks
azure-databricks
connector
changefeed
lambda-architecture
Обновлено 2024-05-21 00:00:20 +03:00
Azure AI Camp - 2 day workshop on Databricks and Azure ML
Обновлено 2023-07-23 07:44:56 +03:00
Presented at Ready FY19 , refreshed for BUILD 2019 and Ready FY20, this repo follows the WhatTheHack workshop format
Обновлено 2023-06-12 22:27:38 +03:00
A set of Build and Release tasks for Building, Deploying and Testing Databricks notebooks
Обновлено 2023-06-12 21:21:29 +03:00
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
azure
python
security
spark
performance
deployment
provisioning
azuredatabricks
grafana
performance-monitoring
scalability
Обновлено 2023-03-28 19:44:37 +03:00
Sample notebooks for optimized training and inference of Hugging Face models on Azure Databricks
Обновлено 2023-03-28 19:41:14 +03:00
this is a python framework that helps to build any data engineering and data science solutions in Databricks
Обновлено 2023-03-22 20:23:36 +03:00
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
dotnet
csharp
dotnet-core
fsharp
microsoft
dotnet-standard
machine-learning
spark
spark-sql
spark-streaming
streaming
tpcds
analytics
tpch
apache-spark
azure
bigdata
databricks
emr
hdinsight
Обновлено 2023-02-18 00:56:32 +03:00
This is a Golang SDK for Azure DataBricks REST API 2.0
Обновлено 2022-12-21 12:33:01 +03:00
Testing framework for Databricks notebooks
Обновлено 2022-12-16 19:30:53 +03:00
An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Обновлено 2022-11-28 22:10:31 +03:00
Обновлено 2021-12-21 04:38:10 +03:00
GitHub Action that imports Databricks notebooks from a local path into the Databricks worspace
Обновлено 2021-10-01 00:29:00 +03:00
GitHub Action that generates an Azure AD token for a service principal to access Azure Databricks
Обновлено 2021-09-23 22:28:59 +03:00
GitHub Action that installs Databricks CLI
Обновлено 2021-09-22 22:46:14 +03:00
Secure Databricks cluster with Data exfiltration Protection and other services using Bicep
Обновлено 2021-08-31 13:04:15 +03:00
Kubernetes Operator for Databricks
Обновлено 2021-06-04 09:33:13 +03:00
Kubernetes Operator for Databricks
Обновлено 2021-06-04 09:33:13 +03:00
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
azure
data
r
visualization
workshop-materials
anomaly-detection
azure-databricks
databricks-notebooks
timeseries-forecasting
time-series-analysis
sparksql
311-data
aiforsocialgood
anomalydiscovery
datascience-machinelearning
eda
geospatial
leaflet
opendata
sparkr
Обновлено 2021-05-03 23:14:01 +03:00
Обновлено 2021-01-07 14:24:18 +03:00
Обновлено 2020-07-22 17:00:13 +03:00
A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and MLflow
Обновлено 2020-07-17 18:09:01 +03:00
A mechanism to ensure a series of configured jobs are loaded into an Azure Databricks instance
Обновлено 2020-07-07 21:01:54 +03:00
A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight
Обновлено 2020-06-04 21:46:48 +03:00
A set of example build and release pipelines for deploying a modern Data Estate ETL Pipeline with Data Factory, Azure SQL, and Databricks Notebooks
Обновлено 2020-02-15 01:36:37 +03:00