5 readme
Dinesh Chandnani редактировал(а) эту страницу 2019-03-21 14:09:22 -07:00

Data Accelerator for Apache Spark - Open Source

Data Accelerator for Apache Spark simplifies on-boarding to streaming of Big Data using spark. It offers a no-code experience to build Rules and Alerts, as well as and numerous productivity improvements to develop and manage Spark SQL jobs on Azure HDInsights. Data Accelerator is used in production at Microsoft to process terabytes of streamed data every day.

Data Accelerator offers three level of experience

  • The first requires no code at all, using rules to create alerts on data content.
  • The second allows to write SQL-like query with additions like time window, accumulator and more
  • The third enables integrating custom code either in Scala or via Azure functions

You can get started locally for Windows, macOs and Linux on the Local-mode-with-Docker.
To deploy to Azure, you can use the ARM template; see instructions here Cloud-deployment.

The data-accelerator repository contains everything needed to set up an end-to-end data pipeline. There are many ways you can participate in the project:

Getting Started

You can get started with Data Accelerator for Spark in 5 minutes by using a Docker image. Please see the Local-mode-with-Docker to obtain the image then, follow the following steps to create your first data pipeline or Flow.

Once you are ready to deploy to Azure, you can use the deployment scripts and ARM template to instantiate the infrastructure and apply the right settings. See the Cloud-deployment. Data Accelerator for Spark runs on the following:

  • HDInsights with Spark 2.3
  • Service Fabric running DotNet Core 2.1
  • App Service with Node 10.6

See the wiki pages for further information on how to build, diagnose and maintain your data pipelines built using Data Accelerator for Spark.

Contributing

Some of the best ways to contribute are to try things out, join in design conversations, and make pull-requests.

Feedback

Please also see our Code of Conduct.

Security issues

Security issues and bugs should be reported privately, via email, to the Microsoft Security Response Center (MSRC) secure@microsoft.com. You should receive a response within 24 hours. If for some reason you do not, please follow up via email to ensure we received your original message. Further information, including the MSRC PGP key, can be found in the Security TechCenter.

License

This repository is licensed with the MIT license.