data-accelerator/DataProcessing
Yinyu Wang f5997b2904 refactor listFiles 2022-09-13 09:15:49 +00:00
DataX.Utilities Update to use .netcore 2.2 and aspnetcore to 2.2.6 (#111) 2019-08-07 17:05:19 -07:00
datax-core Jozavala/log4j applicationinsight appender (#495) 2022-08-29 22:00:46 -07:00
datax-host refactor listFiles 2022-09-13 09:15:49 +00:00
datax-keyvault Bump azure-security-keyvault-secrets in /DataProcessing/datax-keyvault (#498) 2022-08-31 10:10:51 -07:00
datax-udf-samples Jozavala/log4j applicationinsight appender (#495) 2022-08-29 22:00:46 -07:00
datax-utility Jozavala/log4j applicationinsight appender (#495) 2022-08-29 22:00:46 -07:00
.gitignore Add kafka input support for data processing (#57) 2019-05-20 20:08:24 -07:00
CONTRIBUTING.md Update CONTRIBUTING.md 2019-04-16 03:18:40 -07:00
README.md Update README.md 2019-04-16 00:08:46 -07:00
Spark.nuspec Switch from KeyVaultClient to SecretClient and remove sqldb dependency (#194) 2021-10-06 12:29:19 -07:00
deploy.cmd Initial Checkin 2019-04-15 23:57:37 -07:00

README.md

Data Accelerator for Spark Engine

Project Structure

Core

Interface and class definitions for the contracts of the Data Accelerator engine

Host

Spark-specific app jar for Data Accelerator

Keyvault

Classes for keyvault access

Samples

Examples for UDFs and UDAFs in Scala

Utility

Common classes and singleton helpers used across projects

Properties

Some basic rules:

  • A property name acts like a full JSON path that locates a leaf in the JSON config object.
  • The root namespace is datax.job.
  • Property names are all lowercase for known fields of the JSON config object; the exceptions are path segments that come from a Map or an Array.
  • Map case: for example, outputs is a Map from a string to an individual output config. Put that string into the property name as part of the path.
  • Array case: for example, timeWindows is an Array of time window specs. Extract each spec's name and put it into the property name as part of the path.
  • When flattening a Map or Array, change the plural field name to its singular form, e.g. outputs becomes output and timeWindows becomes timewindow.
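
The rules above can be illustrated with a small flattening routine. This is a minimal sketch in Python, not the engine's actual implementation. The function name `flatten`, the `COLLECTIONS` table, the sample config, and the assumption that every Array element carries a `name` field (which is dropped from the element once it is in the path) are all hypothetical and serve only to show how a property name could be derived.

```python
ROOT = "datax.job"

# Assumed table of Map/Array fields and the singular, lowercase segment that
# replaces them in the path. The README names only these two fields.
COLLECTIONS = {"outputs": "output", "timeWindows": "timewindow"}


def flatten(node, path=(ROOT,)):
    """Return {property name: leaf value} for a nested config object."""
    if not isinstance(node, dict):
        # Leaf: the accumulated path becomes the property name.
        return {".".join(path): node}

    props = {}
    for key, value in node.items():
        if key in COLLECTIONS:
            segment = COLLECTIONS[key]
            if isinstance(value, dict):
                # Map case: the map key goes into the path unchanged.
                entries = value.items()
            else:
                # Array case: the element's "name" goes into the path
                # (assumption: each element has a "name" field).
                entries = (
                    (item["name"], {k: v for k, v in item.items() if k != "name"})
                    for item in value
                )
            for name, child in entries:
                props.update(flatten(child, path + (segment, name)))
        else:
            # Known field: lowercase it and descend.
            props.update(flatten(value, path + (key.lower(),)))
    return props


# Hypothetical config, for illustration only.
example = {
    "name": "demo",
    "outputs": {"myBlob": {"compressionType": "gzip"}},
    "timeWindows": [{"name": "last5min", "windowDuration": "5 minutes"}],
}

if __name__ == "__main__":
    for prop, value in flatten(example).items():
        print(f"{prop}={value}")
```

In this sketch the user-chosen names (`myBlob`, `last5min`) keep their original case, while known field names such as `compressionType` are lowercased, which matches the exception for Map and Array segments described above.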