* Add data monitoring DAG
* made final tweaks to the data_monitoring DAG
* made changes as requested by @alekhyamoz in PR#1583
Co-authored-by: Alekhya Kommasani <akommasani@mozilla.com>
* added script to validate DAG tags and a corresponding CircleCI step
* trying out DAG tag validation through parsing
* added missing tag so that tag check does not fail
* Used a SQL approach for validation; added extra logging and cleanup
* added check to make sure all DAGs have tags
* fixed 3 DAGs missing tags
* implemented suggestions by @haroldwoo in #1446
* Added gitignore directives for artifacts left by tests and by staging the Airflow instance locally
* Converted the taar_lite_guidguid job to use SubDagOperator and moz_dataproc_pyspark_runner.
* Converted AMO jobs to use GKEPodOperator
* deleted broken connection stub for `google_cloud_derived_datasets`
* changed job to use temp bucket while testing whole DAG
* Downsampled the taar-lite GUID GUID job to 5%
* swapped test GCS location for pyspark job with production location