Граф коммитов

48 Коммитов

Автор SHA1 Сообщение Дата
Jeff Klukas f58b859d53
Update example usage for modern Spark versions (#11)
sqlContext is deprecated
2018-07-24 12:04:38 -04:00
Jeff Klukas 6064192b8e
Update pyspark usage instructions to avoid warning (#10)
UserWarning: inferring schema from dict is deprecated,
please use pyspark.sql.Row instead
2018-07-24 12:04:27 -04:00
Jeff Klukas e5a24f2b0e
Prevent spPackage duplicating python files (#9)
Fixes deploy failure due to changes introduced in
https://github.com/mozilla/spark-hyperloglog/pull/8
2018-07-05 12:01:00 -04:00
Jeff Klukas 2cd6996e84
Add filters to CircleCI test job (#8)
Per https://circleci.com/docs/2.0/workflows/#git-tag-job-execution

> a job must have a filters tags section to run as a part of a tag push
2018-07-03 13:31:42 -04:00
Jeff Klukas fad9890c51
Bug 1466936 - Include python files in jar and use tag-based publishing (#7)
This allows us to access the python bindings for the package 
even if we pull from maven rather than spark-packages.org like so:

```
pyspark --packages com.mozilla.telemetry:spark-hyperloglog_2.11:2.2.0.1 --repositories https://s3-us-west-2.amazonaws.com/net-mozaws-data-us-west-2-ops-mavenrepo/releases/
```
2018-07-03 13:12:33 -04:00
Jeff Klukas 6c18f928e7
Merge pull request #6 from jklukas/spark-packages
Bug 1466936 - Distribute via spark-packages.org
2018-06-29 12:48:40 -04:00
Jeff Klukas ec3f75fba5 Comment on publish job in circle config 2018-06-29 12:38:11 -04:00
Jeff Klukas f221cf9f8a Move CI publish to a separate job 2018-06-29 11:48:21 -04:00
Jeff Klukas f16066b82d Avoid pyspark UserWarning 2018-06-28 11:42:13 -04:00
Jeff Klukas fe04a23146 Bug 1466936 - Distribute via spark-packages.org
This PR changes deployment for both the python and Scala packages.
In addition to deploying to Mozilla's S3 Maven repo, we deploy to
spark-packages.org so that both Scala and python bindings are available
via invoking Spark with the --packages option set.

We also move all CI to CircleCI 2.0, update some dependency versions, etc.
2018-06-28 09:39:52 -04:00
Anthony Miyaguchi ea3f4270ed
Merge pull request #5 from acmiyaguchi/pyspark-hyperloglog
Bug 1305087 - Add python bindings in pyspark-hyperloglog
2018-05-25 12:59:21 -07:00
Anthony Miyaguchi 456d340f11 Update README with troubleshooting 2018-05-25 12:57:41 -07:00
Anthony Miyaguchi 316cfb0bdc Update README and fix sparksql initialization 2018-02-05 11:58:45 -08:00
Anthony Miyaguchi fecbada627 Unify versioning for scala and python packages
Also adds newlines to the end of files
2018-02-05 11:58:45 -08:00
Anthony Miyaguchi 92ca32e24a Prepare package for distribution 2018-01-29 16:53:40 -08:00
Anthony Miyaguchi 009a089b30 Add initial python package and bindings 2018-01-29 16:52:30 -08:00
Anthony Miyaguchi 4dd61c25c3 Add registration function for methods 2018-01-29 16:52:30 -08:00
haroldwoo 6b54c3f61e
Merge pull request #4 from haroldwoo/spark-hyperloglog-fixbadge
fixing circleci readme badge
2017-11-03 07:57:33 -07:00
Harold Woo 8c4f882bce fixing readme badge 2017-11-03 07:25:43 -07:00
Frank Bertsch 8ce3415562
Merge pull request #3 from haroldwoo/spark-hyperloglog-addmaven
[Bug 1375018] Spark hyperloglog add maven local repo to replace sonatype
2017-11-02 10:23:43 -05:00
Harold Woo b2aa78df72 Bug 1375018 replace sonatype with local s3 maven repo 2017-11-02 07:02:01 -07:00
Frank Bertsch 6dd4e7627d Merge pull request #2 from fbertsch/sonatype
Ready spark-hyperloglog for publishing on Sonatype
2017-08-08 13:37:48 -05:00
Frank Bertsch ae1c584fc5 Ready spark-hyperloglog for publishing on Sonatype 2017-08-08 12:45:18 -05:00
Frank Bertsch 51e72ba4a5 Merge pull request #1 from fbertsch/updates
Add spark-hyperloglog to Mozilla
2017-07-18 07:39:27 -05:00
Frank Bertsch b1eb8fc1f9 Support filtered counts 2017-07-17 15:38:15 -05:00
Frank Bertsch 4462d48b78 Handle null and empty values 2017-07-13 22:50:43 -05:00
Frank Bertsch e4e0ec110c Update spark to 2.0.2 2017-07-13 22:47:43 -05:00
Roberto Agostino Vitillo 7a30b2efd7 Update settings for spPublish 2016-09-14 14:26:46 +01:00
Roberto Agostino Vitillo f483886061 Bump version to deal with Spark Packages error 2016-09-14 13:57:25 +01:00
Roberto Agostino Vitillo d7f51ebe97 Update sbt-spark-package plugin 2016-09-14 13:48:36 +01:00
Roberto Agostino Vitillo 71d8e6c992 Fix non working coverage icon 2016-09-14 13:48:36 +01:00
Roberto Agostino Vitillo 051915d903 Add deployment steps to documentation 2016-09-14 13:48:30 +01:00
Roberto Agostino Vitillo c463a7e609 Port package to Spark 2.0 2016-09-14 13:29:11 +01:00
Roberto Agostino Vitillo c792a491f1 Merge pull request #1 from harterrt/test_scope
Scope scalatest to test, not global, dependency
2016-09-06 16:47:02 +01:00
Ryan Harter 817ef07c6d Scope scalatest to test, not global, dependency
For context, see Bug 1298123: Refactor dataset classes so they do not
use case classes

Upgrading the telemetry-batch-view library to 2.11 creates a scalatest
dependency conflict with this package. Limiting the scalatest dependency
to testing will remove the current and prevent future conflicts.
2016-09-06 08:22:03 -07:00
Roberto Agostino Vitillo b3569e9368 Merge branch 'master' of github.com:vitillo/spark-hyperloglog 2016-04-11 13:24:02 +00:00
Roberto Agostino Vitillo 4aa07fe64d Bump version. 2016-04-11 13:23:12 +00:00
Roberto Agostino Vitillo 0f1be32975 Rename function package object. 2016-04-11 13:22:37 +00:00
Roberto Agostino Vitillo e298cc8ebb Update README.md 2016-04-09 15:58:27 +01:00
Roberto Agostino Vitillo faffc27b31 Add scalastyle checks. 2016-04-09 15:43:33 +01:00
Roberto Agostino Vitillo ae197818a3 Add coverage. 2016-04-09 15:32:33 +01:00
Roberto Agostino Vitillo 1116f59a8d Add travis configuration. 2016-04-09 15:20:45 +01:00
Roberto Agostino Vitillo 110ff7183c Add travis configuration. 2016-04-09 15:17:27 +01:00
Roberto Agostino Vitillo 574d9b9235 Add sbt plugins. 2016-04-09 15:14:17 +01:00
Roberto Agostino Vitillo c5b0d4e2b5 Bump version. 2016-04-09 15:10:44 +01:00
Roberto Agostino Vitillo 67937fc351 Configure distribution. 2016-04-09 15:07:08 +01:00
Roberto Agostino Vitillo f3908f507b Update README.md 2016-04-09 08:51:48 +01:00
Roberto Agostino Vitillo 7a1ebebbd6 First commit. 2016-04-09 08:49:56 +01:00