Jeff Klukas
f58b859d53
Update example usage for modern Spark versions ( #11 )
sqlContext is deprecated
2018-07-24 12:04:38 -04:00
Jeff Klukas
6064192b8e
Update pyspark usage instructions to avoid warning ( #10 )
UserWarning: inferring schema from dict is deprecated,
please use pyspark.sql.Row instead
2018-07-24 12:04:27 -04:00
Jeff Klukas
e5a24f2b0e
Prevent spPackage duplicating python files ( #9 )
Fixes deploy failure due to changes introduced in
https://github.com/mozilla/spark-hyperloglog/pull/8
2018-07-05 12:01:00 -04:00
Jeff Klukas
2cd6996e84
Add filters to CircleCI test job ( #8 )
Per https://circleci.com/docs/2.0/workflows/#git-tag-job-execution
> a job must have a filters tags section to run as a part of a tag push
2018-07-03 13:31:42 -04:00
Jeff Klukas
fad9890c51
Bug 1466936 - Include python files in jar and use tag-based publishing ( #7 )
This allows us to access the Python bindings for the package
even if we pull from Maven rather than spark-packages.org, like so:
```
pyspark --packages com.mozilla.telemetry:spark-hyperloglog_2.11:2.2.0.1 --repositories https://s3-us-west-2.amazonaws.com/net-mozaws-data-us-west-2-ops-mavenrepo/releases/
```
2018-07-03 13:12:33 -04:00
Jeff Klukas
6c18f928e7
Merge pull request #6 from jklukas/spark-packages
Bug 1466936 - Distribute via spark-packages.org
2018-06-29 12:48:40 -04:00
Jeff Klukas
ec3f75fba5
Comment on publish job in circle config
2018-06-29 12:38:11 -04:00
Jeff Klukas
f221cf9f8a
Move CI publish to a separate job
2018-06-29 11:48:21 -04:00
Jeff Klukas
f16066b82d
Avoid pyspark UserWarning
2018-06-28 11:42:13 -04:00
Jeff Klukas
fe04a23146
Bug 1466936 - Distribute via spark-packages.org
This PR changes deployment for both the Python and Scala packages.
In addition to deploying to Mozilla's S3 Maven repo, we deploy to
spark-packages.org so that both the Scala and Python bindings are available
when Spark is invoked with the --packages option set.
We also move all CI to CircleCI 2.0, update some dependency versions, etc.
2018-06-28 09:39:52 -04:00
Anthony Miyaguchi
ea3f4270ed
Merge pull request #5 from acmiyaguchi/pyspark-hyperloglog
Bug 1305087 - Add python bindings in pyspark-hyperloglog
2018-05-25 12:59:21 -07:00
Anthony Miyaguchi
456d340f11
Update README with troubleshooting
2018-05-25 12:57:41 -07:00
Anthony Miyaguchi
316cfb0bdc
Update README and fix sparksql initialization
2018-02-05 11:58:45 -08:00
Anthony Miyaguchi
fecbada627
Unify versioning for scala and python packages
Also adds newlines to the end of files
2018-02-05 11:58:45 -08:00
Anthony Miyaguchi
92ca32e24a
Prepare package for distribution
2018-01-29 16:53:40 -08:00
Anthony Miyaguchi
009a089b30
Add initial python package and bindings
2018-01-29 16:52:30 -08:00
Anthony Miyaguchi
4dd61c25c3
Add registration function for methods
2018-01-29 16:52:30 -08:00
haroldwoo
6b54c3f61e
Merge pull request #4 from haroldwoo/spark-hyperloglog-fixbadge
fixing circleci readme badge
2017-11-03 07:57:33 -07:00
Harold Woo
8c4f882bce
fixing readme badge
2017-11-03 07:25:43 -07:00
Frank Bertsch
8ce3415562
Merge pull request #3 from haroldwoo/spark-hyperloglog-addmaven
[Bug 1375018] Spark hyperloglog add maven local repo to replace sonatype
2017-11-02 10:23:43 -05:00
Harold Woo
b2aa78df72
Bug 1375018 replace sonatype with local s3 maven repo
2017-11-02 07:02:01 -07:00
Frank Bertsch
6dd4e7627d
Merge pull request #2 from fbertsch/sonatype
Ready spark-hyperloglog for publishing on Sonatype
2017-08-08 13:37:48 -05:00
Frank Bertsch
ae1c584fc5
Ready spark-hyperloglog for publishing on Sonatype
2017-08-08 12:45:18 -05:00
Frank Bertsch
51e72ba4a5
Merge pull request #1 from fbertsch/updates
Add spark-hyperloglog to Mozilla
2017-07-18 07:39:27 -05:00
Frank Bertsch
b1eb8fc1f9
Support filtered counts
2017-07-17 15:38:15 -05:00
Frank Bertsch
4462d48b78
Handle null and empty values
2017-07-13 22:50:43 -05:00
Frank Bertsch
e4e0ec110c
Update spark to 2.0.2
2017-07-13 22:47:43 -05:00
Roberto Agostino Vitillo
7a30b2efd7
Update settings for spPublish
2016-09-14 14:26:46 +01:00
Roberto Agostino Vitillo
f483886061
Bump version to deal with Spark Packages error
2016-09-14 13:57:25 +01:00
Roberto Agostino Vitillo
d7f51ebe97
Update sbt-spark-package plugin
2016-09-14 13:48:36 +01:00
Roberto Agostino Vitillo
71d8e6c992
Fix non-working coverage icon
2016-09-14 13:48:36 +01:00
Roberto Agostino Vitillo
051915d903
Add deployment steps to documentation
2016-09-14 13:48:30 +01:00
Roberto Agostino Vitillo
c463a7e609
Port package to Spark 2.0
2016-09-14 13:29:11 +01:00
Roberto Agostino Vitillo
c792a491f1
Merge pull request #1 from harterrt/test_scope
Scope scalatest to test, not global, dependency
2016-09-06 16:47:02 +01:00
Ryan Harter
817ef07c6d
Scope scalatest to test, not global, dependency
For context, see Bug 1298123: Refactor dataset classes so they do not
use case classes
Upgrading the telemetry-batch-view library to 2.11 creates a scalatest
dependency conflict with this package. Limiting the scalatest dependency
to testing will remove the current conflict and prevent future ones.
2016-09-06 08:22:03 -07:00
Roberto Agostino Vitillo
b3569e9368
Merge branch 'master' of github.com:vitillo/spark-hyperloglog
2016-04-11 13:24:02 +00:00
Roberto Agostino Vitillo
4aa07fe64d
Bump version.
2016-04-11 13:23:12 +00:00
Roberto Agostino Vitillo
0f1be32975
Rename function package object.
2016-04-11 13:22:37 +00:00
Roberto Agostino Vitillo
e298cc8ebb
Update README.md
2016-04-09 15:58:27 +01:00
Roberto Agostino Vitillo
faffc27b31
Add scalastyle checks.
2016-04-09 15:43:33 +01:00
Roberto Agostino Vitillo
ae197818a3
Add coverage.
2016-04-09 15:32:33 +01:00
Roberto Agostino Vitillo
1116f59a8d
Add travis configuration.
2016-04-09 15:20:45 +01:00
Roberto Agostino Vitillo
110ff7183c
Add travis configuration.
2016-04-09 15:17:27 +01:00
Roberto Agostino Vitillo
574d9b9235
Add sbt plugins.
2016-04-09 15:14:17 +01:00
Roberto Agostino Vitillo
c5b0d4e2b5
Bump version.
2016-04-09 15:10:44 +01:00
Roberto Agostino Vitillo
67937fc351
Configure distribution.
2016-04-09 15:07:08 +01:00
Roberto Agostino Vitillo
f3908f507b
Update README.md
2016-04-09 08:51:48 +01:00
Roberto Agostino Vitillo
7a1ebebbd6
First commit.
2016-04-09 08:49:56 +01:00