Граф коммитов

123 Коммитов

Автор SHA1 Сообщение Дата
Sabee Grewal 5100b4a5e8 Merge pull request #164 from sabeegrewal/progressClean
Moving to version 0.15.0 of the EventHubs Java Client
2017-10-02 17:18:18 -07:00
sabeegrewal 2f6d2c2913 updating import order 2017-10-02 17:11:01 -07:00
sabeegrewal ef3781eeee updating to new namespace in tests as well 2017-10-02 17:08:47 -07:00
sabeegrewal ba45c101bb updating import statements to new eventhubs namespace 2017-10-02 17:06:52 -07:00
sabeegrewal 00fe018f0c moving to the latest eventhubs java client 2017-10-02 16:59:46 -07:00
Sabee Grewal 119192860e Merge pull request #163 from sabeegrewal/progressClean
Slight cleanup to code base
2017-10-02 16:38:26 -07:00
sabeegrewal 978013d7a3 splitting lines at arguments 2017-10-02 16:27:39 -07:00
sabeegrewal 62e026187e reverting minor style changes 2017-10-02 16:23:45 -07:00
sabeegrewal 2da0b9e507 removing unused print statement 2017-10-02 15:01:57 -07:00
sabeegrewal 8f85f53097 updating comments and spacing for more consistent style 2017-10-02 14:50:10 -07:00
sabeegrewal 93cb52cf86 reverting to scalastyle 0.8.0 2017-10-02 13:39:58 -07:00
sabeegrewal f65f54e0e3 fixing bug in test 2017-09-29 16:30:46 -07:00
sabeegrewal 784465dc5e more scalastyle fixes 2017-09-29 15:16:14 -07:00
sabeegrewal afb2fad2fb fixing scalastyle issues and build warnings 2017-09-29 15:12:37 -07:00
sabeegrewal ac08f53112 changing scalastyle version 2017-09-29 14:51:50 -07:00
sabeegrewal 7efda0fe29 cleaning up code and re-working progress tracking code 2017-09-29 14:43:24 -07:00
Nan Zhu 750c30df42 [2.1.x] fast initialization of progress tracker (#158)
* metadata path setup and fall back to list all mechanism

* delete progress file directory after each test

* fix the failed test

* add test for metdata exist

* add test for read progress file

* revise mockito test

* update pom

* pass in all params with mockit

* change the return value of getLatestFile

* add test for checking if really queried metadata

* test: When metadata presents, we should respect it even the more recent progress file is there

* update import

* add writing logic of metadata

* update tests

* add log

* add logging

* revise logging

* add logging in commit() of structured streaming

* fix the test

* revise test for metadata not committed

* revise tests

* fix the failed test

* add partially committed progress file test

* add test of upgrade

* update upgrade test

* add test for upgrading structured streaming

* add test for no metadata

* update test name

* enabling cleanup thread

* cancel cleanup task

* add tracing

* test for fishy change

* test for fishy change

* fix the path

* code clean

* address the comments

* missed one
2017-09-28 08:39:45 -07:00
Sabee Grewal 8c9b755355 [spark-2.1.x] making EventHub API call every batch (#156)
* making EventHub API call every batch

* remove unused import

* style fix

* Bug fix

* same bug fix but for DStreams
2017-09-25 12:00:09 -07:00
SJ f24cf42e64 Merge pull request #155 from sabeegrewal/2.1.x
making EventHubs API call for every partition each batch
2017-09-23 21:01:37 -07:00
Sabee Grewal 612cb99d30 making EventHubs API call for every partition each batch 2017-09-23 17:23:20 -07:00
Nan Zhu b1e026758a update version number (#154)
* change version number to 2.1.4

* update to 2.1.5-SNAPSHOT
2017-09-21 09:49:16 -07:00
Nan Zhu 83ecc28dd1 [2.1.x] should update highest offset when wake up by notify() (#153)
* should update highest offset when wake up by notify()

* fix test
2017-09-20 13:44:59 -07:00
Nan Zhu c107b5c928 update version to 2.1.4-SNAPSHOT (#147) 2017-09-19 13:32:24 -07:00
Nan Zhu 44f1d9e46a fix wrong start offset in the first batch (#145) 2017-09-18 15:14:05 -07:00
Nan Zhu 23f8d5e872 Update Change Log 2017-09-18 11:07:10 -07:00
Nan Zhu 416fed24fe Update README.md 2017-09-18 10:58:41 -07:00
Nan Zhu 506573d4ac update version for 2.1.3 2017-09-18 10:13:43 -07:00
Nan Zhu f7923233ad fix issue for unnecessary wait (#141) 2017-09-18 10:11:24 -07:00
Nan Zhu 6bc8b296d1 Update README.md 2017-09-12 18:15:38 -07:00
Sabee Grewal cf3288a074 updating receiver identifier with different info (#135)
* updating receiver identifier with different info

* minor fix in spacing
2017-09-11 15:50:08 -07:00
Nan Zhu c67bcf0ced [spark-2.1.x] eliminate list operations in progress tracker (#134)
* update collect progress record files

* fix compilation issue

* handle partially evaluated rdd

* fix compilation issue

* add comments

* revise code
2017-09-11 15:49:42 -07:00
Nan Zhu 207d9834e4 [2.1.x] optimize thread synchronization and show metrics caused by reading progress files (#124)
* fix flaky test

* remove duplicate code

* change sync order and add metrics

* update pom

* update version number

* change back pom

* test fix
2017-08-24 10:59:32 -07:00
sabeegrewal 305109fc14 check for null reference in RDD to DataFrame conversion (#123)
* check for null reference in RDD to DataFrame conversion

* adding unit test for null properties during DataFrame generation
2017-08-23 12:41:53 -07:00
Jacek Laskowski 993da4e23a Update ss.md (#129)
Think the changes make the code "prettier" (certainly using the recommended API, esp. for a trigger).
2017-08-22 08:30:30 -07:00
Nan Zhu feee774f07 fix flaky test (#120)
* fix flaky test

* remove duplicate code
2017-08-22 08:29:18 -07:00
sabeegrewal bd6f17b598 adding iothub/spark setup instructions (#117) 2017-08-10 09:36:11 -07:00
sabeegrewal 95fae3d776 adding 'getting started' documentation (#116)
* adding 'getting started' documentation

* updated a few issues in readme
2017-08-08 13:49:03 -07:00
Nan Zhu f3896776a3 release 2.1.2 and 2.0.8 (#113) 2017-07-31 12:44:15 -07:00
Yun Tang aae33839ee Correct notes for "eventhubs.filter.enqueuetime" (#111)
* Correct notes for "eventhubs.filter.enqueuetime"

* Correct notes for "eventhubs.filter.enqueuetime"
2017-07-28 19:01:08 -07:00
Nan Zhu e9babda4a0 [2.1.x] post receiver id to eventhubs (#109)
* post receiver id to eventhubs

* fix compilation issue

* fix compilation issue
2017-07-28 19:00:35 -07:00
Nan Zhu 72b4484679 fix receiver_leak (#105) 2017-07-19 08:54:22 -07:00
Nan Zhu 8b07e7b983 Update change_log.md 2017-06-26 10:40:05 -07:00
Nan Zhu c27cd92926 Update README.md 2017-06-26 10:36:14 -07:00
Nan Zhu 717b297ca8 [2.1.x] replace rest client with amqp one (#97)
* replace rest client with amqp one

* fix the failed tests
2017-06-26 09:39:24 -07:00
Nan Zhu 4554c0aa0f fix outdated comments (#92)
* update pom version number

* update change log

* remove outdated comments

* fix comments
2017-06-02 13:45:55 -07:00
Nan Zhu ca60440183 update change log for 2.0.6 (#90)
* update pom version number

* update change log
2017-05-25 10:52:14 -07:00
Nan Zhu 49a47512ae update pom version number (#88) 2017-05-25 09:06:52 -07:00
Nan Zhu bc4be2b2b5 change log (#87)
* temp

* define more keys

* update user doc

* update

* update

* add note

* Update ss.md

* Update ss.md

* update change log
2017-05-25 08:50:13 -07:00
Nan Zhu c1f7f60991 User doc for structured streaming (#86) 2017-05-25 08:31:40 -07:00
Nan Zhu 170a90c562 enqueue time support for 2.1 (#84)
* refactor client part

* stylistic fix

* refactor rate control

* sketch

* sync

* add ProgressTrackerBase

* refactor read

* create progress package in common and rename CommonUtils to RateControlUtils

* refactor RDD classes

* refactor test cases

* get rid of stream id and partial progress tracker in eventhubssource

fix the last commit

* upgrade dependency and basic flow in EventHubsSource

* fix issues of initial offsets

* Integration test (#16)

* init commit of integration test

* fix examples

* fix output mode

* fix enqueuetime type issue

* fix dir structure issue

* fix source registration

* fix uid issue

* add logging

* fix the offset management issue

* fix offset update issue

* add logging

* add logging

* fix the issue of immature initialization of progress tracker

* test

* use parquet output stream

* fix

* fix serializedOffset issue

* fix issue

* test

* fix

* fix typo

* fix batch id

* addlogging

* fix recovery logic

* fix

* sync

* refactor ProgressTracker to support multi streams in structured streaming

* fix

* refactor progress writer

* test

* test

* try

* test

* test

* fix it

* fix

* test

* fix directory structure

* test

* test

* partial fix

* fix the failed test case

* do not clean temp files at the beginning

* Added structured streaming progress tracker suite

* Refactoring simulated eventhubs setup

* Added simple EventHubsSource tests

* use Row instead of InternalRow

* Add unit test for user-defined keys

* accept optional userDefinedKey config entry

* fix unser issue

* test with userDefinedKeys

* refactor client part

* stylistic fix

* refactor rate control

* sync

* add ProgressTrackerBase

* refactor read

* refactor commit methods

* create progress package in common and rename CommonUtils to RateControlUtils

* refactor test cases

* get rid of stream id and partial progress tracker in eventhubssource

fix the last commit

* upgrade dependency and basic flow in EventHubsSource

* make EventHubsOffset a nested class

* Integration test (#16)

* init commit of integration test

* fix examples

* fix output mode

* fix enqueuetime type issue

* fix dir structure issue

* fix source registration

* fix uid issue

* add logging

* fix the offset management issue

* fix offset update issue

* add logging

* add logging

* fix the issue of immature initialization of progress tracker

* test

* use parquet output stream

* fix

* fix serializedOffset issue

* fix issue

* test

* fix

* fix typo

* fix batch id

* addlogging

* fix recovery logic

* fix

* sync

* refactor ProgressTracker to support multi streams in structured streaming

* fix

* refactor progress writer

* test

* test

* try

* test

* test

* fix it

* fix

* test

* fix directory structure

* test

* test

* partial fix

* fix the failed test case

* do not clean temp files at the beginning

* sync

* sync

* try to test

* fix the typo

* unit test for enqueueTime

* implement eventhubs search with time functionality

* fix compilation issue

* logging

* logging more

* logging

* fix bug in search params

* build date from instant

* fix issues in expected output

* fix

* fix offset

* add test case for checkpointing offset type

* fix test case

* enable fallback read in test

* remove duplicate code

* test enqueuetim

* configure enqueuetime

* fix

* update to 0.13.1

* test

* test

* try instant parsing

* fix typo

* fix

* logging

* test

* test

* add test for illegal filtering time

* test

* fix failed test cases

* fix inconsistent epoch second / milliseconds

* Eventhub reader (#17)

* Added more Event Hubs source unit tests

* Few more event hubs source unit tests added

* Addressed review comments

* New Eventhubs source unit test

* small changes

* try to fix

* Broken unit test

* try to fix

* try to fix

* try to fix

* try to fix

* fix compilation issue

* changes

* changes

* Updated broken unit test

* Fix unit test

* Refactored code for unit tests

* test

* test

* hi

* fix

* resolve directory conflict

* merge and clean

* stylistic fix

* fix compilation issue

* fix compilation issue

* recover some changes

* stylistic fix

* fix adding data issue

* add test for filtering

* fix typo

* add more logging

* adjust logging

* update logging

* test

* test

* test

* partial check

* more test

* fix test

* test

* test

* test

* add logging

* test

* test

* fix ambiguous of test

* advance clock

* test

* fix eh name

* test

* test

* add unit test

* remove unnecessary printStackTrace
2017-05-17 23:05:56 -07:00