Граф коммитов

300 Коммитов

Автор SHA1 Сообщение Дата
SJ 33ee66e23a
Several bug fixes and improvements for 2.3.14 release (#465)
* [Bug fix] fix IO pipe issue that causes a blocking issue on send calls
* [Bug fix] fix broken TCP connection issue that causes prolonged delay in receiving events
* [Configuration] provide a configuration knob to specify whether an epoch receiver should be used. Default value is true.
* [Tracing] add a Spark task ID to log messages and emit latency data for receive calls
2019-10-23 09:44:13 -07:00
SJ d5ad0d6caa
Update version number for new release (2.3.13) (#451) 2019-07-26 13:29:19 -07:00
jgiardin f10a85f03e add event hub name to the OffsetRange toString function so that it is present in the spark metadata. add simple unit test for validation of the string (#450) 2019-06-27 08:41:01 -07:00
Dirceu Semighini Filho 64d3fb5eca small misstype in EventPosition (#449) 2019-06-05 22:51:46 -07:00
SJ 1ad742d461
Update version number for new release (2.3.12) (#446) 2019-05-10 15:13:49 -07:00
gison93 0e909b8cc0 Fix typo Strucutred -> Structured (#447) 2019-05-10 12:10:43 -07:00
SJ 081e76b0cb
Update version numbers in the document and mark 2.3.11 as Preview release (#445) 2019-05-02 18:08:00 -07:00
SJ d368d7c6b0
Update version numbers for new release (2.3.11 & 2.2.11) (#443) 2019-05-01 21:45:19 -07:00
SJ d9f4fe02e5
performance improvement for EventHubs writer (#441) 2019-04-12 09:30:45 -07:00
SJ d5e2734c47
Updated the doc - new settings (prefetchCount & threadPoolSize) (#439) 2019-04-08 17:43:46 -07:00
SJ f5fe40d994
Prepare 2.3.10 & 2.2.10 release (#438) 2019-04-08 14:52:51 -07:00
SJ 08c0c58650
Update dpendency - Event Hubs client SDK (2.2.0 --> 2.3.0) (#437) 2019-04-08 09:36:54 -07:00
SJ 90bd0e8608
Update maximum prefetch count setting and improve logging (#436) 2019-04-01 19:58:28 -07:00
SJ 2a6a5b9f28
Update README.md (#434) 2019-03-19 17:30:32 -07:00
SJ 3986bae133
Update README.md (#429) 2019-01-28 10:58:18 -08:00
SJ 420e4bd2a2
Prepare 2.3.9 & 2.2.9 release (#427) 2019-01-18 15:53:32 -08:00
SJ 9f7c9d0818
Handle a case where messages in an offet range were already expired (#426) 2019-01-18 13:31:51 -08:00
SJ cdea9ae746
Prepare 2.3.8 & 2.2.8 release and update Java client SDK dependency (#425) 2019-01-15 19:47:23 -08:00
SJ 0a59666dda
Provide a knob to specify the size of thread pool and recreate a receiver if receive call fails for an extended period of time (#423) 2019-01-14 09:47:44 -08:00
SJ 6932f56d0a
Release 2.2.7 for spark runtime 2.1 and 2.2 (#421) 2019-01-05 22:07:40 -08:00
SJ 86fb6db24d
Prepare for 2.3.7 release (#420)
* Prepare for 2.3.7 release
* Enable ReceiverTimeout and PrefetchCount configuraion
* Update Java client dependency
2019-01-05 18:25:23 -08:00
SJ 498746f308
Fix future timeout issue (#419) 2019-01-02 17:45:34 -08:00
SJ 53c1d79433
Provide a knob to specify prefetch count for the underlying receiver. (#417)
* Provide a knob to specify prefetch count for the underlying receiver
2018-12-11 18:08:26 -08:00
Fokko Driesprong 2bdfa15d40 Take the actual sequence numbers, instead of relying on Seq (#409)
For the tests we want to use the actual sequence number instead of
using the size of the underlying Seq. In real world, the consumer
group of the event hub might nog start at zero, but at a high number
2018-11-28 17:47:56 -08:00
SJ 0f75b0e9fe Use OperationTimeout config value instead of fixed timeout when specifying timeout for internal API calls (#415)
* Use OperationTimeout instead of fixed timeout for retries

* fix format issue

* update comments for internalOperationTimeout API and declare it lazy val

* Update API doc
2018-11-28 17:36:40 -08:00
SJ 1c198ba39a
Prepare for 2.3.6 & 2.2.6 release (#407) 2018-11-08 09:06:05 -08:00
SJ e0faa35fbc
Update logging for EventHubsConf class (#406) 2018-11-05 23:21:20 -08:00
Sabee Grewal 58750b7b31
docs - updating default starting position (#402) 2018-10-19 13:56:42 -07:00
Sabee Grewal 69fa95729a
spark 2.2.5 readme update (#401) 2018-10-19 13:48:57 -07:00
Sabee Grewal 1c42c2f45b
prep for 2.3.5 release (#399) 2018-10-19 11:53:46 -07:00
Sabee Grewal 57d083f2f8
add all system properties to schema (#395)
* add deviceID to schema

* add all properties

* Removing duplicated keys from SystemProperties map
2018-10-11 11:07:11 -07:00
Sabee Grewal 45fb7b6595
Add foreachwriter docs (#397)
* Add foreachwriter docs

* update
2018-10-07 10:07:46 -07:00
Quynh Anh "Emma" Nguyen c19dff2026 Add unit tests for RetryUtils (#396)
* Add unit tests for RetryUtils

* Address code review comments
2018-10-07 09:15:31 -07:00
Sabee Grewal 7ceae44f3b
Receive calls one at a time to avoid overloading thread pool (#394)
* Receive calls one at a time to avoid overloading thread pool

* bumping to 1.2.0 of java client
2018-10-03 16:24:26 -07:00
Sabee Grewal 22d7936c61 2.3.4 and 2.2.4 release (#387) 2018-09-10 15:06:55 -07:00
Sabee Grewal 34f5488e56
Add consumer group to exception messaging in cached receiver (#386) 2018-09-10 07:59:18 -07:00
Sabee Grewal 43717e0e90
Redo check cursor (#384)
* Redo check cursor

* Update to 2.3.4 snapshot
2018-09-07 14:32:41 -07:00
Sabee Grewal 0733567c2b
2.3.3 and 2.2.3 release (#381) 2018-09-04 20:20:05 -07:00
Sabee Grewal 4646ee2eb4
Increased strictness in receive calls (#380) 2018-09-02 16:25:02 -05:00
Sabee Grewal 61aba38334
Properties can be added in EventHubsSink (#375)
* Properties can be added in EventHubsSink

* doc update
2018-08-20 14:34:05 -05:00
Sabee Grewal f1bb4ea7e9
Cached receivers are async (#377)
* Cached receivers are async

* avoid receiver creation and deletion if cursor isn't correct
2018-08-20 12:10:42 -05:00
Sabee Grewal 334255e6a3
Retry receive when it returns null in translate (#369) 2018-08-07 23:06:05 -07:00
Sabee Grewal c9112b0420
Handled DescribedType in properties (#373) 2018-08-07 23:01:01 -07:00
Fokko Driesprong 68b91a21d0 Change the ehClient to a lazy variable (#368) 2018-08-01 10:10:58 -07:00
Fokko Driesprong 0fe7d8f5e0 Simplify configuration and creation of the client (#359)
* Dont read messages that are already pruned by EventHub

In the getPartitions method of the EventHubsRDD class we check if the
offsets are still valid. It is possible that the retention has kicked
in and the messages are no longer available on the bus.

For more info, refer to this issue:
https://github.com/Azure/azure-event-hubs-spark/issues/313

Did some minor refactoring:

- Made the clientFactory static so we don't need to pass this constructor
  around
- Changed the signature of allBoundedSeqNos from a Seq to a Map, since
  the partitionId is unique and later on in the code it is also converted
  to a map.
- Removed the trim method, since passing EventHub config keys to Spark
  does not do any harm. Without this change, the tests are failing since
  they are not being switched to the simulator.

* Remove the offset calculation

* Bump version to 2.3.3-SNAPSHOT

* Restore trimmed config when creating a EventHubsRDD
2018-07-29 14:19:17 -07:00
Sabee Grewal 47b6d31c77
CachedReceiver properly adjusts cursor (#364) 2018-07-25 15:01:32 -07:00
Fokko Driesprong 8c11923a24 Split the classes from EventHubsTestUtils.scala in seperate files (#360)
The EventHubsTestUtils.scala was very long, therefore it is better
to split the classes/objects into seperate files
2018-07-24 12:33:39 -07:00
Fokko Driesprong 599a979cf2 Make the code more Scala idomatic (#356)
There is a lot of mutable code in the source, we want to get rid of it
2018-07-23 10:15:16 -07:00
Sander Ardinois e46c031fb8 Typo fix in docs (#358)
Fix a typo in the Pyspark structured streaming docs
2018-07-22 15:08:22 -07:00
Sabee Grewal 7ac76d8f49
update to 2.2 in readme (#355) 2018-07-06 14:29:02 -07:00