Commit Graph

313 Commits

Author SHA1 Message Date
Sebastian Burckhardt cb07152e0a
Add tracing to observe taskhub operations (#83) 2021-10-14 09:26:28 -07:00
Sebastian Burckhardt 4a9ffbec31
Update to FASTER 1.9.6 (#80)
* update to FASTER 1.9.3

* update to 1.9.6
2021-10-14 09:26:02 -07:00
Sebastian Burckhardt 4ae87f2eee
fix incorrect uses of the resolved connection string where the connection name should be used. (#77) 2021-10-11 14:59:06 -07:00
Sebastian Burckhardt 8b1d315ae4
fix recovery parameter for FASTER (#79) 2021-10-08 15:22:53 -07:00
Sebastian Burckhardt 0c7aa42853
CacheOrchestrationCursors must be set to false when running out-of-proc (#71) 2021-10-05 11:29:44 -07:00
Christian Lechner 0390605bd9
New Code Samples for Netherite (#72)
* new samples for Netherite

* corrections after review
2021-09-20 11:48:15 -07:00
Sebastian Burckhardt bcb5f67c51
Improve performance of query results (#70)
* launch query and prefetch events earlier to improve latency

* add nicer query endpoint to perf tests

(cherry picked from commit 9851a1de91)

* add tracing for when a sender is starting to send an event.

* support use of multiple client channels; fix range of partitions; support pipelined return of query results

* use custom serialization to compress query results

* fix bug in test
2021-09-13 12:28:22 -07:00
Xiangfeng Zhu 6b3b3720af
Better Scheduling/Load Balancing for activities (#52)
* Add random scheduler

* add loadmonitor component

* add scheduling options & filehash tweaks

* keep partition prefix consistent with all the other tracing

* fix bug (failed to update estimated load when remote activities complete)

* Add aggressive scheduler mode

* fixed a typo

* basic load monitor

* checkpoint

* fix bug in ETW tracing

* change conditions for load monitor reporting

* use full tracing

* trace more information for OffloadCommandReceived

* change tracing of task messages to provide more consistent results

* several package updates, including DurableTask.Core 2.5.6

* update to account for tracing changes

* remove load monitor interval and add idle message

* add latency field to ActivityCompleted and RemoteActivityResultReceived

* added offload algorithms based on waiting time

* fix overflow in filehash

* fixes to loadmonitor logic, autoscaler, and temporary info sender

* fix missing file and update scale tracing

* change loadmonitor hosting so it moves less often

* implement overlay of pending commands on offload estimation

* improve precision of load estimation and distinguish between stationary and mobile.

* remove unnecessary left over line of code.

* replace push based algo with pull based

* fix so we don't pull more than mobile

* fix dumb mistake

* fix formula

* need more conservative default estimate for completion time, otherwise offload is too aggressive

* remove unnecessary constants.

* add tracing for RTT

* tweak parameters for RTT and smoothing

* batchworker tracing for senders

* use array instead of dictionary, fix concurrent modification exception

* revise sender for load monitor events to send only latest state, and to do rate limiting

* fix tracing

* turn off all loadmonitor activity when the parameter ActivityScheduler is not set to LoadMonitor

* implemented "Static" setting for ActivityScheduler

* reduce severity of tracing in loadmonitor

* make filehash scale configurable

* update names, remove old algorithms

* improve terminology and implement solicitation

* fixes and simplifications

* minor updates

* simplify data structure for local activities

* use smarter concurrency defaults, same as for default backend

* fix tracing in LoadMonitor

* update filehash a bit to make it more uniform

* add test series for comparing local vs locavore

Co-authored-by: Sebastian Burckhardt <sburckha@microsoft.com>
2021-09-01 12:39:02 -07:00
Sebastian Burckhardt 9c8e0d3439 fix verbosity mistake in tracing 2021-09-01 12:36:45 -07:00
Sebastian Burckhardt fe24c4b48e update sample and README.md to point to 0.5.0 2021-08-13 13:09:39 -07:00
Sebastian Burckhardt 2491651f3e update version to 0.5.0-alpha 2021-08-13 12:17:12 -07:00
Sebastian Burckhardt b8a4900960
revise time-based checkpointing so it happens only in idle moments; and do not take state checkpoints when shutting partitions down quickly. (#68) 2021-08-13 11:37:28 -07:00
Sebastian Burckhardt 0799410e6d
change tracing of task messages to provide more consistent results (#66) 2021-07-28 09:02:15 -07:00
Sebastian Burckhardt 71993f2a8e
resilience improvements for EventHubsProcessor (#64) 2021-07-26 08:27:34 -07:00
Sebastian Burckhardt a2588349f1
several package updates, including DurableTask.Core 2.5.6 (#63) 2021-07-22 14:58:11 -07:00
Sebastian Burckhardt 5ea53e959c
Fix dll load problem and throw exception on 32 bit (#62)
* Go back to System.Threading.Channels version 4.7.1

* throw exception on 32bit process

* update to latest Microsoft.NET.Sdk.Functions

* use trace verbosity for unit tests
2021-07-21 15:15:54 -07:00
Sebastian Burckhardt d955ed0420
Update README.md 2021-06-24 17:34:00 -07:00
Sebastian Burckhardt 91810954e8 Update sample to use 0.4.0-alpha 2021-06-24 13:41:29 -07:00
Sebastian Burckhardt 4eb5fafa3d update version to 0.4.0-alpha 2021-06-24 05:56:50 -07:00
Sebastian Burckhardt 6621e9a5a3
Revise connection settings (#58)
* revisit the mechanism for specifying alternate page blob storage: use more descriptive names, and implement deletion

* fix missing code.

* fix bug

* fix checkpoint devices so they use page blob storage as well.

(cherry picked from commit 1ea23e69899798e61f54eeba9dfeacbac2d4c483)
2021-06-24 05:44:25 -07:00
Sebastian Burckhardt fbe045449d
ETW tracing improvements (#60)
* do not trace batchworker work cycles where the batch is empty

* use 'details' instead of 'reason' for checkpoint events

* include partition id in eventhubs traces

* use LatencyMs instead of Latency for batch worker tracing.

* Do not emit warnings for ReceiverDisconnectedExceptions observed inside EventProcessorHost
2021-06-24 05:44:05 -07:00
Sebastian Burckhardt e9a98fab8a fix warning 2021-06-22 17:04:01 -07:00
Sebastian Burckhardt c869561c18
validate that existing taskhubs are connected to the original event hubs, and if not, throw exception with informational message. (#59) 2021-06-22 16:18:44 -07:00
Sebastian Burckhardt f1cdd83122
Mitigations for dll load problem (#57)
* remove obsolete code for query support.

* add fast-fail for dll loading problem (#55), and go back to last functioning version.
2021-06-22 10:46:19 -07:00
Sebastian Burckhardt 2033ccc977
remove obsolete code for query support. (#56) 2021-06-22 09:12:19 -07:00
Sebastian Burckhardt d10708ae0e
Update Packages (#54)
* Update package references.

* fix blob triggers to use new versions, and fix a typo
2021-06-18 07:07:21 -07:00
Sebastian Burckhardt 2e07cbb775
Fix JSON serialization of storage format string (#47)
* Use more explicit serialization/deserialization for storage format string.

* include settings for deserialization also.
2021-06-18 06:32:58 -07:00
Sebastian Burckhardt c618ba7f49
fix copy-paste bug that caused eventhubs clients to not be closed correctly. (#53) 2021-06-17 07:54:06 -07:00
Sebastian Burckhardt 7af87877c8
fix replay bug in activity scheduler, and add configuration setting (#51) 2021-06-14 14:26:03 -07:00
Sebastian Burckhardt 1c0cc940dc
consolidate and reorganize performance tests, add scripts for automatic deployment, running, and results collection (#50) 2021-06-14 12:32:12 -07:00
Sebastian Burckhardt 002978d167 fix type inference issue that breaks compilation in some environments 2021-06-14 12:29:50 -07:00
Sebastian Burckhardt a946bfdf39 add very basic unit test for purge and list. 2021-06-03 15:24:56 -07:00
Sebastian Burckhardt 1672852ebc Use unique taskhubname for each run of DurableTask.Netherite.AzureFunctions.Test 2021-06-03 15:13:03 -07:00
Sebastian Burckhardt 244286ee91
introduce 'CacheOrchestrationCursor' setting, instead of using ExtendedSessionsEnabled (#43) 2021-05-12 07:10:05 -07:00
Sebastian Burckhardt b5ca7f9dfb
fix bug in scalemonitor, and add support for using blobs instead of tables (#42) 2021-05-11 15:57:24 -07:00
Sebastian Burckhardt b832af36c6
Update README.md 2021-05-07 10:03:20 -07:00
Sebastian Burckhardt 6f8ba021ed
Fix test configuration (#40)
* fix implementation of local file storage for FASTER

* fix test configuration in DurableTask.Netherite.AzureFunctions

* generalize the CI pipeline so it runs all tests again.
2021-05-07 09:39:51 -07:00
Sebastian Burckhardt fd281e7ce0 Add test for periodic orchestration; display exact value of next timer in load partition table 2021-04-23 15:31:47 -07:00
Sebastian Burckhardt 5b4484100d fix tracing bug and change default setting. 2021-04-23 15:02:28 -07:00
Sebastian Burckhardt 8be85adf8f
add EventSourceName override to durability provider, and rename eventId args in EtwSource to avoid clashes in linux logging (#37) 2021-04-20 09:27:32 -07:00
Sebastian Burckhardt 3c5c586fc6 update hello sample to use correct host.json and to support autoscaling 2021-04-02 12:37:48 -07:00
Sebastian Burckhardt e663c1fad8 update hello sample to use 0.3.0-alpha 2021-03-31 22:50:19 -07:00
Sebastian Burckhardt 859abccee8 update version to 0.3.0-alpha 2021-03-31 22:12:44 -07:00
Sebastian Burckhardt cb7b2d633c fix hang in partition shutdown 2021-03-31 18:30:29 -07:00
Sebastian Burckhardt fa47f71c35 Revert "update package versions (#34)"
This reverts commit 8ff9d1fb65.
2021-03-31 14:31:02 -07:00
Sebastian Burckhardt 21e23228e0 fix NullReferenceException during partition shutdown. 2021-03-31 13:29:29 -07:00
Sebastian Burckhardt ae3d25c1c2
implement taskhub storage format descriptor. (#35) 2021-03-31 12:47:02 -07:00
Sebastian Burckhardt c2c974a1c2
take a pass through all the tracing, and add AppName. (#32) 2021-03-31 12:05:37 -07:00
Sebastian Burckhardt 8ff9d1fb65
update package versions (#34) 2021-03-31 12:03:20 -07:00
Sebastian Burckhardt 2e2bf1fde6 Update azure-pipelines.yml for Azure Pipelines 2021-03-31 11:53:16 -07:00