Граф коммитов

406 Коммитов

Автор SHA1 Сообщение Дата
Sebastian Burckhardt b47eae7f40
Always release partition lease when a partition shuts down (#335)
* always release lease when a partition shuts down

* address PR feedback (add comment)

* recognize another empiric case of transient storage exception

* change default of partition checkpointing

* do not release lease if lease was already lost.
2024-01-03 09:48:52 -08:00
Sebastian Burckhardt fc777fed0a
Revise partition startup (#332)
* add "waiting" transition and timeout to partition startup

* address PR feedback
2023-11-03 11:03:18 -07:00
sebastianburckhardt f24f6de01a Merge remote-tracking branch 'origin/main' into dev
# Conflicts:
#	src/DurableTask.Netherite/StorageLayer/Faster/StoreWorker.cs
2023-10-24 13:33:05 -07:00
Sebastian Burckhardt bbbabbaeb1
update version to 1.4.1 (#328) 2023-10-24 13:22:45 -07:00
Sebastian Burckhardt 96b832eb5c
remove ? to prevent nullable warnings (#327) 2023-10-24 10:57:34 -07:00
Sebastian Burckhardt 7a6e760b59
Fix fencing of checkpoints (#326)
* fix fencing of checkpoints that caused occasional concurrent modification exceptions and can cause history corruption

* add more comments to explain sessions and fencing
2023-10-23 13:56:27 -07:00
Varshitha Bachu 76d01adabc
Copy ParentTraceContext when cloning ExecutionStartedEvent (#309)
* Update CreationRequestReceived.cs

* remove the faster-alternate data store and fix broken deserialization. (#308)

* fix and instrument commitlog serialization and deserialization. (#305)

* fix discard of packets so the next packet to expect is updated correctly (#302)

* New EventHubs performance tests (#178)

* rebase, and remove changes to scale monitor

* add hello cities test that prints the history of a nested orchestration

* implement a watchdog that terminates CompletePending quickly if it hangs (#318)

* add unique id to scale monitor constructor (#316)

* fix bugs from PR #302 (#315)

* fix cache size reporting (#321)

* added a comment

* update durable task package references to 2.15.1 and 2.12.0 (#317)

* update DT and DF package references

* update to latest

* sync dev w/ main (#324)

* Update GH automation (#303)

* initial commit (#290)

* Revert "initial commit (#290)" (#314)

This reverts commit 3a2d193e57.

* Bump Azure.Identity from 1.7.0 to 1.10.2 in /samples/TokenCredentialDTFx (#323)

Bumps [Azure.Identity](https://github.com/Azure/azure-sdk-for-net) from 1.7.0 to 1.10.2.
- [Release notes](https://github.com/Azure/azure-sdk-for-net/releases)
- [Commits](https://github.com/Azure/azure-sdk-for-net/compare/Azure.Identity_1.7.0...Azure.Identity_1.10.2)

---
updated-dependencies:
- dependency-name: Azure.Identity
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Terminate partition when FASTER refuses to checkpoint for over a minute (#301)

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Varshitha Bachu <vabachu@microsoft.com>
Co-authored-by: Sebastian Burckhardt <sburckha@microsoft.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Sebastian Burckhardt <sburckha@microsoft.com>
Co-authored-by: David Justo <david.justo.1996@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-23 13:55:55 -07:00
David Justo fb4aa617ff
sync dev w/ main (#324)
* Update GH automation (#303)

* initial commit (#290)

* Revert "initial commit (#290)" (#314)

This reverts commit 3a2d193e57.

* Bump Azure.Identity from 1.7.0 to 1.10.2 in /samples/TokenCredentialDTFx (#323)

Bumps [Azure.Identity](https://github.com/Azure/azure-sdk-for-net) from 1.7.0 to 1.10.2.
- [Release notes](https://github.com/Azure/azure-sdk-for-net/releases)
- [Commits](https://github.com/Azure/azure-sdk-for-net/compare/Azure.Identity_1.7.0...Azure.Identity_1.10.2)

---
updated-dependencies:
- dependency-name: Azure.Identity
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Terminate partition when FASTER refuses to checkpoint for over a minute (#301)

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Varshitha Bachu <vabachu@microsoft.com>
Co-authored-by: Sebastian Burckhardt <sburckha@microsoft.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-23 11:20:13 -07:00
Sebastian Burckhardt 08e760aeaf
update durable task package references to 2.15.1 and 2.12.0 (#317)
* update DT and DF package references

* update to latest
2023-10-23 11:19:37 -07:00
David Justo ba15eed262
Terminate partition when FASTER refuses to checkpoint for over a minute (#301) 2023-10-19 13:55:54 -07:00
dependabot[bot] 6effbf572c
Bump Azure.Identity from 1.7.0 to 1.10.2 in /samples/TokenCredentialDTFx (#323)
Bumps [Azure.Identity](https://github.com/Azure/azure-sdk-for-net) from 1.7.0 to 1.10.2.
- [Release notes](https://github.com/Azure/azure-sdk-for-net/releases)
- [Commits](https://github.com/Azure/azure-sdk-for-net/compare/Azure.Identity_1.7.0...Azure.Identity_1.10.2)

---
updated-dependencies:
- dependency-name: Azure.Identity
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-18 14:46:37 -07:00
Sebastian Burckhardt 76d7830f08
fix cache size reporting (#321) 2023-10-17 14:59:19 -07:00
Sebastian Burckhardt bdf943738b
fix bugs from PR #302 (#315) 2023-10-17 11:18:01 -07:00
Sebastian Burckhardt 2e86c3a8a5
add unique id to scale monitor constructor (#316) 2023-10-17 11:16:53 -07:00
Sebastian Burckhardt 4cc4ca72e3
implement a watchdog that terminates CompletePending quickly if it hangs (#318) 2023-10-17 11:16:18 -07:00
Sebastian Burckhardt 83d16d2bd9
New EventHubs performance tests (#178)
* rebase, and remove changes to scale monitor

* add hello cities test that prints the history of a nested orchestration
2023-10-12 14:22:33 -07:00
Sebastian Burckhardt eacb42fb0b
Revert "initial commit (#290)" (#314)
This reverts commit 3a2d193e57.
2023-10-11 10:58:20 -07:00
Varshitha Bachu 3a2d193e57
initial commit (#290) 2023-10-10 10:38:53 -07:00
Sebastian Burckhardt 2ae01f892b
fix discard of packets so the next packet to expect is updated correctly (#302) 2023-10-10 10:37:18 -07:00
Sebastian Burckhardt d06cd50820
fix and instrument commitlog serialization and deserialization. (#305) 2023-10-10 10:36:35 -07:00
Sebastian Burckhardt 5cc4a9ed30
remove the faster-alternate data store and fix broken deserialization. (#308) 2023-10-10 10:35:26 -07:00
Varshitha Bachu d9550b7780
Update GH automation (#303) 2023-08-31 12:03:02 -07:00
sebastianburckhardt 0b8291a105 Merge branch 'dev' 2023-07-07 12:59:46 -07:00
sebastianburckhardt ee47f0c9f8 update version to 1.4.0 2023-07-07 12:55:18 -07:00
Sebastian Burckhardt 3e3ef5bb08
revise handling of fatal exception, implement emergency shutdown path for host (#285) 2023-07-07 11:37:02 -07:00
Sebastian Burckhardt d1011144e0
update a bunch of versions to latest, and remove net5.0 target (#284) 2023-07-07 09:01:14 -07:00
Sebastian Burckhardt a909cc7c8c
limit compaction size by absolute size, not relative size (#273)
* compaction size should be limited by absolute size, not relative size

* update minimum for compaction area size
2023-07-07 07:16:38 -07:00
microsoft-github-policy-service[bot] 21d7b5a18c
Onboarding to GitOps.ResourceManagement (#274)
* Add prIssueManagement.yml to onboard repo to GitOps.ResourceManagement as FabricBot replacement

* Deleting fabricbot.json

---------

Co-authored-by: microsoft-github-policy-service[bot] <77245923+microsoft-github-policy-service[bot]@users.noreply.github.com>
2023-07-06 10:06:33 -07:00
Sebastian Burckhardt ebda6e7fac
implement blob batching to improve EH sender throughput (#275)
* implement eh blob batching

* use client guid, not task hub guid, on the client partitions, to avoid downloading/deleting blobs destined for other clients

* fix tracing
2023-07-06 10:04:52 -07:00
Sebastian Burckhardt d47dc356be
Implement clean shutdown of Event Hubs senders (#257)
* shutdown eventhubs senders

* reorder the stopping of timeouts and clientrequests (to avoid ObjectDisposed exceptions)
2023-06-16 06:21:46 -07:00
sebastianburckhardt 0654d60c88 increase version to 1.3.5 2023-05-04 14:26:39 -07:00
Sebastian Burckhardt 6322c43016
reduce FASTER store log memory size to fix excessive memory use (#266) 2023-05-04 11:27:07 -07:00
Sebastian Burckhardt b5ed7f220c
Fixes to azure storage calls (#260)
* do not report error when a retried deletion observes not-found exception

* fix missing Dispose() calls on Azure responses

* remove dependency on FirstAsync()

* address PR feedback
2023-05-04 11:25:30 -07:00
Sebastian Burckhardt 22b8195c4d
limit frequency of failing partition starts (#261)
* add delay when starting partitions repeatedly fails

* Apply suggestions from code review

Co-authored-by: David Justo <david.justo.1996@gmail.com>

* undo unintentional change

---------

Co-authored-by: David Justo <david.justo.1996@gmail.com>
2023-05-04 11:25:05 -07:00
Sebastian Burckhardt c39eb7c73e
fix LoadPublisher.FlushAsync (#262)
* fix LoadPublisher.FlushAsync

* address PR feedback

* address PR feedback (add comment)
2023-05-04 11:24:48 -07:00
Sebastian Burckhardt 76ff28e1b4
update fatal exception handling so we still trace what happened, and terminate the partition (#259) 2023-05-04 11:16:55 -07:00
Sebastian Burckhardt be69241474
Improve trace messages for non-executable instances (#258)
* add partition id to the tracing of client requests

* improve precision of warnings when encountering messages that target non-existent or invalid runtime states

* address PR feedback
2023-05-04 09:33:24 -07:00
sebastianburckhardt 6e1ec94b6d fix line that was missed in #254 2023-04-24 06:52:27 -07:00
Sebastian Burckhardt 2ca5422bf6
fix handling of expired groups in ReassemblyState (#256) 2023-04-21 14:35:39 -07:00
Sebastian Burckhardt 51bd195495
fix bug where an effect is applied twice due to an RMW executing twice in FASTER (#252) 2023-04-21 14:35:17 -07:00
Sebastian Burckhardt 9f1d97db3d
fix exceptions in scaler caused by null metrics (#248) 2023-04-21 13:16:16 -07:00
Sebastian Burckhardt 08f823c711
Fix race in memory tracker (#254)
* fixes a data race in memory tracker that can cause divide-by-zero exceptions

* Add comment to clarify that there is a potential race condition

Co-authored-by: David Justo <david.justo.1996@gmail.com>

---------

Co-authored-by: David Justo <david.justo.1996@gmail.com>
2023-04-21 09:24:00 -07:00
Sebastian Burckhardt 67255a9072
Fix race condition in OutboxState (#250)
* fix race condition in OutboxState

* Apply suggestions from code review

Co-authored-by: David Justo <david.justo.1996@gmail.com>

---------

Co-authored-by: David Justo <david.justo.1996@gmail.com>
2023-04-20 09:02:31 -07:00
Sebastian Burckhardt 6090def4d0
Optimize host shutdown (#249)
* make shutdown for clients and workers parallel

* address PR feedback
2023-04-20 09:02:05 -07:00
sebastianburckhardt 021d5c61f8 update to 1.3.4 2023-04-12 12:55:02 -07:00
Sebastian Burckhardt d432faec4f
replace instance-count assertion failures with non-breaking warnings (#247) 2023-04-12 12:49:59 -07:00
Sebastian Burckhardt 72b865e2d9
fix ignored tuning parameter. (#243) 2023-04-07 16:42:09 -07:00
Sebastian Burckhardt 0d504ab2fc
Add semaphore example to PerformanceBenchmarks. (#242) 2023-04-07 09:42:23 -07:00
sebastianburckhardt 985cf72e2b increase version to 1.3.3 2023-03-17 14:02:24 -07:00
Sebastian Burckhardt aa54a0f776
fix recovery failures caused by changes in FASTER's memory page size parameter (#234) (#235) 2023-03-17 14:01:20 -07:00