Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Per Partition Automatic Failover] Concurrent Detection of Write Regions During Failover #4858

Open
kundadebdatta opened this issue Oct 28, 2024 · 0 comments

Comments

@kundadebdatta
Copy link
Member

Acceptance Criteria:

During the partition level failover through the .NET v3 SDK, it is often identified that the detection of the write regions takes longer, sometimes more than a minute. The primary reason for that is the following two:

  • The offending partition takes at least a minute from the backend to failover to another region.

  • The current (write region) detection logic from the SDK is on a round robin fashion, therefore, the SDK loops through all the regions in the account topology to find out the potential write region.

Though it’s beyond the scope of the SDK to optimize the backend failover time, however the detection logic from the SDK can be made a bit faster. This design proposes an optimization to detect the write regions in parallel by issuing concurrent hedging requests to all the available regions in the account topology.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: In Progress
Development

No branches or pull requests

1 participant