From http://blog.acolyer.org/2015/01/15/the-tail-at-scale/:
Hedged requests: send the same request to multiple servers, and use whichever response comes back first. To avoid doubling or tripling your computation load, though, don't send the hedging requests straight away:
defer sending a secondary request until the first request has been outstanding for more than the 95th-percentile expected latency for this class of requests. This approach limits the additional load to approximately 5% while substantially shortening the tail latency.
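To make the idea concrete, here is a minimal sketch of a deferred hedged request using Python's `asyncio`. It is only an illustration under stated assumptions, not a proposed API for this project: the names `hedged_request`, `senders`, and `hedge_delay` are hypothetical, each sender is assumed to be an idempotent zero-argument coroutine function that talks to one replica, and `hedge_delay` is assumed to be set by the caller to roughly the 95th-percentile latency for that class of request.

```python
import asyncio
from typing import Awaitable, Callable, Sequence, TypeVar

T = TypeVar("T")


async def hedged_request(
    senders: Sequence[Callable[[], Awaitable[T]]],
    hedge_delay: float,
) -> T:
    """Return the first successful response among the replicas.

    Hypothetical helper: `senders` are coroutine functions, one per replica.
    The next replica is contacted only if no response has arrived within
    `hedge_delay` seconds (assumed to be about the p95 latency), or
    immediately if an outstanding attempt fails.
    """
    if not senders:
        raise ValueError("at least one sender is required")

    remaining = list(senders)
    tasks = set()
    last_error = None
    try:
        while remaining or tasks:
            if remaining:
                # Start the next attempt (the primary on the first pass).
                tasks.add(asyncio.create_task(remaining.pop(0)()))
            # Wait at most hedge_delay while more replicas are available;
            # once all replicas have been tried, wait without a timeout.
            timeout = hedge_delay if remaining else None
            done, tasks = await asyncio.wait(
                tasks, timeout=timeout, return_when=asyncio.FIRST_COMPLETED
            )
            for task in done:
                if task.exception() is None:
                    return task.result()
                last_error = task.exception()
        # Every attempt failed: surface the most recent error.
        raise last_error
    finally:
        # Cancel attempts that lost the race so they stop consuming resources.
        for task in tasks:
            task.cancel()
```

With this shape, a request that answers within `hedge_delay` never triggers a second attempt, which is what keeps the extra load small. Cancelling the losing attempt only stops the client-side wait; whether the server also stops working on it depends on the transport, which this sketch does not model.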