Tales of the Tail (paper)
2014, cited 232 times
Cloud Computing and Resource Management; Distributed Systems and Fault Tolerance; Parallel Computing and Optimization Techniques
Abstract
Interactive services often have large-scale parallel implementations. To deliver fast responses, the median and tail latencies of a service's components must be low. In this paper, we explore the hardware, OS, and application-level sources of poor tail latency in high-throughput servers executing on multi-core machines.
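
To make the link between parallelism and tail latency concrete, the following is a minimal sketch and is not taken from the paper. It computes the median and 99th-percentile latency from a list of samples with the nearest-rank method. It then estimates how often a request that waits on several components is delayed by at least one of them. The sample values, the function names `percentile` and `fraction_hitting_tail`, and the assumption that component latencies are independent are all illustrative.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def fraction_hitting_tail(fan_out, p=99.0):
    """Probability that at least one of `fan_out` components exceeds its
    own p-th percentile latency, ASSUMING the components are independent."""
    return 1.0 - (p / 100) ** fan_out


# Hypothetical latency samples in milliseconds (illustrative only).
latencies_ms = [1.0] * 98 + [5.0, 50.0]

median_ms = percentile(latencies_ms, 50)
p99_ms = percentile(latencies_ms, 99)

print(f"median = {median_ms} ms, p99 = {p99_ms} ms")
print(f"fan-out 100: {fraction_hitting_tail(100):.1%} of requests "
      "wait on at least one component slower than its p99")
```

Under the independence assumption, a request that fans out to 100 components meets at least one slower-than-p99 response roughly 63% of the time. This is one way to see why the component-level tail, and not only the median, shapes the latency of the whole service.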