Scalable Load Balancing in the Presence of Heterogeneous Servers

Paper title

Scalable Load Balancing in the Presence of Heterogeneous Servers

Authors

Kristen Gardner, Jazeem Abdul Jaleel, Alexander Wickeham, Sherwin Doroudi

Abstract

Heterogeneity is becoming increasingly ubiquitous in modern large-scale computer systems. Developing good load balancing policies for systems whose resources have varying speeds is crucial in achieving low response times. Indeed, how best to dispatch jobs to servers is a classical and well-studied problem in the queueing literature. Yet the bulk of existing work on large-scale systems assumes homogeneous servers; unfortunately, policies that perform well in the homogeneous setting can cause unacceptably poor performance---or even instability---in heterogeneous systems. We adapt the "power-of-d" versions of both the Join-the-Idle-Queue and Join-the-Shortest-Queue policies to design two corresponding families of heterogeneity-aware dispatching policies, each of which is parameterized by a pair of routing probabilities. Unlike their heterogeneity-unaware counterparts, our policies use server speed information both when choosing which servers to query and when probabilistically deciding where (among the queried servers) to dispatch jobs. Both of our policy families are analytically tractable: our mean response time and queue length distribution analyses are exact as the number of servers approaches infinity, under standard assumptions. Furthermore, our policy families achieve maximal stability and outperform well-known dispatching rules---including heterogeneity-aware policies such as Shortest-Expected-Delay---with respect to mean response time.
