论文标题
KRCORE:用于弹性计算的微秒尺度RDMA控制平面
KRCORE: a microsecond-scale RDMA control plane for elastic computing
论文作者
论文摘要
我们提出了KRCORE,这是一个RDMA库,具有用于弹性计算的商品RDMA硬件上的微秒尺度控制平面。 KRCORE可以在10μs内建立成熟的RDMA连接(比动词快数百倍或数千倍),而在每个节点上只能维持(小)固定尺寸的连接元数据,无论群集量表如何。关键想法包括虚拟化原始化的内核空间RDMA连接,而不是从头开始创建一个连接,以及通过静态运输的高级RDMA动态连接传输进行改造,以供低连接开销和高网络速度。在负载尖峰下,KRCORE可以缩短现有的分解密钥价值商店(即种族哈希)的工人引导时间的时间83%。在无服务器计算(即FN)中,KRCore还可以将通过RDMA传输数据的延迟减少99%。
We present KRCORE, an RDMA library with a microsecond-scale control plane on commodity RDMA hardware for elastic computing. KRCORE can establish a full-fledged RDMA connection within 10μs (hundreds or thousands of times faster than verbs), while only maintaining a (small) fixed-sized connection metadata at each node, regardless of the cluster scale. The key ideas include virtualizing pre-initialized kernel-space RDMA connections instead of creating one from scratch, and retrofitting advanced RDMA dynamic connected transport with static transport for both low connection overhead and high networking speed. Under load spikes, KRCORE can shorten the worker bootstrap time of an existing disaggregated key-value store (namely RACE Hashing) by 83%. In serverless computing (namely Fn), KRCORE can also reduce the latency for transferring data through RDMA by 99%.