论文标题
自动专家选择多幕科和多任务搜索
Automatic Expert Selection for Multi-Scenario and Multi-Task Search
论文作者
论文摘要
Multi-Scenario学习(MSL)使服务提供商可以通过用户的地理区域将服务提供者分开,以满足用户的细粒度需求。在每种情况下,都需要优化多个特定于任务的目标,例如,单击速率和转换率,称为多任务学习(MTL)。 MSL和MTL的最新解决方案主要基于多门的混合物(MMOE)体系结构。 MMOE结构通常是静态的,其设计需要特定于域的知识,从而使其在处理MSL和MTL方面的有效性降低。在本文中,我们提出了一个新颖的自动专家选择框架,用于多幕科和多任务搜索,名为AESM^{2}。 AESM^{2}将MSL和MTL集成到具有自动结构学习的统一框架中。具体而言,AESM^{2}堆叠多任务层在多幕层上。这种分层设计使我们能够在不同方案之间灵活建立固有的连接,同时还支持针对不同任务的高级功能提取。在每个多幕科/多任务层中,提出了一种新颖的专家选择算法,以自动识别每个输入的方案 - 特定于任务和共享的专家。两个现实世界中的大规模数据集的实验证明了AESM^{2}对一组强基础的有效性。在线A/B测试还显示了多个指标的大量性能增长。当前,AESM^{2}已在线部署以服务主要流量。
Multi-scenario learning (MSL) enables a service provider to cater for users' fine-grained demands by separating services for different user sectors, e.g., by user's geographical region. Under each scenario there is a need to optimize multiple task-specific targets e.g., click through rate and conversion rate, known as multi-task learning (MTL). Recent solutions for MSL and MTL are mostly based on the multi-gate mixture-of-experts (MMoE) architecture. MMoE structure is typically static and its design requires domain-specific knowledge, making it less effective in handling both MSL and MTL. In this paper, we propose a novel Automatic Expert Selection framework for Multi-scenario and Multi-task search, named AESM^{2}. AESM^{2} integrates both MSL and MTL into a unified framework with an automatic structure learning. Specifically, AESM^{2} stacks multi-task layers over multi-scenario layers. This hierarchical design enables us to flexibly establish intrinsic connections between different scenarios, and at the same time also supports high-level feature extraction for different tasks. At each multi-scenario/multi-task layer, a novel expert selection algorithm is proposed to automatically identify scenario-/task-specific and shared experts for each input. Experiments over two real-world large-scale datasets demonstrate the effectiveness of AESM^{2} over a battery of strong baselines. Online A/B test also shows substantial performance gain on multiple metrics. Currently, AESM^{2} has been deployed online for serving major traffic.