自动专家选择多幕科和多任务搜索

论文标题

自动专家选择多幕科和多任务搜索

Automatic Expert Selection for Multi-Scenario and Multi-Task Search

论文作者

Zou, Xinyu, Hu, Zhi, Zhao, Yiming, Ding, Xuchu, Liu, Zhongyi, Li, Chenliang, Sun, Aixin

论文摘要

Multi-Scenario学习（MSL）使服务提供商可以通过用户的地理区域将服务提供者分开，以满足用户的细粒度需求。在每种情况下，都需要优化多个特定于任务的目标，例如，单击速率和转换率，称为多任务学习（MTL）。 MSL和MTL的最新解决方案主要基于多门的混合物（MMOE）体系结构。 MMOE结构通常是静态的，其设计需要特定于域的知识，从而使其在处理MSL和MTL方面的有效性降低。在本文中，我们提出了一个新颖的自动专家选择框架，用于多幕科和多任务搜索，名为AESM^{2}。 AESM^{2}将MSL和MTL集成到具有自动结构学习的统一框架中。具体而言，AESM^{2}堆叠多任务层在多幕层上。这种分层设计使我们能够在不同方案之间灵活建立固有的连接，同时还支持针对不同任务的高级功能提取。在每个多幕科/多任务层中，提出了一种新颖的专家选择算法，以自动识别每个输入的方案 - 特定于任务和共享的专家。两个现实世界中的大规模数据集的实验证明了AESM^{2}对一组强基础的有效性。在线A/B测试还显示了多个指标的大量性能增长。当前，AESM^{2}已在线部署以服务主要流量。

Multi-scenario learning (MSL) enables a service provider to cater for users' fine-grained demands by separating services for different user sectors, e.g., by user's geographical region. Under each scenario there is a need to optimize multiple task-specific targets e.g., click through rate and conversion rate, known as multi-task learning (MTL). Recent solutions for MSL and MTL are mostly based on the multi-gate mixture-of-experts (MMoE) architecture. MMoE structure is typically static and its design requires domain-specific knowledge, making it less effective in handling both MSL and MTL. In this paper, we propose a novel Automatic Expert Selection framework for Multi-scenario and Multi-task search, named AESM^{2}. AESM^{2} integrates both MSL and MTL into a unified framework with an automatic structure learning. Specifically, AESM^{2} stacks multi-task layers over multi-scenario layers. This hierarchical design enables us to flexibly establish intrinsic connections between different scenarios, and at the same time also supports high-level feature extraction for different tasks. At each multi-scenario/multi-task layer, a novel expert selection algorithm is proposed to automatically identify scenario-/task-specific and shared experts for each input. Experiments over two real-world large-scale datasets demonstrate the effectiveness of AESM^{2} over a battery of strong baselines. Online A/B test also shows substantial performance gain on multiple metrics. Currently, AESM^{2} has been deployed online for serving major traffic.

下载PDF全文

下载文献需遵守相关版权规定

论文标题