在光束搜索下学习最佳树模型

论文标题

在光束搜索下学习最佳树模型

Learning Optimal Tree Models Under Beam Search

论文作者

Zhuo, Jingwei, Xu, Ziru, Dai, Wei, Zhu, Han, Li, Han, Xu, Jian, Gai, Kun

论文摘要

从计算限制下从一个极大的目标设置中检索相关目标是信息检索和推荐系统的普遍挑战。树模型将目标制定为具有可训练节点得分手的树木的叶子，由于它们在训练和测试中都对数计算复杂性，因此在应对这一挑战方面引起了很多兴趣。基于树的深层模型（TDM）和概率标签树（PLT）是两种代表性的类型。尽管取得了许多实际的成功，但现有的树模型遭受了训练测试差异的困扰，在训练中未考虑由测试中的梁搜索引起的检索性能恶化。这导致最相关的目标与光束搜索的目标之间的固有差距，甚至是最佳训练的节点得分者。我们迈出了理论上理解和分析此问题的第一步，并在光束搜索下的梁搜索和校准下发展了贝叶斯最佳性的概念，作为为此目的的一般分析工具。此外，为了消除差异，我们提出了一种新型算法，用于在梁搜索下学习最佳树模型。关于合成和实际数据的实验验证了我们的理论分析的合理性，并证明了与最新方法相比，我们的算法的优越性。

Retrieving relevant targets from an extremely large target set under computational limits is a common challenge for information retrieval and recommendation systems. Tree models, which formulate targets as leaves of a tree with trainable node-wise scorers, have attracted a lot of interests in tackling this challenge due to their logarithmic computational complexity in both training and testing. Tree-based deep models (TDMs) and probabilistic label trees (PLTs) are two representative kinds of them. Though achieving many practical successes, existing tree models suffer from the training-testing discrepancy, where the retrieval performance deterioration caused by beam search in testing is not considered in training. This leads to an intrinsic gap between the most relevant targets and those retrieved by beam search with even the optimally trained node-wise scorers. We take a first step towards understanding and analyzing this problem theoretically, and develop the concept of Bayes optimality under beam search and calibration under beam search as general analyzing tools for this purpose. Moreover, to eliminate the discrepancy, we propose a novel algorithm for learning optimal tree models under beam search. Experiments on both synthetic and real data verify the rationality of our theoretical analysis and demonstrate the superiority of our algorithm compared to state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题