论文标题

部分可观测时空混沌系统的无模型预测

NGAME: Negative Mining-aware Mini-batching for Extreme Classification

论文作者

Dahiya, Kunal, Gupta, Nilesh, Saini, Deepak, Soni, Akshay, Wang, Yajun, Dave, Kushal, Jiao, Jian, K, Gururaj, Dey, Prasenjit, Singh, Amit, Hada, Deepesh, Jain, Vidit, Paliwal, Bhawna, Mittal, Anshul, Mehta, Sonu, Ramjee, Ramachandran, Agarwal, Sumeet, Kar, Purushottam, Varma, Manik

论文摘要

极端分类(XC)试图用最大的标签集中标记标签子集标记数据点。通过使用稀疏,手工制作的特征的早期XC方法优越,使用XC的优越性,以其数据点和标签进行深度XC的深入XC引起了很多关注。负挖掘技术已成为所有深XC方法的关键组成部分,使它们可以扩展到数百万个标签。但是,尽管有最近的进步,但使用大型编码器体系结构(例如变形金刚)培训深度XC模型仍然具有挑战性。本文确定,流行的负挖掘技术的内存通常会迫使迷你批量尺寸保持小而缓慢的训练。作为回应,本文介绍了Ngame,这是一种轻巧的迷你批次创建技术,可证明可证明准确的内部负面样品。这使得与现有负面抽样技术相比,具有更大的迷你批次培训,提供更快的收敛性和更高的精度。发现Ngame的准确性比各种基准数据集的最先进方法要高16%,以进行极端分类,并且在响应用户网页访问中检索搜索引擎查询以显示个性化的广告时,可以检索搜索引擎查询。在流行搜索引擎的实时A/B测试中,Ngame在点击率率中的收益最高可达23%。

Extreme Classification (XC) seeks to tag data points with the most relevant subset of labels from an extremely large label set. Performing deep XC with dense, learnt representations for data points and labels has attracted much attention due to its superiority over earlier XC methods that used sparse, hand-crafted features. Negative mining techniques have emerged as a critical component of all deep XC methods that allow them to scale to millions of labels. However, despite recent advances, training deep XC models with large encoder architectures such as transformers remains challenging. This paper identifies that memory overheads of popular negative mining techniques often force mini-batch sizes to remain small and slow training down. In response, this paper introduces NGAME, a light-weight mini-batch creation technique that offers provably accurate in-batch negative samples. This allows training with larger mini-batches offering significantly faster convergence and higher accuracies than existing negative sampling techniques. NGAME was found to be up to 16% more accurate than state-of-the-art methods on a wide array of benchmark datasets for extreme classification, as well as 3% more accurate at retrieving search engine queries in response to a user webpage visit to show personalized ads. In live A/B tests on a popular search engine, NGAME yielded up to 23% gains in click-through-rates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源