论文标题
建模个性化的项目频率信息,以供下一键推荐
Modeling Personalized Item Frequency Information for Next-basket Recommendation
论文作者
论文摘要
在电子商务和零售业中,下一篮子推荐(NBR)很普遍。在这种情况下,用户一次购买一组物品(篮子)。 NBR基于一系列篮子进行顺序建模和建议。通常,NBR比广泛研究的顺序(基于会话)的建议更复杂,该建议建议基于一系列项目的下一个项目。事实证明,复发性神经网络(RNN)非常有效,因此适用于NBR。但是,我们认为现有的RNN无法在建议方案中直接捕获项目频率信息。 通过仔细分析现实世界数据集,我们发现{\ em个性化项目频率}(PIF)信息(记录了用户购买的每个项目的次数)为NBR提供了两个关键信号。但是,现有方法在很大程度上被忽略了。即使现有的基于RNN的方法具有强大的表示能力,我们的经验结果表明它们无法学习和捕获PIF。结果,现有方法无法完全利用PIF中包含的关键信号。鉴于RNN的这种固有限制,我们提出了一个简单的基于项目频率的k-nearest邻居(KNN)方法,以直接利用这些关键信号。我们在四个公共现实世界数据集上评估了我们的方法。尽管它相对简单,但当与PIF相关的模式在数据中起重要作用时,我们的方法经常优于最先进的NBR方法(包括基于RNN的深度学习方法)。
Next-basket recommendation (NBR) is prevalent in e-commerce and retail industry. In this scenario, a user purchases a set of items (a basket) at a time. NBR performs sequential modeling and recommendation based on a sequence of baskets. NBR is in general more complex than the widely studied sequential (session-based) recommendation which recommends the next item based on a sequence of items. Recurrent neural network (RNN) has proved to be very effective for sequential modeling and thus been adapted for NBR. However, we argue that existing RNNs cannot directly capture item frequency information in the recommendation scenario. Through careful analysis of real-world datasets, we find that {\em personalized item frequency} (PIF) information (which records the number of times that each item is purchased by a user) provides two critical signals for NBR. But, this has been largely ignored by existing methods. Even though existing methods such as RNN based methods have strong representation ability, our empirical results show that they fail to learn and capture PIF. As a result, existing methods cannot fully exploit the critical signals contained in PIF. Given this inherent limitation of RNNs, we propose a simple item frequency based k-nearest neighbors (kNN) method to directly utilize these critical signals. We evaluate our method on four public real-world datasets. Despite its relative simplicity, our method frequently outperforms the state-of-the-art NBR methods -- including deep learning based methods using RNNs -- when patterns associated with PIF play an important role in the data.