论文标题
DVR:Micro-Video建议优化持续时间偏差的观察时间增益
DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias
论文作者
论文摘要
推荐系统容易被数据中的偏差误导。接受偏见数据培训的模型无法捕获用户的真实利益,因此减轻偏见的影响以实现无偏见的建议至关重要。在这项工作中,我们关注的是微视频推荐的基本偏见,即持续时间偏差。具体而言,现有的微观录像带系统通常将观看时间视为最关键的指标,它可以衡量用户观看视频的时间。由于持续时间较长的视频往往会有更长的观看时间,因此存在一种持续时间的偏见,使更长的视频倾向于使用简短的视频推荐。在本文中,我们从经验上表明,普遍使用的指标容易受到持续时间偏见的影响,因此不适合评估微观效率推荐。为了解决这个问题,我们进一步提出了一个公正的评估公制,称为WTG(缩写观察时间增加)。经验结果表明,WTG可以减轻持续时间偏差并更好地衡量建议性能。此外,我们设计了一个名为DVR的简单但有效的模型(用于脱锯的视频推荐简称),可以提供无偏见的微视频推荐,其持续时间不同,并通过对抗性学习学习无偏见的用户偏好。基于两个现实世界数据集的广泛实验表明,DVR成功消除了持续时间偏差,并显着提高了30%的相对进步的建议性能。代码和数据集在https://github.com/tsinghua-fib-lab/wtg-dvr上发布。
Recommender systems are prone to be misled by biases in the data. Models trained with biased data fail to capture the real interests of users, thus it is critical to alleviate the impact of bias to achieve unbiased recommendation. In this work, we focus on an essential bias in micro-video recommendation, duration bias. Specifically, existing micro-video recommender systems usually consider watch time as the most critical metric, which measures how long a user watches a video. Since videos with longer duration tend to have longer watch time, there exists a kind of duration bias, making longer videos tend to be recommended more against short videos. In this paper, we empirically show that commonly-used metrics are vulnerable to duration bias, making them NOT suitable for evaluating micro-video recommendation. To address it, we further propose an unbiased evaluation metric, called WTG (short for Watch Time Gain). Empirical results reveal that WTG can alleviate duration bias and better measure recommendation performance. Moreover, we design a simple yet effective model named DVR (short for Debiased Video Recommendation) that can provide unbiased recommendation of micro-videos with varying duration, and learn unbiased user preferences via adversarial learning. Extensive experiments based on two real-world datasets demonstrate that DVR successfully eliminates duration bias and significantly improves recommendation performance with over 30% relative progress. Codes and datasets are released at https://github.com/tsinghua-fib-lab/WTG-DVR.