论文标题
通过阈值优化从多个子任务中的可靠决策:野外内容中等
Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild
论文作者
论文摘要
社交媒体平台难以通过内容审核来保护用户免受有害内容的影响。这些平台最近利用机器学习模型来应对每天大量的用户生成内容。由于节制政策因国家和产品类型而有所不同,因此每项政策训练和部署模型是很常见的。但是,这种方法效率很低,尤其是当策略发生变化时,需要在移动的数据分布上重新标记并重新训练数据集。为了减轻这种成本降低,社交媒体平台经常采用第三方内容审核服务,这些服务提供了多个子任务的预测得分,例如预测未成年人,粗鲁的手势或武器的存在,而不是直接提供最终的节制决策。但是,尚未广泛探索从多个子任务的预测分数中做出可靠的自动审核决策。在这项研究中,我们制定了内容节制的现实情况,并引入了一种简单而有效的阈值优化方法,该方法搜索了多个子任务的最佳阈值,以以具有成本效益的方式做出可靠的适度决策。广泛的实验表明,与现有的阈值优化方法和启发式方法相比,我们的方法在内容节制中表现出更好的性能。
Social media platforms struggle to protect users from harmful content through content moderation. These platforms have recently leveraged machine learning models to cope with the vast amount of user-generated content daily. Since moderation policies vary depending on countries and types of products, it is common to train and deploy the models per policy. However, this approach is highly inefficient, especially when the policies change, requiring dataset re-labeling and model re-training on the shifted data distribution. To alleviate this cost inefficiency, social media platforms often employ third-party content moderation services that provide prediction scores of multiple subtasks, such as predicting the existence of underage personnel, rude gestures, or weapons, instead of directly providing final moderation decisions. However, making a reliable automated moderation decision from the prediction scores of the multiple subtasks for a specific target policy has not been widely explored yet. In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way. Extensive experiments demonstrate that our approach shows better performance in content moderation compared to existing threshold optimization methods and heuristics.