论文标题

用标签噪声进行半监督学习的粒子竞争与合作

Particle Competition and Cooperation for Semi-Supervised Learning with Label Noise

论文作者

Breve, Fabricio Aparecido, Zhao, Liang, Quiles, Marcos Gonçalves

论文摘要

半监督学习方法通​​常用于数据集的分类中,其中只有一小部分数据项被标记。在这些情况下,标签噪声是一个至关重要的问题,因为噪声可能很容易扩散到大部分甚至整个数据集,从而导致分类精度的重大降级。因此,开发新技术以减少半监督学习中标签噪声的讨厌影响是一个至关重要的问题。最近,开发了一种基于粒子竞争和合作的基于图的半监督学习方法。在此模型中,粒子在数据集构建的图中行走。竞争发生在代表不同类标签的粒子之间,而合作发生在具有相同标签的粒子之间。本文提出了一种新的粒子竞争与合作算法,该算法是专门设计的,旨在提高标签噪声的鲁棒性,从而提高其标签噪声耐受性。与其他方法不同,提出的一种不需要单独的技术来处理标签噪声。它在独特的过程中对未标记的节点进行分类和由标签噪声影响的节点的重新分类。计算机模拟显示了所提出的方法的分类精度,当应用于某些人工和现实世界数据集时,我们引入了越来越多的标签噪声。将分类精度与以前的粒子竞争与合作算法和其他使用相同情况的基于图形的半监督学习方法进行了比较。结果显示了所提出的方法的有效性。

Semi-supervised learning methods are usually employed in the classification of data sets where only a small subset of the data items is labeled. In these scenarios, label noise is a crucial issue, since the noise may easily spread to a large portion or even the entire data set, leading to major degradation in classification accuracy. Therefore, the development of new techniques to reduce the nasty effects of label noise in semi-supervised learning is a vital issue. Recently, a graph-based semi-supervised learning approach based on Particle competition and cooperation was developed. In this model, particles walk in the graphs constructed from the data sets. Competition takes place among particles representing different class labels, while the cooperation occurs among particles with the same label. This paper presents a new particle competition and cooperation algorithm, specifically designed to increase the robustness to the presence of label noise, improving its label noise tolerance. Different from other methods, the proposed one does not require a separate technique to deal with label noise. It performs classification of unlabeled nodes and reclassification of the nodes affected by label noise in a unique process. Computer simulations show the classification accuracy of the proposed method when applied to some artificial and real-world data sets, in which we introduce increasing amounts of label noise. The classification accuracy is compared to those achieved by previous particle competition and cooperation algorithms and other representative graph-based semi-supervised learning methods using the same scenarios. Results show the effectiveness of the proposed method.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源