论文标题
从已知到未知:质量意识的自我提高图神经网络,用于开放式社交事件检测
From Known to Unknown: Quality-aware Self-improving Graph Neural Network for Open Set Social Event Detection
论文作者
论文摘要
最先进的图形神经网络(GNNS)在仅限于封闭的事件时,在社交事件检测任务中取得了巨大的成功。但是,考虑到训练神经网络所需的大量数据以及神经网络处理以前未知数据的有限能力,对于现有基于GNN的方法在开放设置中运行的方法仍然是一个挑战。为了解决这个问题,我们设计了一种质量意识的自我改进图神经网络(QSGNN),该图通过利用最佳的已知样本和可靠的知识转移来扩展知识从已知到未知。具体而言,为了充分利用标记的数据,我们提出了一种新型的监督成对损失,并具有额外的正交间层间关系约束,以训练骨干GNN编码器。知识渊博的事件进一步成为未知事件的强大参考基础,这极大地促使知识获取和转移。当该模型推广到未知数据时,为了确保有效性和可靠性,我们进一步利用了伪成对标签生成,选择和质量评估的参考相似性分布向量。遵循主动学习的多样性原则,我们的方法选择具有生成的伪标签的多种对样品来微调GNN编码器。此外,我们提出了一种新颖的质量指导优化,其中伪标签的贡献是根据一致性加权的。我们在两个大型现实社交活动数据集上彻底评估了我们的模型。实验表明,我们的模型可实现最新的结果,并很好地扩展到未知事件。
State-of-the-art Graph Neural Networks (GNNs) have achieved tremendous success in social event detection tasks when restricted to a closed set of events. However, considering the large amount of data needed for training a neural network and the limited ability of a neural network in handling previously unknown data, it remains a challenge for existing GNN-based methods to operate in an open set setting. To address this problem, we design a Quality-aware Self-improving Graph Neural Network (QSGNN) which extends the knowledge from known to unknown by leveraging the best of known samples and reliable knowledge transfer. Specifically, to fully exploit the labeled data, we propose a novel supervised pairwise loss with an additional orthogonal inter-class relation constraint to train the backbone GNN encoder. The learnt, already-known events further serve as strong reference bases for the unknown ones, which greatly prompts knowledge acquisition and transfer. When the model is generalized to unknown data, to ensure the effectiveness and reliability, we further leverage the reference similarity distribution vectors for pseudo pairwise label generation, selection and quality assessment. Following the diversity principle of active learning, our method selects diverse pair samples with the generated pseudo labels to fine-tune the GNN encoder. Besides, we propose a novel quality-guided optimization in which the contributions of pseudo labels are weighted based on consistency. We thoroughly evaluate our model on two large real-world social event datasets. Experiments demonstrate that our model achieves state-of-the-art results and extends well to unknown events.