论文标题
Oracle引导的对比聚类
Oracle-guided Contrastive Clustering
论文作者
论文摘要
深层聚类旨在通过深层体系结构学习聚类表示。大多数现有方法通常以最大化聚类性能的独特目标进行聚类,这忽略了聚类任务的个性化需求。%,并导致无引导的聚类解决方案。但是,在实际情况下,甲壳可能通过利用不同的标准(例如不同的语义(背景,颜色,对象等)),然后提出个性化的聚类任务来聚集未标记的数据。为了实现任务意识的聚类结果,在这项研究中,通过交互方式对``相同的群集''查询``相同群集''查询来进行聚类,以``同一群集''查询的询问,具体是特定于独特的。相同的群集对扩展了对比度学习的积极实例对,以提取方向感知的特征表示,在具有独特需求的甲骨文的指导下,查询结果可能会导致OCC的聚类导致所需的方向的群集在理论上,以确保较高的范围。在实验上,广泛的结果验证了OCC可以准确地沿特定方向聚类,并且据我们所知,它也大大优于SOTA聚类方法。
Deep clustering aims to learn a clustering representation through deep architectures. Most of the existing methods usually conduct clustering with the unique goal of maximizing clustering performance, that ignores the personalized demand of clustering tasks.% and results in unguided clustering solutions. However, in real scenarios, oracles may tend to cluster unlabeled data by exploiting distinct criteria, such as distinct semantics (background, color, object, etc.), and then put forward personalized clustering tasks. To achieve task-aware clustering results, in this study, Oracle-guided Contrastive Clustering(OCC) is then proposed to cluster by interactively making pairwise ``same-cluster" queries to oracles with distinctive demands. Specifically, inspired by active learning, some informative instance pairs are queried, and evaluated by oracles whether the pairs are in the same cluster according to their desired orientation. And then these queried same-cluster pairs extend the set of positive instance pairs for contrastive learning, guiding OCC to extract orientation-aware feature representation. Accordingly, the query results, guided by oracles with distinctive demands, may drive the OCC's clustering results in a desired orientation. Theoretically, the clustering risk in an active learning manner is given with a tighter upper bound, that guarantees active queries to oracles do mitigate the clustering risk. Experimentally, extensive results verify that OCC can cluster accurately along the specific orientation and it substantially outperforms the SOTA clustering methods as well. To the best of our knowledge, it is the first deep framework to perform personalized clustering.