论文标题
快速在线最近邻居搜索的接近图维护
Proximity Graph Maintenance for Fast Online Nearest Neighbor Search
论文作者
论文摘要
大约最近的邻居(ANN)搜索是(例如)推荐系统部署的基本技术。最近的研究将基于图形的近端方法带入从业人员的注意力中 - 基于图形的近端方法优于其他解决方案,例如量化,哈希和基于树的ANN算法族。在当前的推荐系统中,随着用户和项目动态变化,数据点插入,删除和查询以在线方式流入系统中。随着接近图是通过将数据点插入新顶点逐步构建的,在线插入和查询在接近图中得到很好的支持。但是,数据删除会导致从接近图索引中删除顶点,而在先前的研究中没有讨论正确的图索引更新机制。为了应对挑战,我们建议在线ANN的增量接近图维护(IPGM)算法。 IPGM支持在接近图上的顶点删除和插入。给定顶点删除请求,我们彻底研究解决方案以更新顶点的连接。提出的更新方案消除了在线ANN方法上的性能下降,使该算法适用于实用系统。
Approximate Nearest Neighbor (ANN) search is a fundamental technique for (e.g.,) the deployment of recommender systems. Recent studies bring proximity graph-based methods into practitioners' attention -- proximity graph-based methods outperform other solutions such as quantization, hashing, and tree-based ANN algorithm families. In current recommendation systems, data point insertions, deletions, and queries are streamed into the system in an online fashion as users and items change dynamically. As proximity graphs are constructed incrementally by inserting data points as new vertices into the graph, online insertions and queries are well-supported in proximity graph. However, a data point deletion incurs removing a vertex from the proximity graph index, while no proper graph index updating mechanisms are discussed in previous studies. To tackle the challenge, we propose an incremental proximity graph maintenance (IPGM) algorithm for online ANN. IPGM supports both vertex deletion and insertion on proximity graphs. Given a vertex deletion request, we thoroughly investigate solutions to update the connections of the vertex. The proposed updating scheme eliminates the performance drop in online ANN methods on proximity graphs, making the algorithm suitable for practical systems.