论文标题
社交对话中以实体为中心背景的统一方法
A Unified Approach to Entity-Centric Context Tracking in Social Conversations
论文作者
论文摘要
在人类对话中,上下文跟踪涉及识别重要实体并跟踪其属性和关系。这是一个具有挑战性的问题,涵盖了几个子任务,例如插槽标签,核心分辨率,解决复数提及和实体链接。我们将这个问题作为端到端建模任务,其中对话上下文是由包含到目前为止所述实体参考的实体存储库表示的,其属性及其之间的关系。存储库进行逐圈的更新,从而使培训和推理计算有效,即使在长时间的对话中也是如此。本文通过两种方式对该框架进行调查奠定了基础。首先,我们释放了一个大规模的人类对话语料库,可与人和位置注释进行跟踪。它包含7000多个对话,平均每次对话为11.8转,5.8个实体和15.2个参考。其次,我们开源一个用于上下文跟踪的神经网络体系结构。最后,我们将该网络与IT所包含的子任务的最新方法进行了比较,并报告了有关权衡的结果。
In human-human conversations, Context Tracking deals with identifying important entities and keeping track of their properties and relationships. This is a challenging problem that encompasses several subtasks such as slot tagging, coreference resolution, resolving plural mentions and entity linking. We approach this problem as an end-to-end modeling task where the conversational context is represented by an entity repository containing the entity references mentioned so far, their properties and the relationships between them. The repository is updated turn-by-turn, thus making training and inference computationally efficient even for long conversations. This paper lays the groundwork for an investigation of this framework in two ways. First, we release Contrack, a large scale human-human conversation corpus for context tracking with people and location annotations. It contains over 7000 conversations with an average of 11.8 turns, 5.8 entities and 15.2 references per conversation. Second, we open-source a neural network architecture for context tracking. Finally we compare this network to state-of-the-art approaches for the subtasks it subsumes and report results on the involved tradeoffs.