论文标题
用预构建的问题空间回答的开放域问题
Open-Domain Question Answering with Pre-Constructed Question Spaces
论文作者
论文摘要
开放域问题回答旨在解决在大量文档集合中找到用户生成问题答案的任务。有两个可用的解决方案家族:猎犬阅读器和基于知识的方法。回猎犬阅读器通常首先使用诸如TF-IDF之类的信息检索方法来定位一些可能与问题相关的文档或段落,然后将检索到的文本提供给神经网络读取器以提取答案。另外,可以从语料库构建知识图,并与之询问以回答用户问题。我们提出了一种新型算法,具有与两个家庭不同的读者重新结构结构。我们的读者回归者首先使用离线阅读器阅读语料库并生成与答案相关的所有可回答问题的集合,然后使用在线检索器通过搜索预先构造的问题空间来响应用户查询,以获取答案,这些答案最有可能以给定的方式询问。我们通过检查两个组件之间的一致性,将回犬阅读器和读取器 - 回归者结果结合到一个答案中。我们声称我们的算法在现有工作中解决了一些瓶颈,并证明了它在现实世界数据集上的准确性卓越。
Open-domain question answering aims at solving the task of locating the answers to user-generated questions in massive collections of documents. There are two families of solutions available: retriever-readers, and knowledge-graph-based approaches. A retriever-reader usually first uses information retrieval methods like TF-IDF to locate some documents or paragraphs that are likely to be relevant to the question, and then feeds the retrieved text to a neural network reader to extract the answer. Alternatively, knowledge graphs can be constructed from the corpus and be queried against to answer user questions. We propose a novel algorithm with a reader-retriever structure that differs from both families. Our reader-retriever first uses an offline reader to read the corpus and generate collections of all answerable questions associated with their answers, and then uses an online retriever to respond to user queries by searching the pre-constructed question spaces for answers that are most likely to be asked in the given way. We further combine retriever-reader and reader-retriever results into one single answer by examining the consistency between the two components. We claim that our algorithm solves some bottlenecks in existing work, and demonstrate that it achieves superior accuracy on real-world datasets.