将同时思考用于数值推理

论文标题

将同时思考用于数值推理

Chaining Simultaneous Thoughts for Numerical Reasoning

论文作者

Shao, Zhihong, Huang, Fei, Huang, Minlie

论文摘要

鉴于丰富的信息隐藏在文本中无处不在的数字后面，因此文本上的数值推理应该是AI系统的重要技能。为了得出精确的方程式以解决数值推理问题，先前的工作着重于建模方程结构，并提出了各种结构化解码器。尽管结构建模被证明是有效的，但这些结构化解码器在预定义的自回归顺序中构建一个单一方程式，有可能对模型应如何掌握推理过程的不必要限制。直觉上，人类可能没有预定的秩序弹出许多思想。思想不仅限于手头的问题，甚至可能关注其他相关问题。通过比较多样化的思想和链接相关的作品，人类不容易出错。在本文中，我们采用了这种灵感，并提出了Cantor，这是一种数值原因，它使用有向的无环形图对推理步骤进行建模，我们同时制作了不同的推理步骤，而无需预先定义的解码依赖关系，并比较和链条相关的推理步骤以达到解决方案。广泛的实验证明了在完全监督和弱监督的环境下Cantor的有效性。

Given that rich information is hidden behind ubiquitous numbers in text, numerical reasoning over text should be an essential skill of AI systems. To derive precise equations to solve numerical reasoning problems, previous work focused on modeling the structures of equations, and has proposed various structured decoders. Though structure modeling proves to be effective, these structured decoders construct a single equation in a pre-defined autoregressive order, potentially placing an unnecessary restriction on how a model should grasp the reasoning process. Intuitively, humans may have numerous pieces of thoughts popping up in no pre-defined order; thoughts are not limited to the problem at hand, and can even be concerned with other related problems. By comparing diverse thoughts and chaining relevant pieces, humans are less prone to errors. In this paper, we take this inspiration and propose CANTOR, a numerical reasoner that models reasoning steps using a directed acyclic graph where we produce diverse reasoning steps simultaneously without pre-defined decoding dependencies, and compare and chain relevant ones to reach a solution. Extensive experiments demonstrated the effectiveness of CANTOR under both fully-supervised and weakly-supervised settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题