论文标题
值得支票的事实主张的基准数据集
A Benchmark Dataset of Check-worthy Factual Claims
论文作者
论文摘要
在本文中,我们介绍了从美国大选总统辩论中提取的23,533项陈述的索赔,并由人类编码人员注释。可以利用索赔的数据集来构建计算方法,以确定值得从数字或传统媒体的众多来源中进行事实检查的主张。索赔的数据集可公开提供给研究社区,可以在http://doi.org/10.5281/zenodo.3609356上找到。
In this paper we present the ClaimBuster dataset of 23,533 statements extracted from all U.S. general election presidential debates and annotated by human coders. The ClaimBuster dataset can be leveraged in building computational methods to identify claims that are worth fact-checking from the myriad of sources of digital or traditional media. The ClaimBuster dataset is publicly available to the research community, and it can be found at http://doi.org/10.5281/zenodo.3609356.