论文标题

PTPARL-D:44年葡萄牙议会辩论的注释语料库

PTPARL-D: Annotated Corpus of 44 years of Portuguese Parliament debates

论文作者

Almeida, Paulo, Marques-Pita, Manuel, Gonçalves-Sá, Joana

论文摘要

在代表民主制度中,有些人以其他人的名义决定,这些民选官员通常聚集在公共议会中,例如议会,他们讨论政策,立法和对基本倡议的投票。这种民主进程的一个核心方面是全体辩论,进行了重要的公众讨论。世界各地的许多议会越来越多地将这些辩论的成绩单和其他议会数据的成绩单以可供公众访问的数字格式延长,从而提高了透明度和问责制。此外,一些议会将旧纸笔录带入半结构化数字格式。但是,这些记录通常仅作为原始文本甚至是图像提供,几乎没有注释,并且格式不一致,从而使它们难以分析和研究,从而降低了透明度和公众范围。在这里,我们介绍了1976年至2019年葡萄牙议会中注释的辩论语料库PTPARL-D,涵盖了整个葡萄牙民主时期。

In a representative democracy, some decide in the name of the rest, and these elected officials are commonly gathered in public assemblies, such as parliaments, where they discuss policies, legislate, and vote on fundamental initiatives. A core aspect of such democratic processes are the plenary debates, where important public discussions take place. Many parliaments around the world are increasingly keeping the transcripts of such debates, and other parliamentary data, in digital formats accessible to the public, increasing transparency and accountability. Furthermore, some parliaments are bringing old paper transcripts to semi-structured digital formats. However, these records are often only provided as raw text or even as images, with little to no annotation, and inconsistent formats, making them difficult to analyze and study, reducing both transparency and public reach. Here, we present PTPARL-D, an annotated corpus of debates in the Portuguese Parliament, from 1976 to 2019, covering the entire period of Portuguese democracy.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源