论文标题
PowerTransFormer:无监督的可控修订,用于偏见语言校正
PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction
论文作者
论文摘要
无意识的偏见在现代文本和媒体中仍然很普遍,呼吁算法可以帮助作家进行偏见。例如,一个故事中的女性角色通常被描绘成被动和无能为力(“她做白日梦是关于成为医生的”),而男人则被描绘成更积极和强大的(“他追求成为医生的梦想”)。 我们制定 *可控性偏见 *,这是一项新的修订任务,旨在重写给定文本,以纠正字符刻画中隐式且潜在的不良偏见。然后,我们将PowerTransFormer引入一种方法,该方法通过内涵框架的镜头(SAP等,2017)进行了DEBIAS,该方法对动词谓词进行了对隐含动力动力学的务实知识。我们任务的一个关键挑战是缺乏平行语料库。为了应对这一挑战,我们使用辅助监督采用了一种无监督的方法,该方法具有基于重建损失的相关任务,例如释义和自学,并以预审计的语言模型为基础。 通过基于自动和人类评估的全面实验,我们证明我们的方法的表现优于相关任务中的消融和现有方法。此外,我们演示了使用PowerTransFormer作为减轻电影脚本中角色刻画中有据可查的性别偏见的一步。
Unconscious biases continue to be prevalent in modern text and media, calling for algorithms that can assist writers with bias correction. For example, a female character in a story is often portrayed as passive and powerless ("She daydreams about being a doctor") while a man is portrayed as more proactive and powerful ("He pursues his dream of being a doctor"). We formulate *Controllable Debiasing*, a new revision task that aims to rewrite a given text to correct the implicit and potentially undesirable bias in character portrayals. We then introduce PowerTransformer as an approach that debiases text through the lens of connotation frames (Sap et al., 2017), which encode pragmatic knowledge of implied power dynamics with respect to verb predicates. One key challenge of our task is the lack of parallel corpora. To address this challenge, we adopt an unsupervised approach using auxiliary supervision with related tasks such as paraphrasing and self-supervision based on a reconstruction loss, building on pretrained language models. Through comprehensive experiments based on automatic and human evaluations, we demonstrate that our approach outperforms ablations and existing methods from related tasks. Furthermore, we demonstrate the use of PowerTransformer as a step toward mitigating the well-documented gender bias in character portrayal in movie scripts.