Reproduction and Replication of an Adversarial Stylometry Experiment

Paper title

Reproduction and Replication of an Adversarial Stylometry Experiment

Authors

Haining Wang, Patrick Juola, Allen Riddell

Abstract

Maintaining anonymity while communicating using natural language remains a challenge. Standard authorship attribution techniques that analyze candidate authors' writing styles achieve uncomfortably high accuracy even when the number of candidate authors is high. Adversarial stylometry defends against authorship attribution with the goal of preventing unwanted deanonymization. This paper reproduces and replicates experiments in a seminal study of defenses against authorship attribution (Brennan et al., 2012). We are able to successfully reproduce and replicate the original results, although we conclude that the effectiveness of the defenses studied is overstated due to a lack of a control group in the original study. In our replication, we find new evidence suggesting that an entirely automatic method, round-trip translation, merits re-examination as it appears to reduce the effectiveness of established authorship attribution methods.
