论文标题
两次候选投票选举审计方法的统一评估
A Unified Evaluation of Two-Candidate Ballot-Polling Election Auditing Methods
论文作者
论文摘要
计数票非常复杂且容易出错。已经开发了几种统计方法来通过手动检查随机选择的物理选票来评估选举准确性。两种“原则性”方法是风险限制审核(RLA)和贝叶斯审核(BAS)。 RLA使用BAS基于贝叶斯推断的频繁统计推断。直到最近,两者都被认为是根本上不同的。 我们提出了将“投票” RLAS和BAS(仅需要从所有演员投票卡中随机采样的能力)进行统一和脱落的结果,以进行两次候选多数竞赛,这些竞赛是审核更复杂的社交选择功能的基础,包括一些优先投票系统。我们强调方法之间的连接并探索其性能。 首先,基于先前关于古典方法和贝叶斯方法的数学等效性的演示,我们表明,经过适当校准的BAS是风险限制的。其次,我们比较了各种竞赛大小和利润的方法的效率,重点是达到给定风险限制所需的样本量的分布。第三,我们概述了改善绩效的几种方法,并展示了数学对等如何解释改进的方法。
Counting votes is complex and error-prone. Several statistical methods have been developed to assess election accuracy by manually inspecting randomly selected physical ballots. Two 'principled' methods are risk-limiting audits (RLAs) and Bayesian audits (BAs). RLAs use frequentist statistical inference while BAs are based on Bayesian inference. Until recently, the two have been thought of as fundamentally different. We present results that unify and shed light upon 'ballot-polling' RLAs and BAs (which only require the ability to sample uniformly at random from all cast ballot cards) for two-candidate plurality contests, which are building blocks for auditing more complex social choice functions, including some preferential voting systems. We highlight the connections between the methods and explore their performance. First, building on a previous demonstration of the mathematical equivalence of classical and Bayesian approaches, we show that BAs, suitably calibrated, are risk-limiting. Second, we compare the efficiency of the methods across a wide range of contest sizes and margins, focusing on the distribution of sample sizes required to attain a given risk limit. Third, we outline several ways to improve performance and show how the mathematical equivalence explains the improvements.