论文标题
单词嵌入中的区域负偏见预测种族动画 - 但仅通过名称频率
Regional Negative Bias in Word Embeddings Predicts Racial Animus--but only via Name Frequency
论文作者
论文摘要
嵌入协会测试(WEAT)一词是衡量针对社会群体(例如大型文本语料库中的少数民族)语言偏见的重要方法。它通过比较群体的单词的语义相关性(例如,这些群体独有的名称)和属性单词(例如'leaste''和'不愉快的单词)来做到这一点。我们表明,在大都市统计领域水平上,来自地理标签的社交媒体数据的反黑色weat估计与种族歧视的几种措施密切相关 - 即使在控制社会人口统计学协变量时也是如此。但是,我们还表明,这些相关性中的每一个都由第三个变量解释:基础语料库中黑名的频率相对于白名称。之所以发生这种情况,是因为单词嵌入在估计的语义空间中倾向于将正(负)单词和频繁(稀有)单词分组在一起。由于社交媒体上的黑名字的频率与黑人美国人在人口中的流行密切相关,因此无论几乎没有黑人美国人居住的地方,这都会导致虚假的反黑人weat估计。这表明使用WEAT测量偏见的研究应考虑术语频率,还证明了使用诸如词嵌入(嵌入单词嵌入)研究人类认知和行为的黑框模型的潜在后果。
The word embedding association test (WEAT) is an important method for measuring linguistic biases against social groups such as ethnic minorities in large text corpora. It does so by comparing the semantic relatedness of words prototypical of the groups (e.g., names unique to those groups) and attribute words (e.g., 'pleasant' and 'unpleasant' words). We show that anti-black WEAT estimates from geo-tagged social media data at the level of metropolitan statistical areas strongly correlate with several measures of racial animus--even when controlling for sociodemographic covariates. However, we also show that every one of these correlations is explained by a third variable: the frequency of Black names in the underlying corpora relative to White names. This occurs because word embeddings tend to group positive (negative) words and frequent (rare) words together in the estimated semantic space. As the frequency of Black names on social media is strongly correlated with Black Americans' prevalence in the population, this results in spurious anti-Black WEAT estimates wherever few Black Americans live. This suggests that research using the WEAT to measure bias should consider term frequency, and also demonstrates the potential consequences of using black-box models like word embeddings to study human cognition and behavior.