On the Independence of Association Bias and Empirical Fairness in Language Models

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Documents

Fulltext
Submitted manuscript, 862 KB, PDF document

The societal impact of pre-trained language models has prompted researchers to probe them for strong associations between protected attributes and value-loaded terms, from slur to prestigious job titles. Such work is said to probe models for bias or fairness - or such probes 'into representational biases' are said to be 'motivated by fairness' - suggesting an intimate connection between bias and fairness. We provide conceptual clarity by distinguishing between association biases [11] and empirical fairness [56] and show the two can be independent. Our main contribution, however, is showing why this should not come as a surprise. To this end, we first provide a thought experiment, showing how association bias and empirical fairness can be completely orthogonal. Next, we provide empirical evidence that there is no correlation between bias metrics and fairness metrics across the most widely used language models. Finally, we survey the sociological and psychological literature and show how this literature provides ample support for expecting these metrics to be uncorrelated.

Original language	English
Title of host publication	Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023
Publisher	Association for Computing Machinery, Inc.
Publication date	2023
Pages	370-378
ISBN (Electronic)	9781450372527
DOIs	https://doi.org/10.1145/3593013.3594004
Publication status	Published - 2023
Event	6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023 - Chicago, United States Duration: 12 Jun 2023 → 15 Jun 2023

Conference

Conference	6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023
Land	United States
By	Chicago
Periode	12/06/2023 → 15/06/2023

Bibliographical note

Research areas

Fairness, Natural Language Processing, Representational Bias

ID: 381563506

Department of Computer Science