Should We Ban English NLP for a Year?

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Documents

Fulltext
Final published version, 216 KB, PDF document

Søgaard, Anders

Around two thirds of NLP research at top venues is devoted exclusively to developing technology for speakers of English, most speech data comes from young urban speakers, and most texts used to train language models come from male writers. These biases feed into consumer technologies to widen existing inequality gaps, not only within, but also across, societies. Many have argued that it is almost impossible to mitigate inequality amplification. I argue that, on the contrary, it is quite simple to do so, and that counter-measures would have little-to-no negative impact, except for, perhaps, in the very short term.

Original language	English
Title of host publication	Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Publisher	Association for Computational Linguistics
Publication date	2022
Pages	5254-5260
Publication status	Published - 2022
Event	2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022

Conference

Conference	2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Land	United Arab Emirates
By	Abu Dhabi
Periode	07/12/2022 → 11/12/2022

Department of Computer Science