Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Standard

Square One Bias in NLP : Towards a Multi-Dimensional Exploration of the Research Manifold. / Ruder, Sebastian; Vulić, Ivan; Søgaard, Anders.

ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022. ed. / Smaranda Muresan; Preslav Nakov; Aline Villavicencio. Association for Computational Linguistics (ACL), 2022. p. 2340-2354.

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Harvard

Ruder, S, Vulić, I & Søgaard, A 2022, Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold. in S Muresan, P Nakov & A Villavicencio (eds), ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022. Association for Computational Linguistics (ACL), pp. 2340-2354, 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022, Dublin, Ireland, 22/05/2022. https://doi.org/10.18653/v1/2022.findings-acl.184

APA

Ruder, S., Vulić, I., & Søgaard, A. (2022). Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold. In S. Muresan, P. Nakov, & A. Villavicencio (Eds.), ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022 (pp. 2340-2354). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-acl.184

Vancouver

Ruder S, Vulić I, Søgaard A. Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold. In Muresan S, Nakov P, Villavicencio A, editors, ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022. Association for Computational Linguistics (ACL). 2022. p. 2340-2354 https://doi.org/10.18653/v1/2022.findings-acl.184

Author

Ruder, Sebastian ; Vulić, Ivan ; Søgaard, Anders. / Square One Bias in NLP : Towards a Multi-Dimensional Exploration of the Research Manifold. ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022. editor / Smaranda Muresan ; Preslav Nakov ; Aline Villavicencio. Association for Computational Linguistics (ACL), 2022. pp. 2340-2354

Bibtex

@inproceedings{c270389955f24ae895b797f6dc5e2e3e,
title = "Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold",
abstract = "The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. Such one-dimensionality of most research means we are only exploring a fraction of the NLP research search space. We provide historical and recent examples of how the square one bias has led researchers to draw false conclusions or make unwise choices, point to promising yet unexplored directions on the research manifold, and make practical recommendations to enable more multi-dimensional research. We open-source the results of our annotations to enable further analysis.",
author = "Sebastian Ruder and Ivan Vuli{\'c} and Anders S{\o}gaard",
note = "Publisher Copyright: {\textcopyright} 2022 Association for Computational Linguistics.; 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022 ; Conference date: 22-05-2022 Through 27-05-2022",
year = "2022",
doi = "10.18653/v1/2022.findings-acl.184",
language = "English",
pages = "2340--2354",
editor = "Smaranda Muresan and Preslav Nakov and Aline Villavicencio",
booktitle = "ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022",
publisher = "Association for Computational Linguistics (ACL)",
address = "United States",

}

RIS

TY - GEN

T1 - Square One Bias in NLP

T2 - 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022

AU - Ruder, Sebastian

AU - Vulić, Ivan

AU - Søgaard, Anders

N1 - Publisher Copyright: © 2022 Association for Computational Linguistics.

PY - 2022

Y1 - 2022

N2 - The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. Such one-dimensionality of most research means we are only exploring a fraction of the NLP research search space. We provide historical and recent examples of how the square one bias has led researchers to draw false conclusions or make unwise choices, point to promising yet unexplored directions on the research manifold, and make practical recommendations to enable more multi-dimensional research. We open-source the results of our annotations to enable further analysis.

AB - The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. Such one-dimensionality of most research means we are only exploring a fraction of the NLP research search space. We provide historical and recent examples of how the square one bias has led researchers to draw false conclusions or make unwise choices, point to promising yet unexplored directions on the research manifold, and make practical recommendations to enable more multi-dimensional research. We open-source the results of our annotations to enable further analysis.

UR - http://www.scopus.com/inward/record.url?scp=85131943763&partnerID=8YFLogxK

U2 - 10.18653/v1/2022.findings-acl.184

DO - 10.18653/v1/2022.findings-acl.184

M3 - Article in proceedings

AN - SCOPUS:85131943763

SP - 2340

EP - 2354

BT - ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022

A2 - Muresan, Smaranda

A2 - Nakov, Preslav

A2 - Villavicencio, Aline

PB - Association for Computational Linguistics (ACL)

Y2 - 22 May 2022 through 27 May 2022

ER -

ID: 341486380