Objective: Existing studies estimate that between 0.3% and 2% of adults in the U.S. (between 900,000 and 2.6 million in 2020) identify as a nonbinary gender or otherwise gender nonconforming. In response to the RDAP 2021 theme of radical change, this article examines the need to change how datasets represent nonbinary persons and how research involving gender data should approach the curation of this data at each stage of the research lifecycle.
Methods: In this article, we examine some of the known challenges of gender inclusion in datasets and summarize some solutions underway. Using a critical lens, we examine the difference between current practice and inclusive practice in gender representation, describing inclusive practices at each stage of the research lifecycle from writing a data management plan to sharing data.
Results: Data structures that limit gender to “male” and “female” or ontological structures that use mapping to collapse gender demographics to binary values exclude nonbinary and gender diverse populations. Some data collection instruments attempt inclusivity by adding the gender category of “other,” but using the “other” gender category labels nonbinary persons as intrinsically alien. Inclusive change must go farther, to move from alienation to inclusive categories. We describe several techniques for inclusively representing gender in data, from the data management planning stage, to collecting data, cleaning data, and sharing data. To facilitate better sharing of gender data, repositories must also allow mapping that includes nonbinary genders explicitly and allow for ontological mapping for long-term representation of diverse gender identities.
Conclusions: A good practice during research design is to consider two levels of critique in the data collection plan. First, consider the research question at hand and remove unnecessary gendering from the data. Secondly, if the research question needs gender, make sure to include nonbinary genders explicitly. Allies must take on this problem without leaving it to those who are most affected by it. Further, more voices calling for inclusionary practices surrounding data rises to a crescendo that cannot be ignored.
gender demographics, nonbinary, transgender, data collection
Disclosures: The content of this article is based upon a lightning talk presentation at RDAP Summit 2021 titled “Do I have to be an “other” to be myself?” available at https://osf.io/4duya.
Gofman A, Leif SA, Gunderman H, Exner N. Do I Have To Be An “Other” To Be Myself? Exploring Gender Diversity In Taxonomy, Data Collection, And Through The Research Data Lifecycle. Journal of eScience Librarianship 2021;10(4): e1219. https://doi.org/10.7191/jeslib.2021.1219. Retrieved from https://escholarship.umassmed.edu/jeslib/vol10/iss4/6
Rights and Permissions
Copyright © 2021 Gofman et al. This is an open access article licensed under the terms of the Creative Commons Attribution License.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Databases and Information Systems Commons, Data Science Commons, Feminist, Gender, and Sexuality Studies Commons, Gender and Sexuality Commons, Gender, Race, Sexuality, and Ethnicity in Communication Commons, Scholarly Publishing Commons