Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

Technological University Dublin

Datasets

Series

Machine learning

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Dataset For Gendered Language, Shweta Soundararajan Jan 2023

Dataset For Gendered Language, Shweta Soundararajan

Datasets

Gendered language is the use of words that denote an individual’s gender. This can be explicit where the gender is evident in the actual word used, e.g. mother, she, man, but it can also be implicit where social roles or behaviours can signal an individual’s gender - for example, expectations that women display communal traits (e.g., affectionate, caring, gentle) and men display agentic traits (e.g., assertive, competitive, decisive). The use of gendered language in NLP systems can perpetuate gender stereotypes and bias. This paper proposes an approach to generating gendered language datasets using ChatGPT which will provide data for data-driven …