Manuel Goyanes, Luis de-Marcos, Adrián Domínguez-Díaz
{"title":"自动性别检测:利用 ChatGPT 和性别 API 从姓名中计算推断性别的方法程序和建议","authors":"Manuel Goyanes, Luis de-Marcos, Adrián Domínguez-Díaz","doi":"10.1007/s11192-024-05149-2","DOIUrl":null,"url":null,"abstract":"<p>Both computational social scientists and scientometric scholars alike, interested in gender-related research questions, need to classify the gender of observations. However, in most public and private databases, this information is typically unavailable, making it difficult to design studies aimed at understanding the role of gender in influencing citizens’ perceptions, attitudes, and behaviors. Against this backdrop, it is essential to design methodological procedures to infer the gender automatically and computationally from data already provided, thus facilitating the exploration and examination of gender-related research questions or hypotheses. Researchers can use automatic gender detection tools like Namsor or Gender-API, which are already on the market. However, recent developments in conversational bots offer a new, still relatively underexplored, alternative. This study offers a step-by-step research guide, with relevant examples and detailed clarifications, to automatically classify the gender from names through ChatGPT and two partially free gender detection tool (Namsor and Gender-API). In addition, the study provides methodological suggestions and recommendations on how to gather, interpret, and report results coming from both platforms. The study methodologically contributes to the scientometric literature by describing an easy-to-execute methodological procedure that enables the computational codification of gender from names. This procedure could be implemented by scholars without advanced computing skills.</p>","PeriodicalId":21755,"journal":{"name":"Scientometrics","volume":"100 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic gender detection: a methodological procedure and recommendations to computationally infer the gender from names with ChatGPT and gender APIs\",\"authors\":\"Manuel Goyanes, Luis de-Marcos, Adrián Domínguez-Díaz\",\"doi\":\"10.1007/s11192-024-05149-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Both computational social scientists and scientometric scholars alike, interested in gender-related research questions, need to classify the gender of observations. However, in most public and private databases, this information is typically unavailable, making it difficult to design studies aimed at understanding the role of gender in influencing citizens’ perceptions, attitudes, and behaviors. Against this backdrop, it is essential to design methodological procedures to infer the gender automatically and computationally from data already provided, thus facilitating the exploration and examination of gender-related research questions or hypotheses. Researchers can use automatic gender detection tools like Namsor or Gender-API, which are already on the market. However, recent developments in conversational bots offer a new, still relatively underexplored, alternative. This study offers a step-by-step research guide, with relevant examples and detailed clarifications, to automatically classify the gender from names through ChatGPT and two partially free gender detection tool (Namsor and Gender-API). In addition, the study provides methodological suggestions and recommendations on how to gather, interpret, and report results coming from both platforms. The study methodologically contributes to the scientometric literature by describing an easy-to-execute methodological procedure that enables the computational codification of gender from names. This procedure could be implemented by scholars without advanced computing skills.</p>\",\"PeriodicalId\":21755,\"journal\":{\"name\":\"Scientometrics\",\"volume\":\"100 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientometrics\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1007/s11192-024-05149-2\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientometrics","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1007/s11192-024-05149-2","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Automatic gender detection: a methodological procedure and recommendations to computationally infer the gender from names with ChatGPT and gender APIs
Both computational social scientists and scientometric scholars alike, interested in gender-related research questions, need to classify the gender of observations. However, in most public and private databases, this information is typically unavailable, making it difficult to design studies aimed at understanding the role of gender in influencing citizens’ perceptions, attitudes, and behaviors. Against this backdrop, it is essential to design methodological procedures to infer the gender automatically and computationally from data already provided, thus facilitating the exploration and examination of gender-related research questions or hypotheses. Researchers can use automatic gender detection tools like Namsor or Gender-API, which are already on the market. However, recent developments in conversational bots offer a new, still relatively underexplored, alternative. This study offers a step-by-step research guide, with relevant examples and detailed clarifications, to automatically classify the gender from names through ChatGPT and two partially free gender detection tool (Namsor and Gender-API). In addition, the study provides methodological suggestions and recommendations on how to gather, interpret, and report results coming from both platforms. The study methodologically contributes to the scientometric literature by describing an easy-to-execute methodological procedure that enables the computational codification of gender from names. This procedure could be implemented by scholars without advanced computing skills.
期刊介绍:
Scientometrics aims at publishing original studies, short communications, preliminary reports, review papers, letters to the editor and book reviews on scientometrics. The topics covered are results of research concerned with the quantitative features and characteristics of science. Emphasis is placed on investigations in which the development and mechanism of science are studied by means of (statistical) mathematical methods.
The Journal also provides the reader with important up-to-date information about international meetings and events in scientometrics and related fields. Appropriate bibliographic compilations are published as a separate section. Due to its fully interdisciplinary character, Scientometrics is indispensable to research workers and research administrators throughout the world. It provides valuable assistance to librarians and documentalists in central scientific agencies, ministries, research institutes and laboratories.
Scientometrics includes the Journal of Research Communication Studies. Consequently its aims and scope cover that of the latter, namely, to bring the results of research investigations together in one place, in such a form that they will be of use not only to the investigators themselves but also to the entrepreneurs and research workers who form the object of these studies.