{"title":"Speaking Volumes: Introducing the UNGA Speech Corpus","authors":"Linnea R Turco","doi":"10.1093/isq/sqae001","DOIUrl":null,"url":null,"abstract":"Many theoretical conclusions core to the study of international politics rely on having access to, and understanding, the rhetoric of international actors. One important development in advancing the empirical study of international relations (IR) theory, therefore, is the availability of machine-analyzable speech data. A collection of fine-grained textual representations of states’ speeches in the context of an important international organization, such as the United Nations General Assembly (UNGA), is needed to understand the ideas, preferences, and values that states put forth in their statements. An especially promising use for such a corpus of texts would be to measure states’ preferences based on their statements. In an effort to add to the burgeoning field of text-as-data in IR, I present the UNGA Speech Corpus, a collection of over 34,000 speeches delivered by states in the UNGA from 1993 to 2018. I use it to improve on recent work that links text to preferences in IR by combining a structural topic model with locally trained word embeddings to estimate the policy positions of states on specific topics. I then show how these “topic scores” can help scholars to improve their analyses of exigent international issues, such as global climate change governance.","PeriodicalId":48313,"journal":{"name":"International Studies Quarterly","volume":"14 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Studies Quarterly","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1093/isq/sqae001","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INTERNATIONAL RELATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Many theoretical conclusions core to the study of international politics rely on having access to, and understanding, the rhetoric of international actors. One important development in advancing the empirical study of international relations (IR) theory, therefore, is the availability of machine-analyzable speech data. A collection of fine-grained textual representations of states’ speeches in the context of an important international organization, such as the United Nations General Assembly (UNGA), is needed to understand the ideas, preferences, and values that states put forth in their statements. An especially promising use for such a corpus of texts would be to measure states’ preferences based on their statements. In an effort to add to the burgeoning field of text-as-data in IR, I present the UNGA Speech Corpus, a collection of over 34,000 speeches delivered by states in the UNGA from 1993 to 2018. I use it to improve on recent work that links text to preferences in IR by combining a structural topic model with locally trained word embeddings to estimate the policy positions of states on specific topics. I then show how these “topic scores” can help scholars to improve their analyses of exigent international issues, such as global climate change governance.
期刊介绍:
International Studies Quarterly, the official journal of the International Studies Association, seeks to acquaint a broad audience of readers with the best work being done in the variety of intellectual traditions included under the rubric of international studies. Therefore, the editors welcome all submissions addressing this community"s theoretical, empirical, and normative concerns. First preference will continue to be given to articles that address and contribute to important disciplinary and interdisciplinary questions and controversies.