{"title":"Concept Class Analysis: A Method for Identifying Cultural Schemas in Texts","authors":"Marshall A. Taylor, Dustin S. Stoltz","doi":"10.15195/v7.a23","DOIUrl":null,"url":null,"abstract":"Recent methodological work at the intersection of culture, cognition, and computational methods has drawn attention to how cultural schemas can be 'recovered' from social survey data. Defining cultural schemas as slowly learned, implicit, and unevenly distributed relational memory structures, researchers show how schemas—or rather, the downstream consequences of people drawing upon them—can be operationalized and measured from domain-specific survey modules. Respondents can then be sorted into 'classes' on the basis of the schema to which their survey response patterns best align. In this article, we extend this 'schematic class analysis' method to text data. We introduce concept class analysis (CoCA): a hybrid model that combines word embeddings and correlational class analysis to group documents across a corpus by the similarity of schemas recovered from them. We introduce the CoCA model, illustrate its validity and utility using simulations, and conclude with considerations for future research and applications.","PeriodicalId":22029,"journal":{"name":"Sociological Science","volume":"97 3","pages":"544-569"},"PeriodicalIF":2.7000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sociological Science","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.15195/v7.a23","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIOLOGY","Score":null,"Total":0}
引用次数: 16
Abstract
Recent methodological work at the intersection of culture, cognition, and computational methods has drawn attention to how cultural schemas can be 'recovered' from social survey data. Defining cultural schemas as slowly learned, implicit, and unevenly distributed relational memory structures, researchers show how schemas—or rather, the downstream consequences of people drawing upon them—can be operationalized and measured from domain-specific survey modules. Respondents can then be sorted into 'classes' on the basis of the schema to which their survey response patterns best align. In this article, we extend this 'schematic class analysis' method to text data. We introduce concept class analysis (CoCA): a hybrid model that combines word embeddings and correlational class analysis to group documents across a corpus by the similarity of schemas recovered from them. We introduce the CoCA model, illustrate its validity and utility using simulations, and conclude with considerations for future research and applications.
期刊介绍:
Sociological Science is an open-access, online, peer-reviewed, international journal for social scientists committed to advancing a general understanding of social processes. Sociological Science welcomes original research and commentary from all subfields of sociology, and does not privilege any particular theoretical or methodological approach.