利用深度学习模型划分政治党派的两步法

IF 3 2区社会学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Social Science Computer Review Pub Date : 2023-12-06 DOI:10.1177/08944393231219685

Lingshu Hu

{"title":"利用深度学习模型划分政治党派的两步法","authors":"Lingshu Hu","doi":"10.1177/08944393231219685","DOIUrl":null,"url":null,"abstract":"Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.","PeriodicalId":49509,"journal":{"name":"Social Science Computer Review","volume":"32 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2023-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Two-Step Method for Classifying Political Partisanship Using Deep Learning Models\",\"authors\":\"Lingshu Hu\",\"doi\":\"10.1177/08944393231219685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.\",\"PeriodicalId\":49509,\"journal\":{\"name\":\"Social Science Computer Review\",\"volume\":\"32 2\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2023-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Science Computer Review\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1177/08944393231219685\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Science Computer Review","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1177/08944393231219685","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

政治党派关系是一种重要的群体认同，它对个人的投票行为产生重大影响，并塑造了个人的思想和文化观点。虽然传统的调查和实验研究可以通过询问参与者来直接捕捉政治身份，但当使用来自社交媒体的数字跟踪数据时，这项任务变得复杂起来。以往的分类方法试图从用户的网络或文本内容中推断政治身份，其效率和概括性有限。为此，本研究引入了一种利用深度学习模型来提高分类效率、泛化性和可解释性的两步方法。在第一步中，两个深度学习模型接受了来自美国825名国会议员的250万条推文的训练，在基于个人推文检测政客的党派倾向方面，准确率分别达到了87.71%和89.54%。随后，在第二步中，通过使用一个简单的机器学习模型，利用从第一步模型中获得的汇总预测值，基于50条和200条推文识别非政治家用户的政治身份的准确率分别达到了94.92%和96.61%。此外，将注意力机制集成到深度学习模型中，以评估每个单词在分类过程中的贡献。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Two-Step Method for Classifying Political Partisanship Using Deep Learning Models

Political partisanship constitutes a pivotal group identity that significantly influences individuals’ voting behaviors and shapes their ideological and cultural perspectives. While traditional surveys and experimental studies can directly capture political identity by asking the participants, this task has become intricate when employing digital trace data sourced from social media. Previous classification methods, attempting to infer political identity from users’ networks or textual content, suffered from limited efficiency or generalizability. In response, this study introduces a two-step method that utilizes deep learning models to enhance classification efficiency, generalizability, and interpretability. In the first step, two deep learning models, trained on 2.5 million tweets from 825 Congressional politicians in the U.S., achieved accuracy rates of 87.71% and 89.54%, respectively, in detecting politicians’ partisanships based on their individual tweets. Subsequently, in the second step, by employing a simple machine learning model that leverages the aggregated predicted values derived from the first-step models, accuracy rates of 94.92% and 96.61% were attained for identifying non-politician users’ political identities based off their 50 and 200 tweets, respectively. In addition, an attention mechanism was integrated into the deep learning model to assess the contribution of each word in the classification process.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Social Science Computer Review 社会科学-计算机：跨学科应用

CiteScore

9.00

自引率

4.90%

发文量

审稿时长

>12 weeks

期刊介绍： Unique Scope Social Science Computer Review is an interdisciplinary journal covering social science instructional and research applications of computing, as well as societal impacts of informational technology. Topics included: artificial intelligence, business, computational social science theory, computer-assisted survey research, computer-based qualitative analysis, computer simulation, economic modeling, electronic modeling, electronic publishing, geographic information systems, instrumentation and research tools, public administration, social impacts of computing and telecommunications, software evaluation, world-wide web resources for social scientists. Interdisciplinary Nature Because the Uses and impacts of computing are interdisciplinary, so is Social Science Computer Review. The journal is of direct relevance to scholars and scientists in a wide variety of disciplines. In its pages you''ll find work in the following areas: sociology, anthropology, political science, economics, psychology, computer literacy, computer applications, and methodology.