Julio Cesar Valente Ferreira, Thiago Ribeiro Furtado, Rafael Dirques David Regis, Gabriela Rodrigues Diniz, Paula Gonçalves, Vitor Pedro da Silva Castelo Tavares
{"title":"用于人口统计自动分类和配置的动漫聚类","authors":"Julio Cesar Valente Ferreira, Thiago Ribeiro Furtado, Rafael Dirques David Regis, Gabriela Rodrigues Diniz, Paula Gonçalves, Vitor Pedro da Silva Castelo Tavares","doi":"10.7764/cdi.54.53193","DOIUrl":null,"url":null,"abstract":"The cultural industry assumed greater relevance as a productive system and expanded its market share with different forms of reception, transmission, and communication with the public, increasingly using the so called classification and recommendation algorithms and manipulation of mass processed data, which do not require cyber-physical systems for cataloging andconstant feedback from all parties involved for cataloging. In this regard, this paper proposes a methodology to support the classification and creation of corresponding groups, automatically, of cultural productions of certain segments through Robot Process Automation (RPA) techniques, to first extract public data created by fans of certain cultural segments, and Latent Dirichlet Allocation (LDA), for the clustering of these productions based on the data of the terms extracted by RPA. As a case study for this proposal, we specifically observed the anime market, defined as an originally Japanese cultural product with high fan engagement and high annual production scale, supported by data obtained from two public databases data: MyAnimeList and AniDB, built collaboratively by fans. The application of the methodology allowed the automatic classification of anime, grouping them into topics that allow the proposal of a new demography of products of this genre in relation to the current one, providing a greater level of detail and allowing to contemplate the expansion of new themes.","PeriodicalId":44666,"journal":{"name":"Cuadernos Info","volume":"1 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Anime clustering for automatic classification and configuration of demographics\",\"authors\":\"Julio Cesar Valente Ferreira, Thiago Ribeiro Furtado, Rafael Dirques David Regis, Gabriela Rodrigues Diniz, Paula Gonçalves, Vitor Pedro da Silva Castelo Tavares\",\"doi\":\"10.7764/cdi.54.53193\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The cultural industry assumed greater relevance as a productive system and expanded its market share with different forms of reception, transmission, and communication with the public, increasingly using the so called classification and recommendation algorithms and manipulation of mass processed data, which do not require cyber-physical systems for cataloging andconstant feedback from all parties involved for cataloging. In this regard, this paper proposes a methodology to support the classification and creation of corresponding groups, automatically, of cultural productions of certain segments through Robot Process Automation (RPA) techniques, to first extract public data created by fans of certain cultural segments, and Latent Dirichlet Allocation (LDA), for the clustering of these productions based on the data of the terms extracted by RPA. As a case study for this proposal, we specifically observed the anime market, defined as an originally Japanese cultural product with high fan engagement and high annual production scale, supported by data obtained from two public databases data: MyAnimeList and AniDB, built collaboratively by fans. The application of the methodology allowed the automatic classification of anime, grouping them into topics that allow the proposal of a new demography of products of this genre in relation to the current one, providing a greater level of detail and allowing to contemplate the expansion of new themes.\",\"PeriodicalId\":44666,\"journal\":{\"name\":\"Cuadernos Info\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cuadernos Info\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7764/cdi.54.53193\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cuadernos Info","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7764/cdi.54.53193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 1
摘要
文化产业作为一个生产系统具有更大的相关性,并通过不同形式的接收、传播和与公众的交流扩大了其市场份额,越来越多地使用所谓的分类和推荐算法以及对大量处理数据的操作,这些不需要网络物理系统进行编目,也不需要编目各方的持续反馈。为此,本文提出了一种方法,通过机器人过程自动化(Robot Process Automation, RPA)技术,支持特定领域的文化产品自动分类和创建相应的组,首先提取特定文化领域粉丝创建的公共数据,然后基于RPA提取的术语数据,提出了一种潜在狄利克雷分配(Latent Dirichlet Allocation, LDA)方法,用于对这些产品进行聚类。作为本提案的案例研究,我们具体观察了动漫市场,该市场被定义为具有高粉丝参与度和高年生产规模的日本文化产品,并从两个公共数据库数据中获得数据支持:MyAnimeList和AniDB,由粉丝合作建立。该方法的应用允许对动画进行自动分类,将它们分组到主题中,允许提出与当前类型相关的该类型产品的新人口统计,提供更高层次的细节,并允许考虑新主题的扩展。
Anime clustering for automatic classification and configuration of demographics
The cultural industry assumed greater relevance as a productive system and expanded its market share with different forms of reception, transmission, and communication with the public, increasingly using the so called classification and recommendation algorithms and manipulation of mass processed data, which do not require cyber-physical systems for cataloging andconstant feedback from all parties involved for cataloging. In this regard, this paper proposes a methodology to support the classification and creation of corresponding groups, automatically, of cultural productions of certain segments through Robot Process Automation (RPA) techniques, to first extract public data created by fans of certain cultural segments, and Latent Dirichlet Allocation (LDA), for the clustering of these productions based on the data of the terms extracted by RPA. As a case study for this proposal, we specifically observed the anime market, defined as an originally Japanese cultural product with high fan engagement and high annual production scale, supported by data obtained from two public databases data: MyAnimeList and AniDB, built collaboratively by fans. The application of the methodology allowed the automatic classification of anime, grouping them into topics that allow the proposal of a new demography of products of this genre in relation to the current one, providing a greater level of detail and allowing to contemplate the expansion of new themes.