{"title":"视觉艺术家中的小丑:","authors":"Michael Mandiberg, Danara Sarıoğlu","doi":"10.1086/719999","DOIUrl":null,"url":null,"abstract":"This case study explores the challenges of defining a data set to analyze changes in Wikipedia’s gender gap for articles about visual art. Wikipedia and Wikidata each group content in data structures that produce overlapping and incomplete visual art datasets and include non-visual art data. To circumvent potentially biased editorial decisions about what to include or exclude, this case study describes the process of using a topic model algorithm that identifies a dataset by analyzing the words in each article and grouping the articles into topics.","PeriodicalId":43009,"journal":{"name":"Art Documentation","volume":"241 1","pages":"19 - 37"},"PeriodicalIF":0.2000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clowns in the Visual Artists:\",\"authors\":\"Michael Mandiberg, Danara Sarıoğlu\",\"doi\":\"10.1086/719999\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This case study explores the challenges of defining a data set to analyze changes in Wikipedia’s gender gap for articles about visual art. Wikipedia and Wikidata each group content in data structures that produce overlapping and incomplete visual art datasets and include non-visual art data. To circumvent potentially biased editorial decisions about what to include or exclude, this case study describes the process of using a topic model algorithm that identifies a dataset by analyzing the words in each article and grouping the articles into topics.\",\"PeriodicalId\":43009,\"journal\":{\"name\":\"Art Documentation\",\"volume\":\"241 1\",\"pages\":\"19 - 37\"},\"PeriodicalIF\":0.2000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Art Documentation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1086/719999\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"ART\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Art Documentation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1086/719999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"ART","Score":null,"Total":0}
This case study explores the challenges of defining a data set to analyze changes in Wikipedia’s gender gap for articles about visual art. Wikipedia and Wikidata each group content in data structures that produce overlapping and incomplete visual art datasets and include non-visual art data. To circumvent potentially biased editorial decisions about what to include or exclude, this case study describes the process of using a topic model algorithm that identifies a dataset by analyzing the words in each article and grouping the articles into topics.