{"title":"数据实践中的系统性种族主义","authors":"T. Watkins, J. Cain","doi":"10.29173/iq1079","DOIUrl":null,"url":null,"abstract":"Positionality statement \nAs we begin to discuss this issue, its origins, and its importance in contemporary society, I wanted to acknowledge my positionality and the role that it may play in the formation of this issue. Jonathan O. Cain is an African-American male working in the LIS field. Before moving into administration, I taught data and digital literacy and worked on developing programs that focused on improving access to these critical skills at zero cost to learners.\nIt is important to acknowledge my positionality and the lens through which I see the data science field. Trevor Watkins is an African American male working in the LIS field at an academic institution in an academic library. I teach critical data literacy workshops and engage in diversity and BIPOC-related digital projects with faculty, students, and the broader academic community across the country. I am also a researcher and practitioner in artificial intelligence (AI) and data science.\nThe global pandemic, its impacts, and why it matters\nWe first met in August 2020 to discuss the possibilities of this special issue about five months into the pandemic. We spent a good chunk of that meeting getting to know each other and, most importantly, discussed the toll the pandemic placed on our communities and us. It is probably safe to say that many of you, at some point, were uncertain of the future. Like most people worldwide, we lost family and friends or knew of people who succumbed to Covid-19 and other illnesses that weren't treated because the focus shifted to Covid-19. We get it. At one point, Covid-19 killed over three thousand people per day (Centers for Disease Control and Prevention (CDC), 2022). According to data from the CDC, 90% of the 385,676 people who died between March and December 2020 had Covid-19 listed as the underlying cause of death on their death certificate. The murders of Ahmaud Arbery in February, Breonna Taylor in March, and George Floyd in May 2020 sparked civic unrest across the United States (US) and protests across the globe in solidarity against racial injustice. When we announced this special issue and initiated a call for papers, we didn't get much of a response initially. We expected and acknowledged that it would probably take some time before we received inquiries or proposals about the issue, the intent to submit, or any submissions.\nLike many of you, we are still picking up the pieces from 2020 and dealing with the aftermath of Covid-19. The pandemic may be over now, depending on whom you ask, but the emotional scars are still there and may remain so for quite some time. Patience was the one quality we all had throughout this process, which is why we can present this publication today.\nData and liberatory technology\nLiberatory technology. This is a concept that invited contemplation as we sat down to record our reflections on this special issue. In drawing together scholars, educators, and practitioners to address the issue of data and its relationship to race, ethnicity, and representation, we, as coeditors, were making a statement about the importance of data, the material impact that this seemingly abstract and ethereal object can and does have on individual and community lives. And thinking about that impact brought liberatory technology to the front of our minds. The definition of liberator technology offered by the IDA B. Wells Just Data Lab intrigues us and invites us to grapple with that topic. They defined liberatory as something that \"supports the increased freedom and wellbeing of marginalized people, especially black people outside of capitalism and settler colonial power structures\" and technology as \"a tool used to accomplish a task.\" And as we contemplate this set of definitions, we are left to question whether data can be a liberatory technology or not. (LIBERATORY TECHNOLOGY AND DIGITAL MARRONAGE, n.d.)\nIn Liberation Technology: Black Protest in the Age of Franklin, Richard S. Newman draws parallels with the asserting ownership and mastery of new communication technologies and black liberation activities. Reflecting on the transformative nature of print technology, he writes, \"If the Marquis de Condorcet was right in 1793 that print had unshackled Europe from medieval modes of thought and action, then it is also true that print was perhaps the first technology to liberate blacks from the servile images that had long haunted their existence in Western culture.\" And draws a 19th-century example of how it expressly connects to black lives post-emancipation noting \"W. E. B. Du Bois certainly thought that black history and print history worked in tandem. Wherever one found newspapers in the post-Civil War South, he observed, one found some form of black freedom\" (Richard S. Newman, 2009, p. 175). He even notes how scholars note that black activists embraced other communication technologies like photography \"to reshape the image of African Americans in nineteenth-century culture.\" (Richard S. Newman, 2009, p. 175)\nWe have no shortage of examples of how data and data-driven technologies fail to support the \"increased freedom and wellbeing of marginalized people outside of capitalism and settler colonial power structures.\" In 2016, ProPublica published Machine Bias, a report that looks at Risk assessment technologies used in arraignment and sentencing. They report that \"The formula was particularly likely to falsely flag black defendants as future, wrongly labeling them this way at almost twice the rate as white defendants\" and \"white defendants were mislabeled as low risk more often than black defendants\" (Julia Angwin, 2016). A 2021 article, Fairness in Criminal Justice Risk Assessments: The State of the Art, in their analysis, noted, \"The false negative rate is much higher for whites so that violent white offenders are more likely than violent black offenders to be incorrectly classified as nonviolent. The false positive rate is much higher for blacks so that nonviolent black offenders are more likely than nonviolent white offenders to be incorrectly classified as violent. Both error rates mistakenly inflate the relative representation of blacks predicted to be violent. Such differences can support claims of racial injustice. In this application, the trade-off between two different kinds of fairness has real bite.\" (Berk et al., 2021, p. 33)\nThese are just a few examples of how these technological developments, on their own merits, fail to meet the definition offered by the authors of the \"Liberatory Technology and Digital Marronage\" Zine from the Ida B. Wells Just Data Labs. Reflecting on the technological path illustrated by Newman, the work of ownership and mastery of the tool provides the potential for it to be liberatory. Through this lens, the work of the Just Data Lab is exemplary for this meditation; it draws a direct line from technology, education, mastery, and liberatory technology.\nData in higher education\nData literacy education is an area that has been a focus of our careers in librarianship. It's a space where we saw the libraries' ability to make a meaningful impact. Data has had a tremendous impact on college campuses, from how research is conducted to the pressures colleges feel from stakeholder groups: students, governments, funders, donors, and employers to prepare students with the data and technology skills to gain employment in the knowledge economy.\nAs colleges and universities have turned (with varying degrees of success) to meet the needs of these communities, a myriad of explorations on the importance of the representation of these marginalized communities in these systems—to combat and dismantle the harmful practices that we see embedded in the systems that drive society and the potentially debilitating consequences they produce. That is partly why the works in this special issue are so important at this moment in time. These scholars and scholar-practitioners are engaging with these issues that drive the opaque structures surrounding us. And hopefully, their work can give us another perspective on how to engage with these structures and transform them to support liberatory practices.\nThe entries in this issue\nWe have some fantastic articles for you to read in this issue. We open with an article by Kevin Manuel, Rosa Orlandini, and Alexandra Cooper, who discuss how the collection process of racial, ethnic, and indigenous data has evolved in the Canadian Census since 1871, the erasure of minorities and indigenous citizens from those censuses, and the work to restore and accurately identify and categorize racialized groups.\nIn the next article, Leigh Phan, Stephanie Labou, Erin Foster, and Ibraheem Ali present a model for data ethics instruction for non-experts by designing and implementing two data ethics workshops. They make important points about the failure of academia to incorporate the ethical use of data in course curriculums and digital literacy training and demonstrate how academic libraries have become an essential resource for the academic community. Their workshop structure can be modeled for any academic library that endeavors to provide a similar service to its community.\nIn the third article, Natasha Johnson, Megan Sapp Nelson, and Katherine Yngve, interrogate the collective and local purposes of institutional data collection and its impact on student belongingness and propose a framework based on data feminism that centers the student as a person rather than a commodity.\nFinally, our closing article from Thema Monroe-White focuses on marginalized and underrepresented people in the data science field. The author proposes that racially relevant and responsive teaching is necessary to recruit more people from these groups and diversify the field. She discusses how the Ladson-Billings model of cultural relevant pedagogy has been applied and is beneficial to STEM curriculums, and how a liberatory data science curriculum could promote a student's voice and se","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Systemic racism in data practices\",\"authors\":\"T. Watkins, J. Cain\",\"doi\":\"10.29173/iq1079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Positionality statement \\nAs we begin to discuss this issue, its origins, and its importance in contemporary society, I wanted to acknowledge my positionality and the role that it may play in the formation of this issue. Jonathan O. Cain is an African-American male working in the LIS field. Before moving into administration, I taught data and digital literacy and worked on developing programs that focused on improving access to these critical skills at zero cost to learners.\\nIt is important to acknowledge my positionality and the lens through which I see the data science field. Trevor Watkins is an African American male working in the LIS field at an academic institution in an academic library. I teach critical data literacy workshops and engage in diversity and BIPOC-related digital projects with faculty, students, and the broader academic community across the country. I am also a researcher and practitioner in artificial intelligence (AI) and data science.\\nThe global pandemic, its impacts, and why it matters\\nWe first met in August 2020 to discuss the possibilities of this special issue about five months into the pandemic. We spent a good chunk of that meeting getting to know each other and, most importantly, discussed the toll the pandemic placed on our communities and us. It is probably safe to say that many of you, at some point, were uncertain of the future. Like most people worldwide, we lost family and friends or knew of people who succumbed to Covid-19 and other illnesses that weren't treated because the focus shifted to Covid-19. We get it. At one point, Covid-19 killed over three thousand people per day (Centers for Disease Control and Prevention (CDC), 2022). According to data from the CDC, 90% of the 385,676 people who died between March and December 2020 had Covid-19 listed as the underlying cause of death on their death certificate. The murders of Ahmaud Arbery in February, Breonna Taylor in March, and George Floyd in May 2020 sparked civic unrest across the United States (US) and protests across the globe in solidarity against racial injustice. When we announced this special issue and initiated a call for papers, we didn't get much of a response initially. We expected and acknowledged that it would probably take some time before we received inquiries or proposals about the issue, the intent to submit, or any submissions.\\nLike many of you, we are still picking up the pieces from 2020 and dealing with the aftermath of Covid-19. The pandemic may be over now, depending on whom you ask, but the emotional scars are still there and may remain so for quite some time. Patience was the one quality we all had throughout this process, which is why we can present this publication today.\\nData and liberatory technology\\nLiberatory technology. This is a concept that invited contemplation as we sat down to record our reflections on this special issue. In drawing together scholars, educators, and practitioners to address the issue of data and its relationship to race, ethnicity, and representation, we, as coeditors, were making a statement about the importance of data, the material impact that this seemingly abstract and ethereal object can and does have on individual and community lives. And thinking about that impact brought liberatory technology to the front of our minds. The definition of liberator technology offered by the IDA B. Wells Just Data Lab intrigues us and invites us to grapple with that topic. They defined liberatory as something that \\\"supports the increased freedom and wellbeing of marginalized people, especially black people outside of capitalism and settler colonial power structures\\\" and technology as \\\"a tool used to accomplish a task.\\\" And as we contemplate this set of definitions, we are left to question whether data can be a liberatory technology or not. (LIBERATORY TECHNOLOGY AND DIGITAL MARRONAGE, n.d.)\\nIn Liberation Technology: Black Protest in the Age of Franklin, Richard S. Newman draws parallels with the asserting ownership and mastery of new communication technologies and black liberation activities. Reflecting on the transformative nature of print technology, he writes, \\\"If the Marquis de Condorcet was right in 1793 that print had unshackled Europe from medieval modes of thought and action, then it is also true that print was perhaps the first technology to liberate blacks from the servile images that had long haunted their existence in Western culture.\\\" And draws a 19th-century example of how it expressly connects to black lives post-emancipation noting \\\"W. E. B. Du Bois certainly thought that black history and print history worked in tandem. Wherever one found newspapers in the post-Civil War South, he observed, one found some form of black freedom\\\" (Richard S. Newman, 2009, p. 175). He even notes how scholars note that black activists embraced other communication technologies like photography \\\"to reshape the image of African Americans in nineteenth-century culture.\\\" (Richard S. Newman, 2009, p. 175)\\nWe have no shortage of examples of how data and data-driven technologies fail to support the \\\"increased freedom and wellbeing of marginalized people outside of capitalism and settler colonial power structures.\\\" In 2016, ProPublica published Machine Bias, a report that looks at Risk assessment technologies used in arraignment and sentencing. They report that \\\"The formula was particularly likely to falsely flag black defendants as future, wrongly labeling them this way at almost twice the rate as white defendants\\\" and \\\"white defendants were mislabeled as low risk more often than black defendants\\\" (Julia Angwin, 2016). A 2021 article, Fairness in Criminal Justice Risk Assessments: The State of the Art, in their analysis, noted, \\\"The false negative rate is much higher for whites so that violent white offenders are more likely than violent black offenders to be incorrectly classified as nonviolent. The false positive rate is much higher for blacks so that nonviolent black offenders are more likely than nonviolent white offenders to be incorrectly classified as violent. Both error rates mistakenly inflate the relative representation of blacks predicted to be violent. Such differences can support claims of racial injustice. In this application, the trade-off between two different kinds of fairness has real bite.\\\" (Berk et al., 2021, p. 33)\\nThese are just a few examples of how these technological developments, on their own merits, fail to meet the definition offered by the authors of the \\\"Liberatory Technology and Digital Marronage\\\" Zine from the Ida B. Wells Just Data Labs. Reflecting on the technological path illustrated by Newman, the work of ownership and mastery of the tool provides the potential for it to be liberatory. Through this lens, the work of the Just Data Lab is exemplary for this meditation; it draws a direct line from technology, education, mastery, and liberatory technology.\\nData in higher education\\nData literacy education is an area that has been a focus of our careers in librarianship. It's a space where we saw the libraries' ability to make a meaningful impact. Data has had a tremendous impact on college campuses, from how research is conducted to the pressures colleges feel from stakeholder groups: students, governments, funders, donors, and employers to prepare students with the data and technology skills to gain employment in the knowledge economy.\\nAs colleges and universities have turned (with varying degrees of success) to meet the needs of these communities, a myriad of explorations on the importance of the representation of these marginalized communities in these systems—to combat and dismantle the harmful practices that we see embedded in the systems that drive society and the potentially debilitating consequences they produce. That is partly why the works in this special issue are so important at this moment in time. These scholars and scholar-practitioners are engaging with these issues that drive the opaque structures surrounding us. And hopefully, their work can give us another perspective on how to engage with these structures and transform them to support liberatory practices.\\nThe entries in this issue\\nWe have some fantastic articles for you to read in this issue. We open with an article by Kevin Manuel, Rosa Orlandini, and Alexandra Cooper, who discuss how the collection process of racial, ethnic, and indigenous data has evolved in the Canadian Census since 1871, the erasure of minorities and indigenous citizens from those censuses, and the work to restore and accurately identify and categorize racialized groups.\\nIn the next article, Leigh Phan, Stephanie Labou, Erin Foster, and Ibraheem Ali present a model for data ethics instruction for non-experts by designing and implementing two data ethics workshops. They make important points about the failure of academia to incorporate the ethical use of data in course curriculums and digital literacy training and demonstrate how academic libraries have become an essential resource for the academic community. Their workshop structure can be modeled for any academic library that endeavors to provide a similar service to its community.\\nIn the third article, Natasha Johnson, Megan Sapp Nelson, and Katherine Yngve, interrogate the collective and local purposes of institutional data collection and its impact on student belongingness and propose a framework based on data feminism that centers the student as a person rather than a commodity.\\nFinally, our closing article from Thema Monroe-White focuses on marginalized and underrepresented people in the data science field. The author proposes that racially relevant and responsive teaching is necessary to recruit more people from these groups and diversify the field. She discusses how the Ladson-Billings model of cultural relevant pedagogy has been applied and is beneficial to STEM curriculums, and how a liberatory data science curriculum could promote a student's voice and se\",\"PeriodicalId\":84870,\"journal\":{\"name\":\"IASSIST quarterly\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IASSIST quarterly\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29173/iq1079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IASSIST quarterly","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29173/iq1079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
立场声明当我们开始讨论这个问题、它的起源及其在当代社会中的重要性时,我想承认我的立场及其在这个问题的形成中可能发挥的作用。Jonathan O.Cain是一名在LIS领域工作的非裔美国男性。在进入行政部门之前,我教授数据和数字素养,并致力于开发项目,重点是以零成本提高学习者获得这些关键技能的机会。重要的是要承认我的立场和我看待数据科学领域的视角。Trevor Watkins是一名非裔美国男性,在一家学术图书馆的学术机构从事LIS领域的工作。我教授关键的数据素养研讨会,并与全国各地的教职员工、学生和更广泛的学术界一起参与多样性和BIPOC相关的数字项目。我还是人工智能(AI)和数据科学的研究员和实践者。全球疫情、其影响以及为什么重要我们于2020年8月首次会面,讨论在疫情爆发约五个月后发行这期特刊的可能性。在那次会议上,我们花了很大一部分时间相互了解,最重要的是,讨论了疫情给我们的社区和我们带来的损失。可以肯定地说,你们中的许多人在某个时候对未来感到不确定。像世界上大多数人一样,我们失去了家人和朋友,或者知道有人死于新冠肺炎和其他没有得到治疗的疾病,因为重点转移到了新冠肺炎。我们明白了。有一次,新冠肺炎每天导致3000多人死亡(美国疾病控制与预防中心,2022年)。根据美国疾病控制与预防中心的数据,在2020年3月至12月期间死亡的385676人中,90%的人的死亡证明中已将新冠肺炎列为潜在死亡原因。2020年2月Ahmaud Arbery、3月Breonna Taylor和5月George Floyd的谋杀案在美国引发了内乱,并在全球范围内引发了声援种族不公正的抗议活动。当我们宣布这期特刊并发起论文征集时,最初并没有得到太多回应。我们预计并承认,我们可能需要一段时间才能收到有关该问题、提交意向或任何提交材料的询问或建议。和你们中的许多人一样,我们仍在收拾2020年的残局,处理新冠肺炎的后果。疫情现在可能已经结束,这取决于你问谁,但情感创伤仍然存在,而且可能会持续很长一段时间。在整个过程中,耐心是我们所有人的一种品质,这就是为什么我们今天能够发表这份出版物。数据和解放技术解放技术。当我们坐下来记录我们对这个特刊的思考时,这个概念引起了沉思。在召集学者、教育工作者和从业者来解决数据及其与种族、族裔和代表性的关系问题时,我们作为合著者,就数据的重要性以及这个看似抽象和空灵的物体能够而且确实对个人和社区生活产生的物质影响发表了声明。思考这种影响将解放性技术带到了我们的脑海中。IDA B.Wells Just Data Lab提供的解放者技术的定义吸引了我们,并邀请我们讨论这个话题。他们将解放定义为“支持边缘化人群,特别是资本主义和定居者殖民权力结构之外的黑人,增加自由和福祉”,将技术定义为“用于完成任务的工具”。当我们思考这组定义时,我们不得不质疑数据是否是一种解放技术。(《解放技术与数字婚姻》,n.d.)在《解放技术:富兰克林时代的黑人抗议》一书中,理查德·S·纽曼将其与新通信技术和黑人解放活动的所有权和掌握权相提并论。在反思印刷技术的变革性时,他写道:“如果1793年孔多塞侯爵认为印刷将欧洲从中世纪的思想和行动模式中解放出来是正确的,那么印刷也许是第一种将黑人从长期困扰他们的西方文化中的卑躬屈膝的形象中解放出来的技术。并引用了一个19世纪的例子,说明它如何明确地与解放后的黑人生活联系在一起。杜波依斯当然认为黑人历史和印刷历史是相辅相成的。他观察到,在南北战争后的南方,无论在哪里找到报纸,都能找到某种形式的黑人自由”(Richard S.Newman,2009,第175页)。他甚至注意到,学者们注意到,黑人活动家接受了摄影等其他传播技术,“以重塑19世纪文化中非裔美国人的形象”。”(理查德S。 Newman,2009,第175页)我们不乏数据和数据驱动技术未能支持“资本主义和定居者殖民权力结构之外边缘化人群的自由和福祉增加”的例子。2016年,ProPublica发表了一份关于传讯和判刑中使用的风险评估技术的报告《机器偏见》。他们报告称,“该公式特别有可能错误地将黑人被告标记为未来,错误地将他们标记为白人被告的比率几乎是白人被告的两倍”,“白人被告比黑人被告更经常被错误地标记为低风险”(Julia Angwin,2016)。2021年的一篇文章《刑事司法风险评估中的公平:最新技术》在其分析中指出,“白人的假阴性率要高得多,因此暴力的白人罪犯比暴力的黑人罪犯更有可能被错误地归类为非暴力罪犯。黑人的假阳性率要高得多。因此非暴力的黑人犯罪者比非暴力的白人犯罪者更有可能被不正确地归类为暴力罪犯黑人预言会有暴力倾向。这种差异可以支持种族不公正的说法。在这个应用程序中,两种不同类型的公平性之间的权衡具有实际意义。“(Berk等人,2021,第33页)这些只是这些技术发展如何因其自身优点而未能达到Ida B。Wells Just数据实验室。反思纽曼所描绘的技术道路,拥有和掌握工具的工作为其解放提供了潜力。通过这个镜头,正义数据实验室的工作是这种冥想的典范;它与技术、教育、掌握和解放技术有着直接的联系。高等教育中的数据数据素养教育是我们图书馆事业中关注的一个领域。在这个空间里,我们看到了图书馆产生有意义影响的能力。数据对大学校园产生了巨大影响,从研究的进行方式到大学感受到的来自利益相关者群体的压力:学生、政府、资助者、捐赠者和雇主,让学生掌握数据和技术技能,在知识经济中就业。随着学院和大学转向满足这些社区的需求(取得了不同程度的成功),对这些边缘化社区在这些系统中的代表性的重要性进行了无数次探索,以打击和消除我们所看到的嵌入驱动社会的系统中的有害做法及其产生的潜在削弱性后果。这就是为什么本期特刊中的作品在这个时刻如此重要的部分原因。这些学者和学者从业者正在参与这些推动我们周围不透明结构的问题。希望他们的工作能给我们提供另一个视角,让我们了解如何参与这些结构,并将其转化为支持解放实践。本期的文章我们有一些精彩的文章供您阅读。我们以Kevin Manuel、Rosa Orlandini和Alexandra Cooper的一篇文章开场,他们讨论了自1871年以来加拿大人口普查中种族、族裔和土著数据的收集过程是如何演变的,从人口普查中删除少数民族和土著公民,以及恢复和准确识别和分类种族化群体的工作。在下一篇文章中,Leigh Phan、Stephanie Labou、Erin Foster和Ibraheem Ali通过设计和实施两个数据伦理研讨会,为非专家提供了一个数据伦理指导模型。他们对学术界未能将数据的道德使用纳入课程和数字素养培训提出了重要观点,并展示了学术图书馆如何成为学术界的重要资源。他们的工作室结构可以为任何试图为其社区提供类似服务的学术图书馆建模。在第三篇文章中,Natasha Johnson、Megan Sapp Nelson和Katherine Yngve质疑了机构数据收集的集体和地方目的及其对学生归属感的影响,并提出了一个基于数据女权主义的框架,将学生作为一个人而非商品。最后,我们来自Thema Monroe White的闭幕文章聚焦于数据科学领域中被边缘化和代表性不足的人群。作者提出,为了从这些群体中招募更多的人并使该领域多样化,有必要进行与种族相关的、反应灵敏的教学。她讨论了文化相关教育学的Ladson-Billings模式是如何被应用并有益于STEM课程的,以及解放性的数据科学课程如何促进学生的声音和技能
Positionality statement
As we begin to discuss this issue, its origins, and its importance in contemporary society, I wanted to acknowledge my positionality and the role that it may play in the formation of this issue. Jonathan O. Cain is an African-American male working in the LIS field. Before moving into administration, I taught data and digital literacy and worked on developing programs that focused on improving access to these critical skills at zero cost to learners.
It is important to acknowledge my positionality and the lens through which I see the data science field. Trevor Watkins is an African American male working in the LIS field at an academic institution in an academic library. I teach critical data literacy workshops and engage in diversity and BIPOC-related digital projects with faculty, students, and the broader academic community across the country. I am also a researcher and practitioner in artificial intelligence (AI) and data science.
The global pandemic, its impacts, and why it matters
We first met in August 2020 to discuss the possibilities of this special issue about five months into the pandemic. We spent a good chunk of that meeting getting to know each other and, most importantly, discussed the toll the pandemic placed on our communities and us. It is probably safe to say that many of you, at some point, were uncertain of the future. Like most people worldwide, we lost family and friends or knew of people who succumbed to Covid-19 and other illnesses that weren't treated because the focus shifted to Covid-19. We get it. At one point, Covid-19 killed over three thousand people per day (Centers for Disease Control and Prevention (CDC), 2022). According to data from the CDC, 90% of the 385,676 people who died between March and December 2020 had Covid-19 listed as the underlying cause of death on their death certificate. The murders of Ahmaud Arbery in February, Breonna Taylor in March, and George Floyd in May 2020 sparked civic unrest across the United States (US) and protests across the globe in solidarity against racial injustice. When we announced this special issue and initiated a call for papers, we didn't get much of a response initially. We expected and acknowledged that it would probably take some time before we received inquiries or proposals about the issue, the intent to submit, or any submissions.
Like many of you, we are still picking up the pieces from 2020 and dealing with the aftermath of Covid-19. The pandemic may be over now, depending on whom you ask, but the emotional scars are still there and may remain so for quite some time. Patience was the one quality we all had throughout this process, which is why we can present this publication today.
Data and liberatory technology
Liberatory technology. This is a concept that invited contemplation as we sat down to record our reflections on this special issue. In drawing together scholars, educators, and practitioners to address the issue of data and its relationship to race, ethnicity, and representation, we, as coeditors, were making a statement about the importance of data, the material impact that this seemingly abstract and ethereal object can and does have on individual and community lives. And thinking about that impact brought liberatory technology to the front of our minds. The definition of liberator technology offered by the IDA B. Wells Just Data Lab intrigues us and invites us to grapple with that topic. They defined liberatory as something that "supports the increased freedom and wellbeing of marginalized people, especially black people outside of capitalism and settler colonial power structures" and technology as "a tool used to accomplish a task." And as we contemplate this set of definitions, we are left to question whether data can be a liberatory technology or not. (LIBERATORY TECHNOLOGY AND DIGITAL MARRONAGE, n.d.)
In Liberation Technology: Black Protest in the Age of Franklin, Richard S. Newman draws parallels with the asserting ownership and mastery of new communication technologies and black liberation activities. Reflecting on the transformative nature of print technology, he writes, "If the Marquis de Condorcet was right in 1793 that print had unshackled Europe from medieval modes of thought and action, then it is also true that print was perhaps the first technology to liberate blacks from the servile images that had long haunted their existence in Western culture." And draws a 19th-century example of how it expressly connects to black lives post-emancipation noting "W. E. B. Du Bois certainly thought that black history and print history worked in tandem. Wherever one found newspapers in the post-Civil War South, he observed, one found some form of black freedom" (Richard S. Newman, 2009, p. 175). He even notes how scholars note that black activists embraced other communication technologies like photography "to reshape the image of African Americans in nineteenth-century culture." (Richard S. Newman, 2009, p. 175)
We have no shortage of examples of how data and data-driven technologies fail to support the "increased freedom and wellbeing of marginalized people outside of capitalism and settler colonial power structures." In 2016, ProPublica published Machine Bias, a report that looks at Risk assessment technologies used in arraignment and sentencing. They report that "The formula was particularly likely to falsely flag black defendants as future, wrongly labeling them this way at almost twice the rate as white defendants" and "white defendants were mislabeled as low risk more often than black defendants" (Julia Angwin, 2016). A 2021 article, Fairness in Criminal Justice Risk Assessments: The State of the Art, in their analysis, noted, "The false negative rate is much higher for whites so that violent white offenders are more likely than violent black offenders to be incorrectly classified as nonviolent. The false positive rate is much higher for blacks so that nonviolent black offenders are more likely than nonviolent white offenders to be incorrectly classified as violent. Both error rates mistakenly inflate the relative representation of blacks predicted to be violent. Such differences can support claims of racial injustice. In this application, the trade-off between two different kinds of fairness has real bite." (Berk et al., 2021, p. 33)
These are just a few examples of how these technological developments, on their own merits, fail to meet the definition offered by the authors of the "Liberatory Technology and Digital Marronage" Zine from the Ida B. Wells Just Data Labs. Reflecting on the technological path illustrated by Newman, the work of ownership and mastery of the tool provides the potential for it to be liberatory. Through this lens, the work of the Just Data Lab is exemplary for this meditation; it draws a direct line from technology, education, mastery, and liberatory technology.
Data in higher education
Data literacy education is an area that has been a focus of our careers in librarianship. It's a space where we saw the libraries' ability to make a meaningful impact. Data has had a tremendous impact on college campuses, from how research is conducted to the pressures colleges feel from stakeholder groups: students, governments, funders, donors, and employers to prepare students with the data and technology skills to gain employment in the knowledge economy.
As colleges and universities have turned (with varying degrees of success) to meet the needs of these communities, a myriad of explorations on the importance of the representation of these marginalized communities in these systems—to combat and dismantle the harmful practices that we see embedded in the systems that drive society and the potentially debilitating consequences they produce. That is partly why the works in this special issue are so important at this moment in time. These scholars and scholar-practitioners are engaging with these issues that drive the opaque structures surrounding us. And hopefully, their work can give us another perspective on how to engage with these structures and transform them to support liberatory practices.
The entries in this issue
We have some fantastic articles for you to read in this issue. We open with an article by Kevin Manuel, Rosa Orlandini, and Alexandra Cooper, who discuss how the collection process of racial, ethnic, and indigenous data has evolved in the Canadian Census since 1871, the erasure of minorities and indigenous citizens from those censuses, and the work to restore and accurately identify and categorize racialized groups.
In the next article, Leigh Phan, Stephanie Labou, Erin Foster, and Ibraheem Ali present a model for data ethics instruction for non-experts by designing and implementing two data ethics workshops. They make important points about the failure of academia to incorporate the ethical use of data in course curriculums and digital literacy training and demonstrate how academic libraries have become an essential resource for the academic community. Their workshop structure can be modeled for any academic library that endeavors to provide a similar service to its community.
In the third article, Natasha Johnson, Megan Sapp Nelson, and Katherine Yngve, interrogate the collective and local purposes of institutional data collection and its impact on student belongingness and propose a framework based on data feminism that centers the student as a person rather than a commodity.
Finally, our closing article from Thema Monroe-White focuses on marginalized and underrepresented people in the data science field. The author proposes that racially relevant and responsive teaching is necessary to recruit more people from these groups and diversify the field. She discusses how the Ladson-Billings model of cultural relevant pedagogy has been applied and is beneficial to STEM curriculums, and how a liberatory data science curriculum could promote a student's voice and se