Mrinalini Luthra, Konstantin Todorov, C. Jeurgens, Giovanni Colavizza
{"title":"通过自动实体识别解除殖民地档案的沉默","authors":"Mrinalini Luthra, Konstantin Todorov, C. Jeurgens, Giovanni Colavizza","doi":"10.1108/jd-02-2022-0038","DOIUrl":null,"url":null,"abstract":"PurposeThis paper aims to expand the scope and mitigate the biases of extant archival indexes.Design/methodology/approachThe authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people.FindingsThe authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible.Originality/valueColonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.","PeriodicalId":47969,"journal":{"name":"Journal of Documentation","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unsilencing colonial archives via automated entity recognition\",\"authors\":\"Mrinalini Luthra, Konstantin Todorov, C. Jeurgens, Giovanni Colavizza\",\"doi\":\"10.1108/jd-02-2022-0038\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PurposeThis paper aims to expand the scope and mitigate the biases of extant archival indexes.Design/methodology/approachThe authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people.FindingsThe authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible.Originality/valueColonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.\",\"PeriodicalId\":47969,\"journal\":{\"name\":\"Journal of Documentation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Documentation\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1108/jd-02-2022-0038\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Documentation","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/jd-02-2022-0038","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
Unsilencing colonial archives via automated entity recognition
PurposeThis paper aims to expand the scope and mitigate the biases of extant archival indexes.Design/methodology/approachThe authors use automatic entity recognition on the archives of the Dutch East India Company to extract mentions of underrepresented people.FindingsThe authors release an annotated corpus and baselines for a shared task and show that the proposed goal is feasible.Originality/valueColonial archives are increasingly a focus of attention for historians and the public, broadening access to them is a pressing need for archives.
期刊介绍:
The scope of the Journal of Documentation is broadly information sciences, encompassing all of the academic and professional disciplines which deal with recorded information. These include, but are certainly not limited to: ■Information science, librarianship and related disciplines ■Information and knowledge management ■Information and knowledge organisation ■Information seeking and retrieval, and human information behaviour ■Information and digital literacies