Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research
Joe Nockels, Paul Gooding, Sarah Ames, Melissa Terras
Archival Science, vol. 22, no. 3, pp. 367-392. Published 2022-06-17. DOI: 10.1007/s10502-022-09397-0
https://link.springer.com/article/10.1007/s10502-022-09397-0
Citations: 8
Abstract
Handwritten Text Recognition (HTR) technology is now a mature machine learning tool that is being integrated into the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full-text searching and analysis of historic texts at scale. However, research into how HTR is changing our information environment is scant. This paper presents a systematic literature review of how researchers are using one particular HTR platform, Transkribus, to indicate the domains where HTR is applied, the approaches taken, and how the technology is understood. A total of 381 papers from 2015 to 2020 were gathered from Google Scholar, Scopus, and Web of Science, then grouped and coded into categories using quantitative and qualitative approaches. Published research that mentions Transkribus is international and rapidly growing. Transkribus features primarily in archival and library science publications, while a long tail of broad and eclectic disciplines, including history, computer science, citizen science, law and education, demonstrates the wider applicability of the tool. The most common paper categories were humanities applications (67%), technological (25%), users (5%) and tutorials (3%). This paper presents the first overarching review of HTR as featured in published research, while also elucidating how HTR is affecting the information environment.
About the journal:
Archival Science promotes the development of archival science as an autonomous scientific discipline. The journal covers all aspects of archival science theory, methodology, and practice. Moreover, it investigates different cultural approaches to creation, management and provision of access to archives, records, and data. It also seeks to promote the exchange and comparison of concepts, views and attitudes related to recordkeeping issues around the world.
Archival Science's approach is integrated, interdisciplinary, and intercultural. Its scope encompasses the entire field of recorded process-related information, analyzed in terms of form, structure, and context. To meet its objectives, the journal draws from scientific disciplines that deal with the function of records and the way they are created, preserved, and retrieved; the context in which information is generated, managed, and used; and the social and cultural environment of records creation at different times and places.
Covers all aspects of archival science theory, methodology, and practice
Investigates different cultural approaches to creation, management and provision of access to archives, records, and data
Promotes the exchange and comparison of concepts, views, and attitudes related to recordkeeping issues around the world
Addresses the entire field of recorded process-related information, analyzed in terms of form, structure, and context