From manuscript inventories to data: Artificial intelligence and databases, a new chapter for the history of science at “G. Marconi” Central Library of the CNR
Authors: Santorsa Sara, Bartolucci Monia, Cilione Emanuela, Florio Isabella, Migliorelli Giorgia, Ranchino Maria Adelaide, Tiberi Luca
Journal: Digital Applications in Archaeology and Cultural Heritage, volume 39, Article e00459
Published: 2025-09-23
DOI: 10.1016/j.daach.2025.e00459
URL: https://www.sciencedirect.com/science/article/pii/S221205482500061X
Abstract
Established in 1927, the “G. Marconi” Central Library of the National Research Council (CNR) has benefited from the legal deposit of Italian technical-scientific publications since its inception, a legacy that has earned it the reputation of being Italy’s National Library of Science and Technology. This paper explores the evolving role of the digital librarian through a project to improve access to the Library’s heritage by analysing and digitising its manuscript inventory books (15/10/1931–10/01/1991). These volumes record summary descriptions of 205,683 bibliographic documents. By defining criteria for preservation, restoration, digitisation, management, and dissemination, the project aims to reconstruct the Library’s historical development, with a focus on its preservation efforts, and to document the key stages of its growth from its foundation to the 1990s. Analysis of the inventories will shed light on significant events that shaped the Library’s history (such as major acquisitions, losses due to natural events, or collaborations with other institutions) and contribute to a cultural history of science and technology. To facilitate data consultation, the project includes the development of a structured database. In addition, Optical Character Recognition (OCR) techniques, in particular Handwritten Text Recognition (HTR), will be used for transcription and text search: HTR methods will identify keywords, analyse page layouts, and automate author recognition to streamline transcription. The study deepens our understanding of the Library’s history and of its role within its region, and offers insight into the evolution of scientific research over time.
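The abstract does not describe the database design, so the following is only a minimal sketch of what a structured store for transcribed inventory entries might look like. It assumes SQLite via Python's standard `sqlite3` module; the table name, every field name, and the helper functions (`create_schema`, `insert_entry`, `search`) are hypothetical and are not taken from the project. Keyword search is approximated here with a plain substring match, not with the HTR-based keyword identification the abstract mentions.

```python
import sqlite3
from dataclasses import dataclass
from datetime import date


@dataclass
class InventoryEntry:
    """One line of an inventory book. All field names are assumptions."""
    inventory_number: int
    entry_date: date   # date the item was registered in the inventory
    author: str        # as transcribed (for example from HTR output)
    title: str
    volume_page: str   # location of the entry in the scanned volume


def create_schema(conn: sqlite3.Connection) -> None:
    """Create a single hypothetical table for inventory entries."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS inventory_entry (
            inventory_number INTEGER PRIMARY KEY,
            entry_date       TEXT NOT NULL,  -- ISO 8601 text sorts chronologically
            author           TEXT NOT NULL,
            title            TEXT NOT NULL,
            volume_page      TEXT NOT NULL
        )
        """
    )


def insert_entry(conn: sqlite3.Connection, entry: InventoryEntry) -> None:
    """Store one entry, using bound parameters rather than string formatting."""
    conn.execute(
        "INSERT INTO inventory_entry VALUES (?, ?, ?, ?, ?)",
        (
            entry.inventory_number,
            entry.entry_date.isoformat(),
            entry.author,
            entry.title,
            entry.volume_page,
        ),
    )


def search(conn: sqlite3.Connection, keyword: str) -> list:
    """Return entries whose author or title contains the keyword.

    SQLite's LIKE is case-insensitive for ASCII letters by default.
    """
    pattern = f"%{keyword}%"
    rows = conn.execute(
        "SELECT inventory_number, entry_date, author, title "
        "FROM inventory_entry "
        "WHERE author LIKE ? OR title LIKE ? "
        "ORDER BY entry_date",
        (pattern, pattern),
    )
    return rows.fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    # Invented placeholder record, not data from the Library's inventories.
    insert_entry(
        conn,
        InventoryEntry(1, date(1931, 10, 15), "Placeholder, A.",
                       "Invented title on radiotelegraphy", "vol. 1, p. 1"),
    )
    print(search(conn, "radio"))
```

Storing dates as ISO 8601 text keeps the sketch dependency-free while still allowing chronological ordering, which matters for a source spanning 1931 to 1991. A real deployment would likely need additional fields (acquisition source, transcription confidence, links to page images) and a proper full-text index.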