Mladen Popović, Maruf A. Dhali, Lambert Schomaker, Johannes van der Plicht, Kaare Lund Rasmussen, Jacopo La Nasa, Ilaria Degano, Maria Perla Colombini, Eibert Tigchelaar
{"title":"Dating ancient manuscripts using radiocarbon and AI-based writing style analysis","authors":"Mladen Popović, Maruf A. Dhali, Lambert Schomaker, Johannes van der Plicht, Kaare Lund Rasmussen, Jacopo La Nasa, Ilaria Degano, Maria Perla Colombini, Eibert Tigchelaar","doi":"arxiv-2407.12013","DOIUrl":null,"url":null,"abstract":"Determining the chronology of ancient handwritten manuscripts is essential\nfor reconstructing the evolution of ideas. For the Dead Sea Scrolls, this is\nparticularly important. However, there is an almost complete lack of\ndate-bearing manuscripts evenly distributed across the timeline and written in\nsimilar scripts available for palaeographic comparison. Here, we present Enoch,\na state-of-the-art AI-based date-prediction model, trained on the basis of new\nradiocarbon-dated samples of the scrolls. Enoch uses established\nhandwriting-style descriptors and applies Bayesian ridge regression. The\nchallenge of this study is that the number of radiocarbon-dated manuscripts is\nsmall, while current machine learning requires an abundance of training data.\nWe show that by using combined angular and allographic writing style feature\nvectors and applying Bayesian ridge regression, Enoch could predict the\nradiocarbon-based dates from style, supported by leave-one-out validation, with\nvaried MAEs of 27.9 to 30.7 years relative to the radiocarbon dating. Enoch was\nthen used to estimate the dates of 135 unseen manuscripts, revealing that 79\nper cent of the samples were considered 'realistic' upon palaeographic post-hoc\nevaluation. We present a new chronology of the scrolls. The radiocarbon ranges\nand Enoch's style-based predictions are often older than the traditionally\nassumed palaeographic estimates. In the range of 300-50 BCE, Enoch's date\nprediction provides an improved granularity. The study is in line with current\ndevelopments in multimodal machine-learning techniques, and the methods can be\nused for date prediction in other partially-dated manuscript collections. This\nresearch shows how Enoch's quantitative, probability-based approach can be a\ntool for palaeographers and historians, re-dating ancient Jewish key texts and\ncontributing to current debates on Jewish and Christian origins.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.12013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Determining the chronology of ancient handwritten manuscripts is essential
for reconstructing the evolution of ideas. For the Dead Sea Scrolls, this is
particularly important. However, there is an almost complete lack of
date-bearing manuscripts evenly distributed across the timeline and written in
similar scripts available for palaeographic comparison. Here, we present Enoch,
a state-of-the-art AI-based date-prediction model, trained on the basis of new
radiocarbon-dated samples of the scrolls. Enoch uses established
handwriting-style descriptors and applies Bayesian ridge regression. The
challenge of this study is that the number of radiocarbon-dated manuscripts is
small, while current machine learning requires an abundance of training data.
We show that by using combined angular and allographic writing style feature
vectors and applying Bayesian ridge regression, Enoch could predict the
radiocarbon-based dates from style, supported by leave-one-out validation, with
varied MAEs of 27.9 to 30.7 years relative to the radiocarbon dating. Enoch was
then used to estimate the dates of 135 unseen manuscripts, revealing that 79
per cent of the samples were considered 'realistic' upon palaeographic post-hoc
evaluation. We present a new chronology of the scrolls. The radiocarbon ranges
and Enoch's style-based predictions are often older than the traditionally
assumed palaeographic estimates. In the range of 300-50 BCE, Enoch's date
prediction provides an improved granularity. The study is in line with current
developments in multimodal machine-learning techniques, and the methods can be
used for date prediction in other partially-dated manuscript collections. This
research shows how Enoch's quantitative, probability-based approach can be a
tool for palaeographers and historians, re-dating ancient Jewish key texts and
contributing to current debates on Jewish and Christian origins.