Machine learning regression algorithms for generating chemical element maps from X-ray fluorescence data of paintings

IF 3.7 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS
Juan Ruiz de Miras , María José Gacto , María Rosario Blanc , Germán Arroyo , Luis López , Juan Carlos Torres , Domingo Martín
{"title":"Machine learning regression algorithms for generating chemical element maps from X-ray fluorescence data of paintings","authors":"Juan Ruiz de Miras ,&nbsp;María José Gacto ,&nbsp;María Rosario Blanc ,&nbsp;Germán Arroyo ,&nbsp;Luis López ,&nbsp;Juan Carlos Torres ,&nbsp;Domingo Martín","doi":"10.1016/j.chemolab.2024.105116","DOIUrl":null,"url":null,"abstract":"<div><p>Generating chemical element maps of paintings from X-ray fluorescence (XRF) data is a very valuable tool for the scientific community of conservators and art historians. Hand-held XRF scanners are cheap and easily portable but their use provides scans with a few data, so additional analytical tools are needed to obtain reliable chemical element maps from them. Recently, the software tool <em>SmART_Scan</em> was released, which uses an algorithm based on the minimum hypercube distance (MHD) to compute this kind of maps. In this paper, we propose a new methodology to address this problem by using machine learning algorithms for regression as alternative and more accurate techniques than MHD. We tested MHD versus eight machine learning regression algorithms on two paintings with different features. Our results showed that machine learning algorithms Random Forest and kNN significantly outperformed MHD in Mean Squared Error (MSE) and coefficient of determination (R<sup>2</sup>) for all the experiments. When using experts’ data and a hold-out validation, kNN was the best-ranked algorithm. Random Forest was the best-ranked algorithm when cross-validation was used. We did not find significant differences in average MSE nor in R<sup>2</sup> between kNN and Random Forest, so we can conclude that Random Forest is the best-suited algorithm for computing chemical element maps of paintings from XRF data.</p></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"248 ","pages":"Article 105116"},"PeriodicalIF":3.7000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S016974392400056X/pdfft?md5=09e533e1d29129df1d947e2c62c61ef0&pid=1-s2.0-S016974392400056X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016974392400056X","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Generating chemical element maps of paintings from X-ray fluorescence (XRF) data is a very valuable tool for the scientific community of conservators and art historians. Hand-held XRF scanners are cheap and easily portable but their use provides scans with a few data, so additional analytical tools are needed to obtain reliable chemical element maps from them. Recently, the software tool SmART_Scan was released, which uses an algorithm based on the minimum hypercube distance (MHD) to compute this kind of maps. In this paper, we propose a new methodology to address this problem by using machine learning algorithms for regression as alternative and more accurate techniques than MHD. We tested MHD versus eight machine learning regression algorithms on two paintings with different features. Our results showed that machine learning algorithms Random Forest and kNN significantly outperformed MHD in Mean Squared Error (MSE) and coefficient of determination (R2) for all the experiments. When using experts’ data and a hold-out validation, kNN was the best-ranked algorithm. Random Forest was the best-ranked algorithm when cross-validation was used. We did not find significant differences in average MSE nor in R2 between kNN and Random Forest, so we can conclude that Random Forest is the best-suited algorithm for computing chemical element maps of paintings from XRF data.

从绘画的 X 射线荧光数据生成化学元素图的机器学习回归算法
根据 X 射线荧光 (XRF) 数据生成绘画作品的化学元素图谱,对于保护工作者和艺术史学家等科学界人士来说是一个非常有价值的工具。手持式 XRF 扫描仪价格低廉,便于携带,但其扫描数据较少,因此需要额外的分析工具才能从中获得可靠的化学元素图谱。最近发布的软件工具 SmART_Scan,使用基于最小超立方距离(MHD)的算法来计算这类地图。在本文中,我们提出了一种解决这一问题的新方法,即使用机器学习算法进行回归,作为比 MHD 更准确的替代技术。我们在两幅具有不同特征的绘画作品上测试了 MHD 和八种机器学习回归算法。结果表明,在所有实验中,机器学习算法随机森林(Random Forest)和kNN在平均平方误差(MSE)和判定系数(R2)方面明显优于MHD。在使用专家数据和排除验证时,kNN 是排名最好的算法。在使用交叉验证时,随机森林是排名最好的算法。我们没有发现 kNN 和随机森林在平均 MSE 和 R2 上有明显差异,因此我们可以得出结论,随机森林是最适合根据 XRF 数据计算绘画化学元素图谱的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.50
自引率
7.70%
发文量
169
审稿时长
3.4 months
期刊介绍: Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines. Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data. The journal deals with the following topics: 1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.) 2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered. 3) Development of new software that provides novel tools or truly advances the use of chemometrical methods. 4) Well characterized data sets to test performance for the new methods and software. The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信