Chemometrics reveals not-so-obvious analytical information

IF 1.1 Q4 CHEMISTRY, ANALYTICAL
F. Pereira
{"title":"Chemometrics reveals not-so-obvious analytical information","authors":"F. Pereira","doi":"10.30744/brjac.2179-3425.letter-fabiolaverbi.n37","DOIUrl":null,"url":null,"abstract":"The application of chemometric tools in analytical chemistry or other areas of chemistry has become essential. This is mainly due to the large amount and nature of the generated data1,2 and the need to extract useful information from these and optimize steps throughout a process. It allows the quick decision-making visualization of interactions among variables, such as synergism or antagonism between parameters, during the development of a method,3 as shown in Figure 1. Classical chemometric techniques have been disseminated and can be divided according to the study approach, among which exploratory data analysis stands out. Principal component analysis (PCA) is one of the most accessible and well-established ways to perform an initial exploration and extract relevant information from a given dataset and has been used quite successfully in various spectroscopic techniques.4 Principal component analysis consists of projecting the data in a smaller dimension, enabling the detection of anomalous samples (outliers), the selection of essential variables in a given system, and unsupervised classification.1,2,4 Another branch of chemometrics involves the design of experiments (DoE). The primary purpose of the factorial design is to study the influence or effect of a given variable and its interactions in a specific system.5-9 Multivariate calibration is another aspect of chemometrics, where several variables are used to calibrate one (or more) property or the concentration of a given chemical analyte.10,11 Since the first publications of chemometric tools, numerous variations of these techniques, proposals for data fusion strategies, and applications using hyphenated instrumental techniques have been proposed.12-14 Industrial quality control and development (R&D) laboratories require an approach addressing adequate quality by design (QbD). The QbD strategies consider four steps that include an analytical target profile (ATP), a risk assessment, a design space (DS), and control strategy and validation based on figures of merit, for instance.9 Principal component analysis is the most widely multivariate technique used for data analysis. Jolliffe wrote a review reporting his wonderful experience with PCA in the last 50 years.15 Indeed, PCA is an invaluable method for data, and I agree with it. PCA is the algorithm of choice for numerous chemometric techniques.16 Other computational languages, such as Python, are currently experiencing a rise in popularity in the field of chemistry. The R language has also become more popular than it was ten years ago. The scripts, functions, or codes are easily written with fewer lines and specific commands that minimize steps and help speed up calculations. The dissemination of free software has also become popular, and the sharing of codes through publications, social media, communities, or websites has become relatively easy. From my point of view, chemometrics is no longer faced as a giant monster or a way to become scientific papers fancier without helpful content. Chemometrics extract information that is not easy to visualize at first through univariate evaluations or using simple plots. Nowadays, thousands of instrumental data can furnish important chemical information, and we must use them for significant proposals. Indeed, QbD is proof of that, as Industry 4.0 is a reality.","PeriodicalId":9115,"journal":{"name":"Brazilian Journal of Analytical Chemistry","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2022-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brazilian Journal of Analytical Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30744/brjac.2179-3425.letter-fabiolaverbi.n37","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The application of chemometric tools in analytical chemistry or other areas of chemistry has become essential. This is mainly due to the large amount and nature of the generated data1,2 and the need to extract useful information from these and optimize steps throughout a process. It allows the quick decision-making visualization of interactions among variables, such as synergism or antagonism between parameters, during the development of a method,3 as shown in Figure 1. Classical chemometric techniques have been disseminated and can be divided according to the study approach, among which exploratory data analysis stands out. Principal component analysis (PCA) is one of the most accessible and well-established ways to perform an initial exploration and extract relevant information from a given dataset and has been used quite successfully in various spectroscopic techniques.4 Principal component analysis consists of projecting the data in a smaller dimension, enabling the detection of anomalous samples (outliers), the selection of essential variables in a given system, and unsupervised classification.1,2,4 Another branch of chemometrics involves the design of experiments (DoE). The primary purpose of the factorial design is to study the influence or effect of a given variable and its interactions in a specific system.5-9 Multivariate calibration is another aspect of chemometrics, where several variables are used to calibrate one (or more) property or the concentration of a given chemical analyte.10,11 Since the first publications of chemometric tools, numerous variations of these techniques, proposals for data fusion strategies, and applications using hyphenated instrumental techniques have been proposed.12-14 Industrial quality control and development (R&D) laboratories require an approach addressing adequate quality by design (QbD). The QbD strategies consider four steps that include an analytical target profile (ATP), a risk assessment, a design space (DS), and control strategy and validation based on figures of merit, for instance.9 Principal component analysis is the most widely multivariate technique used for data analysis. Jolliffe wrote a review reporting his wonderful experience with PCA in the last 50 years.15 Indeed, PCA is an invaluable method for data, and I agree with it. PCA is the algorithm of choice for numerous chemometric techniques.16 Other computational languages, such as Python, are currently experiencing a rise in popularity in the field of chemistry. The R language has also become more popular than it was ten years ago. The scripts, functions, or codes are easily written with fewer lines and specific commands that minimize steps and help speed up calculations. The dissemination of free software has also become popular, and the sharing of codes through publications, social media, communities, or websites has become relatively easy. From my point of view, chemometrics is no longer faced as a giant monster or a way to become scientific papers fancier without helpful content. Chemometrics extract information that is not easy to visualize at first through univariate evaluations or using simple plots. Nowadays, thousands of instrumental data can furnish important chemical information, and we must use them for significant proposals. Indeed, QbD is proof of that, as Industry 4.0 is a reality.
化学计量学揭示了不太明显的分析信息
化学计量工具在分析化学或其他化学领域的应用已经变得必不可少。这主要是由于生成的数据数量和性质都很大1,2,并且需要从这些数据中提取有用的信息并优化整个流程中的步骤。它允许在方法开发过程中对变量之间的交互进行快速决策可视化,例如参数之间的协同作用或对抗作用,如图1所示。经典的化学计量学技术已经广为传播,并可以根据研究方法进行划分,其中探索性数据分析是最突出的。主成分分析(PCA)是从给定数据集中进行初步探索和提取相关信息的最容易获得和最完善的方法之一,并已成功地用于各种光谱技术主成分分析包括在较小的维度上投射数据,能够检测异常样本(异常值),在给定系统中选择基本变量以及无监督分类。化学计量学的另一个分支涉及实验设计(DoE)。析因设计的主要目的是研究特定系统中给定变量的影响或效应及其相互作用。5-9多元校准是化学计量学的另一个方面,其中使用几个变量来校准一个(或多个)属性或给定化学分析物的浓度。10,11自从化学计量工具首次发表以来,已经提出了这些技术的许多变体,数据融合策略的建议,以及使用连字仪器技术的应用。12-14工业质量控制和开发(R&D)实验室需要一种方法来解决充分的质量设计(QbD)。QbD策略考虑四个步骤,包括分析目标概要(ATP)、风险评估、设计空间(DS)、控制策略和基于价值数字的验证主成分分析是应用最广泛的多变量数据分析技术。15 . Jolliffe写了一篇综述,报告了他在过去50年里使用PCA的美妙经历的确,PCA是处理数据的一种非常宝贵的方法,我同意这一点。PCA是许多化学计量学技术的首选算法其他计算语言,如Python,目前在化学领域也越来越受欢迎。R语言也比十年前更受欢迎。脚本、函数或代码可以用更少的行和特定的命令轻松编写,从而最小化步骤并帮助加快计算速度。自由软件的传播也变得流行起来,通过出版物、社会媒体、社区或网站共享代码也变得相对容易。从我的角度来看,化学计量学不再是一个巨大的怪物,也不再是一种没有有用内容的科学论文的花哨方式。化学计量学首先通过单变量评估或使用简单的图提取不容易可视化的信息。如今,成千上万的仪器数据可以提供重要的化学信息,我们必须利用它们来提出有意义的建议。事实上,QbD证明了这一点,因为工业4.0已经成为现实。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
14.30%
发文量
46
期刊介绍: BrJAC is dedicated to the diffusion of significant and original knowledge in all branches of Analytical Chemistry, and is addressed to professionals involved in science, technology and innovation projects at universities, research centers and in industry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信