Deep or Shallow? A Comparative Analysis on the Oil Species Identification Based on Excitation-Emission Matrix and Multiple Machine Learning Algorithms.
{"title":"Deep or Shallow? A Comparative Analysis on the Oil Species Identification Based on Excitation-Emission Matrix and Multiple Machine Learning Algorithms.","authors":"Ming Xie, Qintuan Xu, Ying Li","doi":"10.1007/s10895-023-03511-w","DOIUrl":null,"url":null,"abstract":"<p><p>With the continuous expansion of petroleum extraction, transportation, and storage, the risk of oil spills at sea has also increased, posing a serious threat to marine safety. The excitation-emission matrix (EEM), which is composed of the fluorometric spectra under multiple excitation wavelengths, becomes a feasible approach to identify oil species. Despite the fact that various machine learning models have been applied to analyse EEMs of oil pollutants, it is unclear how much improvements the deep learning models have achieved, especially comparing with the shallow learning models. This paper presents a comparative analysis on the oil species identification using four types of machine learning models: random forest (RF), support vector machine (SVM), back propagation neural network (BPNN), and deep convolutional neural network (DCNN). The fluorescence of some common oils was excited using a tuneable xenon lamp and collected with a high-resolution spectrometer to form the EEMs for model training and testing.The results show that SVM, BPNN, and DCNN achieved high identification accuracies that are more than 93% on all types of oils tested in the study. The two deep learning models didn't have significant improvement over the SVM model. Considering the fact that the deep learning models require much larger number of calculations and longer running time, the SVM tends to be more suitable for oil species identification when considering the balance between the model accuracy and efficiency. This study provides some guidance on the choices of oil species identification model in the cases of oil spills.</p>","PeriodicalId":15800,"journal":{"name":"Journal of Fluorescence","volume":" ","pages":"2907-2915"},"PeriodicalIF":2.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Fluorescence","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s10895-023-03511-w","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/11/14 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
With the continuous expansion of petroleum extraction, transportation, and storage, the risk of oil spills at sea has also increased, posing a serious threat to marine safety. The excitation-emission matrix (EEM), which is composed of the fluorometric spectra under multiple excitation wavelengths, becomes a feasible approach to identify oil species. Despite the fact that various machine learning models have been applied to analyse EEMs of oil pollutants, it is unclear how much improvements the deep learning models have achieved, especially comparing with the shallow learning models. This paper presents a comparative analysis on the oil species identification using four types of machine learning models: random forest (RF), support vector machine (SVM), back propagation neural network (BPNN), and deep convolutional neural network (DCNN). The fluorescence of some common oils was excited using a tuneable xenon lamp and collected with a high-resolution spectrometer to form the EEMs for model training and testing.The results show that SVM, BPNN, and DCNN achieved high identification accuracies that are more than 93% on all types of oils tested in the study. The two deep learning models didn't have significant improvement over the SVM model. Considering the fact that the deep learning models require much larger number of calculations and longer running time, the SVM tends to be more suitable for oil species identification when considering the balance between the model accuracy and efficiency. This study provides some guidance on the choices of oil species identification model in the cases of oil spills.
期刊介绍:
Journal of Fluorescence is an international forum for the publication of peer-reviewed original articles that advance the practice of this established spectroscopic technique. Topics covered include advances in theory/and or data analysis, studies of the photophysics of aromatic molecules, solvent, and environmental effects, development of stationary or time-resolved measurements, advances in fluorescence microscopy, imaging, photobleaching/recovery measurements, and/or phosphorescence for studies of cell biology, chemical biology and the advanced uses of fluorescence in flow cytometry/analysis, immunology, high throughput screening/drug discovery, DNA sequencing/arrays, genomics and proteomics. Typical applications might include studies of macromolecular dynamics and conformation, intracellular chemistry, and gene expression. The journal also publishes papers that describe the synthesis and characterization of new fluorophores, particularly those displaying unique sensitivities and/or optical properties. In addition to original articles, the Journal also publishes reviews, rapid communications, short communications, letters to the editor, topical news articles, and technical and design notes.