{"title":"基于气体传感器阵列多维数据的机器学习方法对排水和河流水质分类的可行性。","authors":"Magdalena Piłat-Rożek, Grzegorz Łagód","doi":"10.26444/aaem/196101","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The aim of the study is to verify whether the electronic nose system - an array of 17 gas sensors with a signal analysis system - is a useful tool for the classification and preliminary assessment of the quality of drainage water.</p><p><strong>Material and methods: </strong>Water samples for analysis were collected in the Park Ludowy (People's Park), located next to the Bystrzyca River, near the city center of Lublin in eastern Poland. Drainage water was sampled at 4 different points. Samples of synthetic air and river water taken from the Bystrzyca River were used for reference. All water samples were tested using an MOS gas sensor array. In order to assess how the e-nose performed in screening and discriminating/preliminarily classifying and grouping samples, their properties were tested using reference methods and assessing surface water quality. The PCA method, Kohonen's SOM with superimposed cluster boundaries by McQuitty's method, random forest and MLP neural network were used to visualize and classify the multivariate data.</p><p><strong>Results: </strong>The visualization and multidimensionality reduction methods (PCA and SOM) did not enable to clearly distinguish the observations from different drainage water samples. The supervised random forest and MLP methods coped with the classification of samples much better, achieving 84.3% and 87.6% correct classifications on the test set, respectively.</p><p><strong>Conclusions: </strong>Statistical analysis of the chemical properties of the samples showed that even reference tests are unable to clearly distinguish the samples in terms of a single parameter. However, the e-nose method makes it possible to distinguish these samples from a reference sample derived from river water and a clean air sample.</p>","PeriodicalId":50970,"journal":{"name":"Annals of Agricultural and Environmental Medicine","volume":"31 4","pages":"513-519"},"PeriodicalIF":1.3000,"publicationDate":"2024-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feasibility of classification of drainage and river water quality using machine learning methods based on multidimensional data from a gas sensor array.\",\"authors\":\"Magdalena Piłat-Rożek, Grzegorz Łagód\",\"doi\":\"10.26444/aaem/196101\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>The aim of the study is to verify whether the electronic nose system - an array of 17 gas sensors with a signal analysis system - is a useful tool for the classification and preliminary assessment of the quality of drainage water.</p><p><strong>Material and methods: </strong>Water samples for analysis were collected in the Park Ludowy (People's Park), located next to the Bystrzyca River, near the city center of Lublin in eastern Poland. Drainage water was sampled at 4 different points. Samples of synthetic air and river water taken from the Bystrzyca River were used for reference. All water samples were tested using an MOS gas sensor array. In order to assess how the e-nose performed in screening and discriminating/preliminarily classifying and grouping samples, their properties were tested using reference methods and assessing surface water quality. The PCA method, Kohonen's SOM with superimposed cluster boundaries by McQuitty's method, random forest and MLP neural network were used to visualize and classify the multivariate data.</p><p><strong>Results: </strong>The visualization and multidimensionality reduction methods (PCA and SOM) did not enable to clearly distinguish the observations from different drainage water samples. The supervised random forest and MLP methods coped with the classification of samples much better, achieving 84.3% and 87.6% correct classifications on the test set, respectively.</p><p><strong>Conclusions: </strong>Statistical analysis of the chemical properties of the samples showed that even reference tests are unable to clearly distinguish the samples in terms of a single parameter. However, the e-nose method makes it possible to distinguish these samples from a reference sample derived from river water and a clean air sample.</p>\",\"PeriodicalId\":50970,\"journal\":{\"name\":\"Annals of Agricultural and Environmental Medicine\",\"volume\":\"31 4\",\"pages\":\"513-519\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Agricultural and Environmental Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.26444/aaem/196101\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/2 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Agricultural and Environmental Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.26444/aaem/196101","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/2 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
目的:本研究的目的是验证电子鼻系统-由17个气体传感器组成的阵列和信号分析系统-是否是一种有用的工具,用于分类和初步评估排水质量。材料和方法:用于分析的水样收集于波兰东部卢布林市中心附近Bystrzyca河旁的Ludowy公园(人民公园)。在4个不同的点取样排水。从Bystrzyca河提取的合成空气和河水样本作为参考。所有水样均使用MOS气体传感器阵列进行测试。为了评价电子鼻对样品的筛选和鉴别/初步分类和分组效果,采用参考方法对其性能进行了测试,并对地表水水质进行了评价。采用主成分分析法(PCA)、Kohonen’s SOM (McQuitty’s method叠加聚类边界)、随机森林和MLP神经网络对多变量数据进行可视化分类。结果:可视化和多维降维方法(PCA和SOM)不能清晰区分不同排水水样的观测结果。监督随机森林和MLP方法对样本的分类处理要好得多,在测试集上的分类正确率分别为84.3%和87.6%。结论:对样品化学性质的统计分析表明,即使参考试验也无法根据单一参数明确区分样品。然而,电子鼻方法可以将这些样品与取自河水的参考样品和洁净空气样品区分开来。
Feasibility of classification of drainage and river water quality using machine learning methods based on multidimensional data from a gas sensor array.
Objective: The aim of the study is to verify whether the electronic nose system - an array of 17 gas sensors with a signal analysis system - is a useful tool for the classification and preliminary assessment of the quality of drainage water.
Material and methods: Water samples for analysis were collected in the Park Ludowy (People's Park), located next to the Bystrzyca River, near the city center of Lublin in eastern Poland. Drainage water was sampled at 4 different points. Samples of synthetic air and river water taken from the Bystrzyca River were used for reference. All water samples were tested using an MOS gas sensor array. In order to assess how the e-nose performed in screening and discriminating/preliminarily classifying and grouping samples, their properties were tested using reference methods and assessing surface water quality. The PCA method, Kohonen's SOM with superimposed cluster boundaries by McQuitty's method, random forest and MLP neural network were used to visualize and classify the multivariate data.
Results: The visualization and multidimensionality reduction methods (PCA and SOM) did not enable to clearly distinguish the observations from different drainage water samples. The supervised random forest and MLP methods coped with the classification of samples much better, achieving 84.3% and 87.6% correct classifications on the test set, respectively.
Conclusions: Statistical analysis of the chemical properties of the samples showed that even reference tests are unable to clearly distinguish the samples in terms of a single parameter. However, the e-nose method makes it possible to distinguish these samples from a reference sample derived from river water and a clean air sample.
期刊介绍:
All papers within the scope indicated by the following sections of the journal may be submitted:
Biological agents posing occupational risk in agriculture, forestry, food industry and wood industry and diseases caused by these agents (zoonoses, allergic and immunotoxic diseases).
Health effects of chemical pollutants in agricultural areas , including occupational and non-occupational effects of agricultural chemicals (pesticides, fertilizers) and effects of industrial disposal (heavy metals, sulphur, etc.) contaminating the atmosphere, soil and water.
Exposure to physical hazards associated with the use of machinery in agriculture and forestry: noise, vibration, dust.
Prevention of occupational diseases in agriculture, forestry, food industry and wood industry.
Work-related accidents and injuries in agriculture, forestry, food industry and wood industry: incidence, causes, social aspects and prevention.
State of the health of rural communities depending on various factors: social factors, accessibility of medical care, etc.