Manuela Ortega Monsalve, Mario Cerón-Muñoz, Luis Galeano-Vasco, Marisol Medina-Sierra
{"title":"利用机器学习模型从实验室高光谱图像预测土壤中的有机碳和氮含量","authors":"Manuela Ortega Monsalve, Mario Cerón-Muñoz, Luis Galeano-Vasco, Marisol Medina-Sierra","doi":"10.1155/2023/4389885","DOIUrl":null,"url":null,"abstract":"Organic carbon and total nitrogen are essential nutrients for plant growth. The presence of these nutrients at acceptable levels can create an optimal environment for the development of crops of interest. The application of spectroscopic techniques and the use of machine learning algorithms have made it possible to calibrate models capable of predicting the number of elements present in the soil. One of these techniques is hyperspectral imaging, which captures portions of the electromagnetic spectrum where the materials present in the soil can be differentiated due to the vibrations of chemical bonds. The objective of this research is to use statistical models to predict OC and N in soils from hyperspectral images. Transformations were applied to spectral and chemical data and the models used were Random Forest (RF) and Support Vector Machine (SVM). To select the best model, the values of the coefficient of determination (<span><svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"></path></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"></path></g></svg>),</span> root mean square error of prediction (RMSEP), and the ratio of performance to deviation (RPD) were considered. For OC, the values found for the RF model were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.87, an RMSEP of 0.10, and an RPD of 6.74; the SVM model presented an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.92, an RMSEP of 0.20, and an RPD of 3.56. For the variable N, the values found for the RF model were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.79, an RMSEP of 0.03, and an RPD of 5.44; for the SVM model, they were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.87, an RMSEP of 0.08, and an RPD of 2.76. The RF model showed a better fit for both variables. The SVM model also produced acceptable results. The results show that machine learning models are a good alternative for analysing soil-related variables.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Use of Machine Learning Models for Prediction of Organic Carbon and Nitrogen in Soil from Hyperspectral Imagery in Laboratory\",\"authors\":\"Manuela Ortega Monsalve, Mario Cerón-Muñoz, Luis Galeano-Vasco, Marisol Medina-Sierra\",\"doi\":\"10.1155/2023/4389885\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Organic carbon and total nitrogen are essential nutrients for plant growth. The presence of these nutrients at acceptable levels can create an optimal environment for the development of crops of interest. The application of spectroscopic techniques and the use of machine learning algorithms have made it possible to calibrate models capable of predicting the number of elements present in the soil. One of these techniques is hyperspectral imaging, which captures portions of the electromagnetic spectrum where the materials present in the soil can be differentiated due to the vibrations of chemical bonds. The objective of this research is to use statistical models to predict OC and N in soils from hyperspectral images. Transformations were applied to spectral and chemical data and the models used were Random Forest (RF) and Support Vector Machine (SVM). To select the best model, the values of the coefficient of determination (<span><svg height=\\\"11.7978pt\\\" style=\\\"vertical-align:-0.2063999pt\\\" version=\\\"1.1\\\" viewbox=\\\"-0.0498162 -11.5914 13.2276 11.7978\\\" width=\\\"13.2276pt\\\" xmlns=\\\"http://www.w3.org/2000/svg\\\" xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\"><g transform=\\\"matrix(.013,0,0,-0.013,0,0)\\\"></path></g><g transform=\\\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\\\"></path></g></svg>),</span> root mean square error of prediction (RMSEP), and the ratio of performance to deviation (RPD) were considered. For OC, the values found for the RF model were an <svg height=\\\"11.7978pt\\\" style=\\\"vertical-align:-0.2063999pt\\\" version=\\\"1.1\\\" viewbox=\\\"-0.0498162 -11.5914 13.2276 11.7978\\\" width=\\\"13.2276pt\\\" xmlns=\\\"http://www.w3.org/2000/svg\\\" xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\"><g transform=\\\"matrix(.013,0,0,-0.013,0,0)\\\"><use xlink:href=\\\"#g113-83\\\"></use></g><g transform=\\\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\\\"><use xlink:href=\\\"#g50-51\\\"></use></g></svg> of 0.87, an RMSEP of 0.10, and an RPD of 6.74; the SVM model presented an <svg height=\\\"11.7978pt\\\" style=\\\"vertical-align:-0.2063999pt\\\" version=\\\"1.1\\\" viewbox=\\\"-0.0498162 -11.5914 13.2276 11.7978\\\" width=\\\"13.2276pt\\\" xmlns=\\\"http://www.w3.org/2000/svg\\\" xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\"><g transform=\\\"matrix(.013,0,0,-0.013,0,0)\\\"><use xlink:href=\\\"#g113-83\\\"></use></g><g transform=\\\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\\\"><use xlink:href=\\\"#g50-51\\\"></use></g></svg> of 0.92, an RMSEP of 0.20, and an RPD of 3.56. For the variable N, the values found for the RF model were an <svg height=\\\"11.7978pt\\\" style=\\\"vertical-align:-0.2063999pt\\\" version=\\\"1.1\\\" viewbox=\\\"-0.0498162 -11.5914 13.2276 11.7978\\\" width=\\\"13.2276pt\\\" xmlns=\\\"http://www.w3.org/2000/svg\\\" xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\"><g transform=\\\"matrix(.013,0,0,-0.013,0,0)\\\"><use xlink:href=\\\"#g113-83\\\"></use></g><g transform=\\\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\\\"><use xlink:href=\\\"#g50-51\\\"></use></g></svg> of 0.79, an RMSEP of 0.03, and an RPD of 5.44; for the SVM model, they were an <svg height=\\\"11.7978pt\\\" style=\\\"vertical-align:-0.2063999pt\\\" version=\\\"1.1\\\" viewbox=\\\"-0.0498162 -11.5914 13.2276 11.7978\\\" width=\\\"13.2276pt\\\" xmlns=\\\"http://www.w3.org/2000/svg\\\" xmlns:xlink=\\\"http://www.w3.org/1999/xlink\\\"><g transform=\\\"matrix(.013,0,0,-0.013,0,0)\\\"><use xlink:href=\\\"#g113-83\\\"></use></g><g transform=\\\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\\\"><use xlink:href=\\\"#g50-51\\\"></use></g></svg> of 0.87, an RMSEP of 0.08, and an RPD of 2.76. The RF model showed a better fit for both variables. The SVM model also produced acceptable results. The results show that machine learning models are a good alternative for analysing soil-related variables.\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2023-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1155/2023/4389885\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1155/2023/4389885","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Use of Machine Learning Models for Prediction of Organic Carbon and Nitrogen in Soil from Hyperspectral Imagery in Laboratory
Organic carbon and total nitrogen are essential nutrients for plant growth. The presence of these nutrients at acceptable levels can create an optimal environment for the development of crops of interest. The application of spectroscopic techniques and the use of machine learning algorithms have made it possible to calibrate models capable of predicting the number of elements present in the soil. One of these techniques is hyperspectral imaging, which captures portions of the electromagnetic spectrum where the materials present in the soil can be differentiated due to the vibrations of chemical bonds. The objective of this research is to use statistical models to predict OC and N in soils from hyperspectral images. Transformations were applied to spectral and chemical data and the models used were Random Forest (RF) and Support Vector Machine (SVM). To select the best model, the values of the coefficient of determination (), root mean square error of prediction (RMSEP), and the ratio of performance to deviation (RPD) were considered. For OC, the values found for the RF model were an of 0.87, an RMSEP of 0.10, and an RPD of 6.74; the SVM model presented an of 0.92, an RMSEP of 0.20, and an RPD of 3.56. For the variable N, the values found for the RF model were an of 0.79, an RMSEP of 0.03, and an RPD of 5.44; for the SVM model, they were an of 0.87, an RMSEP of 0.08, and an RPD of 2.76. The RF model showed a better fit for both variables. The SVM model also produced acceptable results. The results show that machine learning models are a good alternative for analysing soil-related variables.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.