Noor Syafina Mahamad Jainalabidin, Aqib Fawwaz Mohd Amidon, N. Ismail, Z. Mohd Yusoff, S. N. Tajuddin, M. Taib
{"title":"The k-Nearest Neighbor modelling by varying Mahalanobis and Correlation in distance metric for agarwood oil quality classification","authors":"Noor Syafina Mahamad Jainalabidin, Aqib Fawwaz Mohd Amidon, N. Ismail, Z. Mohd Yusoff, S. N. Tajuddin, M. Taib","doi":"10.11591/ijaas.v11.i3.pp242-252","DOIUrl":null,"url":null,"abstract":"Agarwood oil is well known for its unique scent and has many usages; as an incense, as ingredient in perfume, is burnt during religious ceremonies and is used in traditional medical preparation. Therefore, agarwood oil has high demand and is traded at different price based on its quality. Basically, the oil quality is classified by using physical properties (odor and color) and this technique has several problems: not consistent in term of accuracy. Thus, this study presented a new technique to classify the quality of agarwood oil based on chemical properties. The work focused on the k-Nearest Neighbor (k-NN) modelling by varying Mahalanobis and Correlation in distance metric for agarwood oil quality classification. It involved of 96 samples of agarwood oil, data pre-processing (data randomization, data normalization, and data division to testing and training datasets) and the development of k-NN model. The training dataset is used to train the k-NN model, and the testing dataset is used to test the developed model. During the model development, Mahalanobis and Correlation are varied in k-NN distance metric. The k-NN values are ranging from 1 to 10. Several performance criteria including resubstitution error (closs), cross-validation error (kloss) and accuracy were applied to measure the performance of the built k-NN model. All the analytical work was performed via MATLAB software version R2020a. The result showed that the accuracy of Mahalanobis distance metric has a better performance compared to Correlation from k=1 to k=5 with the value of 100.00%. This finding is important as it proved the capabilities of k-NN modelling in classifying the agarwood oil quality. Not limited to that, it also contributed to the agarwood oil research area as well as its industry.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijaas.v11.i3.pp242-252","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Agarwood oil is well known for its unique scent and has many usages; as an incense, as ingredient in perfume, is burnt during religious ceremonies and is used in traditional medical preparation. Therefore, agarwood oil has high demand and is traded at different price based on its quality. Basically, the oil quality is classified by using physical properties (odor and color) and this technique has several problems: not consistent in term of accuracy. Thus, this study presented a new technique to classify the quality of agarwood oil based on chemical properties. The work focused on the k-Nearest Neighbor (k-NN) modelling by varying Mahalanobis and Correlation in distance metric for agarwood oil quality classification. It involved of 96 samples of agarwood oil, data pre-processing (data randomization, data normalization, and data division to testing and training datasets) and the development of k-NN model. The training dataset is used to train the k-NN model, and the testing dataset is used to test the developed model. During the model development, Mahalanobis and Correlation are varied in k-NN distance metric. The k-NN values are ranging from 1 to 10. Several performance criteria including resubstitution error (closs), cross-validation error (kloss) and accuracy were applied to measure the performance of the built k-NN model. All the analytical work was performed via MATLAB software version R2020a. The result showed that the accuracy of Mahalanobis distance metric has a better performance compared to Correlation from k=1 to k=5 with the value of 100.00%. This finding is important as it proved the capabilities of k-NN modelling in classifying the agarwood oil quality. Not limited to that, it also contributed to the agarwood oil research area as well as its industry.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.