Guo-Zhu Wen, Xiaolian Guo, De-shuang Huang, KunHong Liu
{"title":"Application of Self-Organizing Map in Aerosol Single Particles Data Clustering","authors":"Guo-Zhu Wen, Xiaolian Guo, De-shuang Huang, KunHong Liu","doi":"10.1109/IJCNN.2007.4371093","DOIUrl":null,"url":null,"abstract":"In this paper, self-organizing map (SOM) is used to visualize and cluster the data set of aerosol single particle mass spectrum, which was collected by aerosol time-of-flight mass spectrometry (ATOFMS). In view of the characteristic feature of aerosol particle data, the TF-IDF scheme used widely in document clustering is employed to preprocess. Subsequently for data clustering analysis, a two-level clustering framework is proposed, wherein SOM is firstly used to cluster input data and get the primary results, and then the results are again clustered by semiautomatic k-means algorithm. In order to demonstrate the validity of clustering, the chemical significance for cluster centroid is also investigated, wherein inorganic salts, \"calcium-containing\" particles, biogenic soot particles, and carbonaceous particles etc. are identified.","PeriodicalId":350091,"journal":{"name":"2007 International Joint Conference on Neural Networks","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Joint Conference on Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2007.4371093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, self-organizing map (SOM) is used to visualize and cluster the data set of aerosol single particle mass spectrum, which was collected by aerosol time-of-flight mass spectrometry (ATOFMS). In view of the characteristic feature of aerosol particle data, the TF-IDF scheme used widely in document clustering is employed to preprocess. Subsequently for data clustering analysis, a two-level clustering framework is proposed, wherein SOM is firstly used to cluster input data and get the primary results, and then the results are again clustered by semiautomatic k-means algorithm. In order to demonstrate the validity of clustering, the chemical significance for cluster centroid is also investigated, wherein inorganic salts, "calcium-containing" particles, biogenic soot particles, and carbonaceous particles etc. are identified.