Guo-Zhu Wen, Xiaolian Guo, De-shuang Huang, KunHong Liu
{"title":"自组织映射在气溶胶单粒子数据聚类中的应用","authors":"Guo-Zhu Wen, Xiaolian Guo, De-shuang Huang, KunHong Liu","doi":"10.1109/IJCNN.2007.4371093","DOIUrl":null,"url":null,"abstract":"In this paper, self-organizing map (SOM) is used to visualize and cluster the data set of aerosol single particle mass spectrum, which was collected by aerosol time-of-flight mass spectrometry (ATOFMS). In view of the characteristic feature of aerosol particle data, the TF-IDF scheme used widely in document clustering is employed to preprocess. Subsequently for data clustering analysis, a two-level clustering framework is proposed, wherein SOM is firstly used to cluster input data and get the primary results, and then the results are again clustered by semiautomatic k-means algorithm. In order to demonstrate the validity of clustering, the chemical significance for cluster centroid is also investigated, wherein inorganic salts, \"calcium-containing\" particles, biogenic soot particles, and carbonaceous particles etc. are identified.","PeriodicalId":350091,"journal":{"name":"2007 International Joint Conference on Neural Networks","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Application of Self-Organizing Map in Aerosol Single Particles Data Clustering\",\"authors\":\"Guo-Zhu Wen, Xiaolian Guo, De-shuang Huang, KunHong Liu\",\"doi\":\"10.1109/IJCNN.2007.4371093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, self-organizing map (SOM) is used to visualize and cluster the data set of aerosol single particle mass spectrum, which was collected by aerosol time-of-flight mass spectrometry (ATOFMS). In view of the characteristic feature of aerosol particle data, the TF-IDF scheme used widely in document clustering is employed to preprocess. Subsequently for data clustering analysis, a two-level clustering framework is proposed, wherein SOM is firstly used to cluster input data and get the primary results, and then the results are again clustered by semiautomatic k-means algorithm. In order to demonstrate the validity of clustering, the chemical significance for cluster centroid is also investigated, wherein inorganic salts, \\\"calcium-containing\\\" particles, biogenic soot particles, and carbonaceous particles etc. are identified.\",\"PeriodicalId\":350091,\"journal\":{\"name\":\"2007 International Joint Conference on Neural Networks\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 International Joint Conference on Neural Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IJCNN.2007.4371093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Joint Conference on Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2007.4371093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Self-Organizing Map in Aerosol Single Particles Data Clustering
In this paper, self-organizing map (SOM) is used to visualize and cluster the data set of aerosol single particle mass spectrum, which was collected by aerosol time-of-flight mass spectrometry (ATOFMS). In view of the characteristic feature of aerosol particle data, the TF-IDF scheme used widely in document clustering is employed to preprocess. Subsequently for data clustering analysis, a two-level clustering framework is proposed, wherein SOM is firstly used to cluster input data and get the primary results, and then the results are again clustered by semiautomatic k-means algorithm. In order to demonstrate the validity of clustering, the chemical significance for cluster centroid is also investigated, wherein inorganic salts, "calcium-containing" particles, biogenic soot particles, and carbonaceous particles etc. are identified.