N. Priatna, A. Huda, Q. U. Safitri, W. Darmalaksana
{"title":"英语圣训翻译聚类文本的混合降维","authors":"N. Priatna, A. Huda, Q. U. Safitri, W. Darmalaksana","doi":"10.1109/ICWT47785.2019.8978239","DOIUrl":null,"url":null,"abstract":"Clustering results are strongly influenced by the selected technique and data dimensions. Large data dimensions become the main problem that must be considered. Therefore, a dimensional reduction is needed to select sub-feature that provides important information. One of the dimensions reduction methods is the hybrid method. The hybrid method combines the method of feature selection and feature extraction to select informative sub-feature. Furthermore, the simplest clustering technique is the k-means algorithm, which divides n data into k cluster based on the centroid. This study carried out clustering using the k-means algorithm after reducing dimensions on 892 English translation hadith documents. The clustering results are validated using the silhouette coefficient and Davies Bouldin index (DBI). Experimental results show that dimensional reduction can improve the cluster quality.","PeriodicalId":220618,"journal":{"name":"2019 IEEE 5th International Conference on Wireless and Telematics (ICWT)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid Reduction Dimension on Clustering Text of English Hadith Translation\",\"authors\":\"N. Priatna, A. Huda, Q. U. Safitri, W. Darmalaksana\",\"doi\":\"10.1109/ICWT47785.2019.8978239\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering results are strongly influenced by the selected technique and data dimensions. Large data dimensions become the main problem that must be considered. Therefore, a dimensional reduction is needed to select sub-feature that provides important information. One of the dimensions reduction methods is the hybrid method. The hybrid method combines the method of feature selection and feature extraction to select informative sub-feature. Furthermore, the simplest clustering technique is the k-means algorithm, which divides n data into k cluster based on the centroid. This study carried out clustering using the k-means algorithm after reducing dimensions on 892 English translation hadith documents. The clustering results are validated using the silhouette coefficient and Davies Bouldin index (DBI). Experimental results show that dimensional reduction can improve the cluster quality.\",\"PeriodicalId\":220618,\"journal\":{\"name\":\"2019 IEEE 5th International Conference on Wireless and Telematics (ICWT)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 5th International Conference on Wireless and Telematics (ICWT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWT47785.2019.8978239\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 5th International Conference on Wireless and Telematics (ICWT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWT47785.2019.8978239","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid Reduction Dimension on Clustering Text of English Hadith Translation
Clustering results are strongly influenced by the selected technique and data dimensions. Large data dimensions become the main problem that must be considered. Therefore, a dimensional reduction is needed to select sub-feature that provides important information. One of the dimensions reduction methods is the hybrid method. The hybrid method combines the method of feature selection and feature extraction to select informative sub-feature. Furthermore, the simplest clustering technique is the k-means algorithm, which divides n data into k cluster based on the centroid. This study carried out clustering using the k-means algorithm after reducing dimensions on 892 English translation hadith documents. The clustering results are validated using the silhouette coefficient and Davies Bouldin index (DBI). Experimental results show that dimensional reduction can improve the cluster quality.