Jun-Hui Her, Sunghae Jun, Jun-Hyeog Choi, Jung-Hyun Lee
{"title":"动态网页文档聚类的贝叶斯神经网络模型","authors":"Jun-Hui Her, Sunghae Jun, Jun-Hyeog Choi, Jung-Hyun Lee","doi":"10.1109/TENCON.1999.818696","DOIUrl":null,"url":null,"abstract":"There has been lots of research to improve the precision of IR system. These research have been studied on the document ranking, user profiles, relevance feedback and the information processing that includes document classification, clustering, routing and filtering. This paper proposes and incarnates method of neural approach about the information processing which makes users can search documents effectively and of the document clustering. In this paper the system calculates entropy between the query, the profile and the each of the web documents each other; and clusters documents using the calculated entropy as the value of the clustering variable through SOM. As the Bayesian Neural Network model has high classification accuracy with a rapid learning speed and clustering, it is possible that dynamic document clustering as it was combined with Bayesian probability model used in real-time document classification. We used KTSET which is a test collection to evaluate Korean IR system for the experiment.","PeriodicalId":121142,"journal":{"name":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A Bayesian neural network model for dynamic web document clustering\",\"authors\":\"Jun-Hui Her, Sunghae Jun, Jun-Hyeog Choi, Jung-Hyun Lee\",\"doi\":\"10.1109/TENCON.1999.818696\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There has been lots of research to improve the precision of IR system. These research have been studied on the document ranking, user profiles, relevance feedback and the information processing that includes document classification, clustering, routing and filtering. This paper proposes and incarnates method of neural approach about the information processing which makes users can search documents effectively and of the document clustering. In this paper the system calculates entropy between the query, the profile and the each of the web documents each other; and clusters documents using the calculated entropy as the value of the clustering variable through SOM. As the Bayesian Neural Network model has high classification accuracy with a rapid learning speed and clustering, it is possible that dynamic document clustering as it was combined with Bayesian probability model used in real-time document classification. We used KTSET which is a test collection to evaluate Korean IR system for the experiment.\",\"PeriodicalId\":121142,\"journal\":{\"name\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"volume\":\"174 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1999-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.1999.818696\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE. IEEE Region 10 Conference. TENCON 99. 'Multimedia Technology for Asia-Pacific Information Infrastructure' (Cat. No.99CH37030)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.1999.818696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Bayesian neural network model for dynamic web document clustering
There has been lots of research to improve the precision of IR system. These research have been studied on the document ranking, user profiles, relevance feedback and the information processing that includes document classification, clustering, routing and filtering. This paper proposes and incarnates method of neural approach about the information processing which makes users can search documents effectively and of the document clustering. In this paper the system calculates entropy between the query, the profile and the each of the web documents each other; and clusters documents using the calculated entropy as the value of the clustering variable through SOM. As the Bayesian Neural Network model has high classification accuracy with a rapid learning speed and clustering, it is possible that dynamic document clustering as it was combined with Bayesian probability model used in real-time document classification. We used KTSET which is a test collection to evaluate Korean IR system for the experiment.