{"title":"基于多层随机漫步的中文问题分类","authors":"Kepei Zhang, Jieyu Zhao","doi":"10.1109/ICICISYS.2010.5658460","DOIUrl":null,"url":null,"abstract":"Question classification is crucial for the automatically question answering. And Random Walk is a promising approach for semi-supervised learning problems of learning from labeled and unlabeled data. Given a set of points, some of them are labeled, and the remaining points are unlabeled, the goal is to predict the labels of the unlabeled points. Since labeling often requires expensive human labor, whereas unlabelled data is easier to obtain, semi-supervised learning is very useful in many real-world problems, such as text classification. Here we proposed an approach for Chinese question Classification using Multilevel Random Walk (MRK), which is an improvement of random walk. In this paper, we selected four kinds of features (words, pos, named entity, semantic) to present Chinese questions, and carried out experiments to validate the method on a large-scale real-world dataset.","PeriodicalId":339711,"journal":{"name":"2010 IEEE International Conference on Intelligent Computing and Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Chinese question Classification using Multilevel Random Walk\",\"authors\":\"Kepei Zhang, Jieyu Zhao\",\"doi\":\"10.1109/ICICISYS.2010.5658460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Question classification is crucial for the automatically question answering. And Random Walk is a promising approach for semi-supervised learning problems of learning from labeled and unlabeled data. Given a set of points, some of them are labeled, and the remaining points are unlabeled, the goal is to predict the labels of the unlabeled points. Since labeling often requires expensive human labor, whereas unlabelled data is easier to obtain, semi-supervised learning is very useful in many real-world problems, such as text classification. Here we proposed an approach for Chinese question Classification using Multilevel Random Walk (MRK), which is an improvement of random walk. In this paper, we selected four kinds of features (words, pos, named entity, semantic) to present Chinese questions, and carried out experiments to validate the method on a large-scale real-world dataset.\",\"PeriodicalId\":339711,\"journal\":{\"name\":\"2010 IEEE International Conference on Intelligent Computing and Intelligent Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Intelligent Computing and Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICISYS.2010.5658460\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Intelligent Computing and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICISYS.2010.5658460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Chinese question Classification using Multilevel Random Walk
Question classification is crucial for the automatically question answering. And Random Walk is a promising approach for semi-supervised learning problems of learning from labeled and unlabeled data. Given a set of points, some of them are labeled, and the remaining points are unlabeled, the goal is to predict the labels of the unlabeled points. Since labeling often requires expensive human labor, whereas unlabelled data is easier to obtain, semi-supervised learning is very useful in many real-world problems, such as text classification. Here we proposed an approach for Chinese question Classification using Multilevel Random Walk (MRK), which is an improvement of random walk. In this paper, we selected four kinds of features (words, pos, named entity, semantic) to present Chinese questions, and carried out experiments to validate the method on a large-scale real-world dataset.