Shangqing Zhang, Yinglin Wang, D. Zhu, Jun Shi, Ruixin Zhang
{"title":"基于支持向量机和特征值控制的文本蕴涵识别","authors":"Shangqing Zhang, Yinglin Wang, D. Zhu, Jun Shi, Ruixin Zhang","doi":"10.1109/ICCP.2012.6356167","DOIUrl":null,"url":null,"abstract":"Recognizing Textual Entailment, as one of the branches of Nature Language Processing, has been widely adopted in Human Computer Interaction and Question Answering System. RTE problem is trying to build an intelligent system which can analyze the content of an input text (T), and then raises a hypothesis (H) inferred from that. My self-design RTE system, which is called SNRTE, combines lexical, syntax, and semantic 3 levels of analysis, under the support of NLP tools including Stemmer, Tokenize, Parser, POS Tag, Name Finder, WordNet2.1, and Support Vector Machine, etc. All these modules fetch useful information elements in the target text to define 49 feature values to train the system to make judgments by SVM. The training data is token from RTE official contest including 1600 pairs of tests and hypothesizes P(T,H). The average correct judgment rate is 67.5%, far above the average system correctness in RTE1 contest (55.12%) and better than the 2nd system (60.6%).","PeriodicalId":205738,"journal":{"name":"2012 IEEE International Conference on Computer Science and Automation Engineering","volume":"66 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Recognizing Textual Entailment with synthetic analysis based on SVM and feature value control\",\"authors\":\"Shangqing Zhang, Yinglin Wang, D. Zhu, Jun Shi, Ruixin Zhang\",\"doi\":\"10.1109/ICCP.2012.6356167\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recognizing Textual Entailment, as one of the branches of Nature Language Processing, has been widely adopted in Human Computer Interaction and Question Answering System. RTE problem is trying to build an intelligent system which can analyze the content of an input text (T), and then raises a hypothesis (H) inferred from that. My self-design RTE system, which is called SNRTE, combines lexical, syntax, and semantic 3 levels of analysis, under the support of NLP tools including Stemmer, Tokenize, Parser, POS Tag, Name Finder, WordNet2.1, and Support Vector Machine, etc. All these modules fetch useful information elements in the target text to define 49 feature values to train the system to make judgments by SVM. The training data is token from RTE official contest including 1600 pairs of tests and hypothesizes P(T,H). The average correct judgment rate is 67.5%, far above the average system correctness in RTE1 contest (55.12%) and better than the 2nd system (60.6%).\",\"PeriodicalId\":205738,\"journal\":{\"name\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"volume\":\"66 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCP.2012.6356167\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Computer Science and Automation Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCP.2012.6356167","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recognizing Textual Entailment with synthetic analysis based on SVM and feature value control
Recognizing Textual Entailment, as one of the branches of Nature Language Processing, has been widely adopted in Human Computer Interaction and Question Answering System. RTE problem is trying to build an intelligent system which can analyze the content of an input text (T), and then raises a hypothesis (H) inferred from that. My self-design RTE system, which is called SNRTE, combines lexical, syntax, and semantic 3 levels of analysis, under the support of NLP tools including Stemmer, Tokenize, Parser, POS Tag, Name Finder, WordNet2.1, and Support Vector Machine, etc. All these modules fetch useful information elements in the target text to define 49 feature values to train the system to make judgments by SVM. The training data is token from RTE official contest including 1600 pairs of tests and hypothesizes P(T,H). The average correct judgment rate is 67.5%, far above the average system correctness in RTE1 contest (55.12%) and better than the 2nd system (60.6%).