{"title":"基于多任务套索的词对关系相似度测量","authors":"Dongbin Yan, Zhao Lu","doi":"10.1109/CSC.2012.35","DOIUrl":null,"url":null,"abstract":"Relational similarity measurement as a popular research area in the field of natural language processing, is widely used in information retrieval, word sense disambiguation, machine translation and so on. The existing approaches are mostly based on extracting semantic features as feature matrixes from the large-scale corpus and using the corresponding method to process these feature matrixes to compute the relational similarity between word-pairs. However, the extracted semantic features are loosely distributed, which make the sparseness of feature matrixes. This paper proposes a Multi-Task Lasso based Relational similarity measure method (MTLRel), which makes snippets retrieved from a web search engine as the semantic information sources of a word-pair, then builds the feature matrix by extracting predefined patterns from snippets, compress and denoise the feature matrix into a feature vector using a multi-task lasso method, finally measures the relational similarity between two word-pairs by computing the cosine of the angle between two feature vectors. The MTLRel approach achieves an accuracy rate of 50.3% by testing 374 SAT analogy questions with lower time consumption.","PeriodicalId":183800,"journal":{"name":"2012 International Conference on Cloud and Service Computing","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Relational Similarity Measurement between Word-pairs Using Multi-Task Lasso\",\"authors\":\"Dongbin Yan, Zhao Lu\",\"doi\":\"10.1109/CSC.2012.35\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Relational similarity measurement as a popular research area in the field of natural language processing, is widely used in information retrieval, word sense disambiguation, machine translation and so on. The existing approaches are mostly based on extracting semantic features as feature matrixes from the large-scale corpus and using the corresponding method to process these feature matrixes to compute the relational similarity between word-pairs. However, the extracted semantic features are loosely distributed, which make the sparseness of feature matrixes. This paper proposes a Multi-Task Lasso based Relational similarity measure method (MTLRel), which makes snippets retrieved from a web search engine as the semantic information sources of a word-pair, then builds the feature matrix by extracting predefined patterns from snippets, compress and denoise the feature matrix into a feature vector using a multi-task lasso method, finally measures the relational similarity between two word-pairs by computing the cosine of the angle between two feature vectors. The MTLRel approach achieves an accuracy rate of 50.3% by testing 374 SAT analogy questions with lower time consumption.\",\"PeriodicalId\":183800,\"journal\":{\"name\":\"2012 International Conference on Cloud and Service Computing\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 International Conference on Cloud and Service Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSC.2012.35\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Cloud and Service Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSC.2012.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Relational Similarity Measurement between Word-pairs Using Multi-Task Lasso
Relational similarity measurement as a popular research area in the field of natural language processing, is widely used in information retrieval, word sense disambiguation, machine translation and so on. The existing approaches are mostly based on extracting semantic features as feature matrixes from the large-scale corpus and using the corresponding method to process these feature matrixes to compute the relational similarity between word-pairs. However, the extracted semantic features are loosely distributed, which make the sparseness of feature matrixes. This paper proposes a Multi-Task Lasso based Relational similarity measure method (MTLRel), which makes snippets retrieved from a web search engine as the semantic information sources of a word-pair, then builds the feature matrix by extracting predefined patterns from snippets, compress and denoise the feature matrix into a feature vector using a multi-task lasso method, finally measures the relational similarity between two word-pairs by computing the cosine of the angle between two feature vectors. The MTLRel approach achieves an accuracy rate of 50.3% by testing 374 SAT analogy questions with lower time consumption.