{"title":"词与句子相似度分析方法的比较研究","authors":"Farooq Ahmad, Mohd. Faisal","doi":"10.1109/INDIACom51348.2021.00107","DOIUrl":null,"url":null,"abstract":"This study is intended to analyze the methods used to test resemblance of sentences. For many Natural Language Processing applications such as text grouping, information recovery, brief reaction reviewing, machine learning, passage summary and text categorization, measuring resemblance between sentences is a vital activity. In this paper, we classify the approaches to measuring the resemblance of sentences based on the methods implemented into three groups. The most frequently used methods to finding phrase resemblance are word-to-word based, structure-based, and vector-based. Centered on a particular viewpoint, each approach tests the interaction between short texts. Furthermore, to provide a full view of this problem, datasets that are often used as benchmarks for testing techniques in this field are added. Better outcomes are obtained through methods that incorporate more than one viewpoint. In addition, resemblance of sentences is based on the correspondence of their meanings that tests the semantic resemblance between two concepts, words or sentences needs further research.","PeriodicalId":415594,"journal":{"name":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparative Study of Techniques used for Word and Sentence Similarity\",\"authors\":\"Farooq Ahmad, Mohd. Faisal\",\"doi\":\"10.1109/INDIACom51348.2021.00107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study is intended to analyze the methods used to test resemblance of sentences. For many Natural Language Processing applications such as text grouping, information recovery, brief reaction reviewing, machine learning, passage summary and text categorization, measuring resemblance between sentences is a vital activity. In this paper, we classify the approaches to measuring the resemblance of sentences based on the methods implemented into three groups. The most frequently used methods to finding phrase resemblance are word-to-word based, structure-based, and vector-based. Centered on a particular viewpoint, each approach tests the interaction between short texts. Furthermore, to provide a full view of this problem, datasets that are often used as benchmarks for testing techniques in this field are added. Better outcomes are obtained through methods that incorporate more than one viewpoint. In addition, resemblance of sentences is based on the correspondence of their meanings that tests the semantic resemblance between two concepts, words or sentences needs further research.\",\"PeriodicalId\":415594,\"journal\":{\"name\":\"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIACom51348.2021.00107\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIACom51348.2021.00107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative Study of Techniques used for Word and Sentence Similarity
This study is intended to analyze the methods used to test resemblance of sentences. For many Natural Language Processing applications such as text grouping, information recovery, brief reaction reviewing, machine learning, passage summary and text categorization, measuring resemblance between sentences is a vital activity. In this paper, we classify the approaches to measuring the resemblance of sentences based on the methods implemented into three groups. The most frequently used methods to finding phrase resemblance are word-to-word based, structure-based, and vector-based. Centered on a particular viewpoint, each approach tests the interaction between short texts. Furthermore, to provide a full view of this problem, datasets that are often used as benchmarks for testing techniques in this field are added. Better outcomes are obtained through methods that incorporate more than one viewpoint. In addition, resemblance of sentences is based on the correspondence of their meanings that tests the semantic resemblance between two concepts, words or sentences needs further research.