{"title":"Data","authors":"B. Ruddell","doi":"10.1017/9781108779678.008","DOIUrl":null,"url":null,"abstract":": [ Objective ] This paper analyzes the popular text similarity measures and discusses their latest developments. [ Coverage ] We retrieved 69 key articles from CNKI and Web of Science databases by searching “TI: ‘text similarity’ or ‘semantic similarity’ or ‘lexical similarity’ ” in Chinese and English respectively. [ Methods ] We systematically reviewed the text similarity measures focusing on their basic concepts, characteristics and future directions. [ Results ] There were four types of text similarity measures: String-based, Corpus-based, Knowledge-based and others. Measures based on the neural network, Knowledge-based measures and inter-disciplinary measures could be the future research directions. [ Limitations ] We did not discuss the applications of those measures. [ Conclusions ] This paper is a comprehensive review of text similarity measure research.","PeriodicalId":349969,"journal":{"name":"The Food-Energy-Water Nexus","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Food-Energy-Water Nexus","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/9781108779678.008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
: [ Objective ] This paper analyzes the popular text similarity measures and discusses their latest developments. [ Coverage ] We retrieved 69 key articles from CNKI and Web of Science databases by searching “TI: ‘text similarity’ or ‘semantic similarity’ or ‘lexical similarity’ ” in Chinese and English respectively. [ Methods ] We systematically reviewed the text similarity measures focusing on their basic concepts, characteristics and future directions. [ Results ] There were four types of text similarity measures: String-based, Corpus-based, Knowledge-based and others. Measures based on the neural network, Knowledge-based measures and inter-disciplinary measures could be the future research directions. [ Limitations ] We did not discuss the applications of those measures. [ Conclusions ] This paper is a comprehensive review of text similarity measure research.