Dan Liang , Yongjiu Feng , Liang Hong , Xiaohua Tong
{"title":"Longest similar subsequence-modified Hausdorff distance: A new robust approach for measuring distances between linear features","authors":"Dan Liang , Yongjiu Feng , Liang Hong , Xiaohua Tong","doi":"10.1016/j.eswa.2025.127380","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately measuring the distance between linear features is crucial for understanding the similarity between objects. Traditional measures such as the Hausdorff distance (<em>HD</em>) are inadequate in handling length variations and outliers among linear features, and thus reducing the accuracy of similarity assessment. In this paper, we propose a new concept termed Longest Similar Subsequence (<em>LSS</em>), which aims to identify the longest overlapping subsequence of paired linear features. By integrating <em>LSS</em> with a modified Hausdorff Distance (<em>LSS-HD</em>), we propose a matching method that is based on <em>LSS-HD</em> and adaptive threshold-selection. The adaptive threshold-selection method, named the Connected Bin Threshold Selection Algorithm (<em>CBTSA</em>), is based on the distance histogram analysis. Experimental results clearly demonstrate that the <em>LSS</em>-based <em>HD</em> measure is more robust compared to the <em>HD</em> measure without <em>LSS</em> constraints. In the road matching experiments, our matching accuracy, with an F1-score of over 0.96, is significantly higher than that of the matching algorithms (Opt and OILRM), which are also solely based on distance-similarity metrics.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"278 ","pages":"Article 127380"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425010024","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately measuring the distance between linear features is crucial for understanding the similarity between objects. Traditional measures such as the Hausdorff distance (HD) are inadequate in handling length variations and outliers among linear features, and thus reducing the accuracy of similarity assessment. In this paper, we propose a new concept termed Longest Similar Subsequence (LSS), which aims to identify the longest overlapping subsequence of paired linear features. By integrating LSS with a modified Hausdorff Distance (LSS-HD), we propose a matching method that is based on LSS-HD and adaptive threshold-selection. The adaptive threshold-selection method, named the Connected Bin Threshold Selection Algorithm (CBTSA), is based on the distance histogram analysis. Experimental results clearly demonstrate that the LSS-based HD measure is more robust compared to the HD measure without LSS constraints. In the road matching experiments, our matching accuracy, with an F1-score of over 0.96, is significantly higher than that of the matching algorithms (Opt and OILRM), which are also solely based on distance-similarity metrics.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.