Deep neural network based learning to rank for address standardization

2021 RIVF International Conference on Computing and Communication Technologies (RIVF) Pub Date : 2021-08-19 DOI:10.1109/RIVF51545.2021.9642079

Hai Cao, Viet-Trung Tran

引用次数: 1

Abstract

Address standardization is the process of converting and mapping free-form addresses into a standard structured format. For many business cases, the addresses are entered into the information systems by end-users. They are often noisy, uncompleted, and in different formatted styles. In this paper, we propose a deep learning-based approach to the address standardization challenge. Our key idea is to leverage a Siamese neural network model to embed raw inputs and standardized addresses into a single latent multi-dimensional space. Thus, the corresponding of the raw input address is the one with the highest-ranking score. Our experiments demonstrate that our best model achieved 95.41% accuracy, which is 6.6% improvement from the current state of the art.

查看原文本刊更多论文

基于深度神经网络学习的地址排序标准化

地址标准化是将自由格式地址转换和映射为标准结构化格式的过程。对于许多业务案例，地址由最终用户输入到信息系统中。它们通常是嘈杂的、未完成的，并且格式风格不同。在本文中，我们提出了一种基于深度学习的方法来解决标准化挑战。我们的关键思想是利用暹罗神经网络模型将原始输入和标准化地址嵌入到单个潜在的多维空间中。因此，原始输入地址对应的是排名得分最高的地址。我们的实验表明，我们的最佳模型达到了95.41%的准确率，比目前的技术水平提高了6.6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 RIVF International Conference on Computing and Communication Technologies (RIVF)

自引率

0.00%

发文量